Engineering Journal: Science and InnovationELECTRONIC SCIENCE AND ENGINEERING PUBLICATION
Certificate of Registration Media number Эл #ФС77-53688 of 17 April 2013. ISSN 2308-6033. DOI 10.18698/2308-6033
  • Русский
  • Английский
Article

Issues of organizing computations in multicomputer systems with the software-controlled failure- and fault-tolerance. Part III

Published: 17.08.2021

Authors: Asharina I.V.

Published in issue: #8(116)/2021

DOI: 10.18698/2308-6033-2021-8-2106

Category: Aviation and Rocket-Space Engineering | Chapter: Innovation Technologies of Aerospace Engineering

This three-part paper analyzes existing approaches and methods of organizing failure- and fault-tolerant computing in distributed multicomputer systems (DMCS), identifies and provides rationale for a list of issues to be solved. We review the application areas of failure- and fault- tolerant control systems for complex network and distributed objects. The third part proceeds with the study of the problems of organizing failure- and fault-tolerant computing in distributed multicomputer systems (DMCS), carried out in parts I and II of this work, and deals with the issues related to the diagnosis of multiple faults. The paper describes the main differences in ensuring fault tolerance in systems with broadcast communication channels and point-to-point communication channels.


References
[1] Asharina I.V. Inzhenerny zhurnal: nauka i innovatsii — Engineering Journal: Science and Innovation, 2021, iss. 7. http://dx.doi.org/10.18698/2308-6033-2021-7-2097
[2] Vedeshenkov V.A. Avtomatika i telemekhanika — Automation and Remote Control, 2003, no. 4, pp. 114–122.
[3] Karavay M.F. Avtomatika i telemekhanika — Automation and Remote Control, 2000, no. 1, pp. 144–156.
[4] Vedeshenkov V.A. Avtomatika i telemekhanika — Automation and Remote Control, 2014, no. 9, pp. 133–143.
[5] Preparata F.P., Metze G., Chien R.J. On Connection Assignement Problem of Diagnosable Systems. IEEE Trans. El. Comput., 1967, vol. EC-16, no. 12, pp. 848–854.
[6] Vedeshenkov V.A., Kurako E.A., Lebedev V.N. Avtomatika i telemekhanika — Automation and Remote Control, 2016, no. 3, pp. 152–165.
[7] Karavay M.F., Parkhomenko P.P., Podlazov V.S. Avtomatika i telemekhanika —Automation and Remote Control, 2009, no. 2, pp. 153–170.
[8] Vedeshenkov V.A. Problemy upravleniya — Control Sciences, 2009, no. 6, pp. 59–67.
[9] Vedeshenkov V.A. Avtomatika i telemekhanika — Automation and Remote Control, 2009, no. 11, pp. 161–171.
[10] Barsi F., Grandoni F., Maestrini P. A theory of diagnosability of digital systems. IEEE Trans. Comput., 1976, vol. C-25, no. 6, pp. 585–593.
[11] Vedeshenkov V.A. Avtomatika i telemekhanika — Automation and Remote Control, 2005, no. 3, pp. 154–168.
[12] Parkhomenko P.P. Avtomatika i telemekhanika — Automation and Remote Control, 1999, no. 5, pp. 126–134.
[13] Vedeshenkov V.A., Nesterov A.M. Elektronnoe modelirovanie — Engineering Simulation, 1981, vol. 3, no. 2, pp. 53–58.
[14] Karavay M.F., Podlazov V.S. Upravlenie bolshimi sistemami — Large-Scale Systems Control, no. 34. Moscow, Trapeznikov Institute of Control Sciences of Russian Academy of Sciences Publ., 2011, pp. 92–116.
[15] Imbs D., Mostefaoui A., Perrin M., Raynal M. Set-Constrained Delivery Broadcast: Definition, Abstraction Power, and Computability Limits. In: Bellavista P., Garg V.K., eds. Proceedings of the 19th International Conference on Distributed Computing and Networking, ICDCN 2018. Varanasi, India, January 4−7, 2018, pp. 7:1−7:10. ACM, 2018. DOI: 10.1145/3154273.3154296
[16] Auvolat A., Raynal M., Taïani F. Byzantine-Tolerant Set-Constrained Delivery Broadcast. Proceedings of the 23rd International Conference on Principles of Distributed Systems, OPODIS–2019. December 17–19, 2019, University of Neuchâtel, Neuchâtel, Switzerland. Leibniz, Leibniz International Proceedings in Informatics, 2019, article no. 16. DOI: 10.4230/LIPIcs.OPODIS.2019.16
[17] Grishin V.Yu., Lobanov A.V., Sirenko V.G. Avtomatika i telemekhanika —Automation and Remote Control, 2003, no. 4, pp. 123–132.
[18] Lobanov A.V. Avtomatika i telemekhanika — Automation and Remote Control, 2003, no. 6, pp. 175–185.
[19] Pease M., Shostak R., Lamport L. Reaching agreement in the presence of faults. J. ACM, 1980, vol. 27, no. 2, pp. 228–234.
[20] Lamport L., Shostak R., Pease M. The byzantine generals problem. ACM Trans. Progr. Lang. Syst., 1982, vol. 4, no. 3, pp. 382–401.
[21] Asharina I.V., Lobanov A.V., Mischenko I.G. Avtomatika i telemekhanika — Automation and Remote Control, 2003, no. 5, pp. 190−198.
[22] Dolev D., Dwork C., Stockmeyer L. On the minimal synchronics needed for distributed consensus. Proc. 24th Symp. on Foundationcs of Computer Science. USA, 1983, pp. 393−402.
[23] Lobanov A.V., Sirenko V.G. Obrazovatelnye resursy i tekhnologii — Educational Resources and Technologies, 2014, no. 2 (5), pp. 115−121.