In this thesis, a method of rollback recovery in an operating system is presented and an approach to coordination method of recovering concurrent processes for fault tolerant processing is designed and implemented in an operating system. Hardware reliability is improved greatly with the advent of the semiconductor technology but software faults have become increasingly prevalent with the steadily increasing size and complexity of software system. This thesis adopts an approach of rollback with recovery block for the concurrent real-time system. Recovery function in the operating system supports structured incorporation of processings by special recovering actions. The operating system includes several functions to make recovery blocks and to manage concurrent processes. The cost of the rollback recovery scheme is measured by processor time and memory space which are required for saving process states and the results are discussed.
하드웨어의 신뢰도가 향상되고 있는데 반하여, 소프트웨어는 그 크기와 복잡성이 증가함으로 인해 fault가 점점 증가되고 있다. 본 논문에서는 소프트웨어 fault가 발생했을 때 concurrent 프로세스 시스템에 있어서의 rollback recovery 방법이 운영체제 내에서 구현되었다.
각 프로세스는 recovery block 들로 구성이 되어 rollback recovery가 수행되어 진다. recovery block을 구성하고 rollback이 발생했을 때 concurrent 프로세스들을 관리하는 기능이 운영체제 내에 포함된다. 프로세스 상태를 주기억장치 내에 저장함으로써 빠른 응답시간을 얻을 수 있었다.