冯懿,左德承,张展,周海鹰,杨孝宗.事务处理型容错计算机可用性评测系统设计与实现[J].高技术通讯(中文),2012,22(9):912~917 |
事务处理型容错计算机可用性评测系统设计与实现 |
Design and implementation of an availability assessment system for transaction processing oriented fault tolerant computers |
修订日期:2011-07-22 |
DOI: |
中文关键词: 容错, 事务处理, 可用性评测, 故障注入 |
英文关键词: fault tolerant, transaction processing, availability assessment, fault injection |
基金项目:863计划(2008AA01A204)资助项目 |
作者 | 单位 | 冯懿 | 哈尔滨工业大学计算机科学与技术学院 | 左德承 | 哈尔滨工业大学计算机科学与技术学院 | 张展 | 哈尔滨工业大学计算机科学与技术学院 | 周海鹰 | 哈尔滨工业大学计算机科学与技术学院 | 杨孝宗 | 哈尔滨工业大学计算机科学与技术学院 |
|
摘要点击次数: 3003 |
全文下载次数: 2490 |
中文摘要: |
针对事务处理型容错计算机可用性测试中存在的目标系统数量少、测试时长有限的问题,设计了一种可用性评测方法,并实现了相应的可用性评测系统,用于评测事务处理型容错计算机的可用性指标。评测系统由多层次故障注入平台、模拟应用负载、可用性评测套件组成。多层次故障注入平台实现自动化故障注入,针对目标系统施加多批量、多种类故障负载;模拟应用负载能够针对目标系统施加事务处理型工作负载;可用性评测套件用于测试目标系统中各功能子系统和现场可更换部件(FRU)的可靠性连接关系,测试各类FRU的冗余程度,测试各类FRU的平均修复时间,以及测试验证目标系统是否满足指定的可用性设计要求。针对HP Superdome容错服务器进行的评测结果与官方文档一致,证明了该评测系统的有效性。本研究对于事务处理型容错计算机研制商预测系统可用性以及终端用户应用选型具有重要作用。 |
英文摘要: |
To overcome the limitation in sample system number and test period during the availability test for a transaction processing oriented fault tolerant computer, an availability assessment method was proposed and a corresponding assessment system was realized. The availability assessment system consists of a multi level fault injection platform, an application workloads simulator and an availability assessment toolkit. The fault injection platform is designed for automatically injecting various fault loads into target systems in batches. The application workloads simulator can generate transactions launched by end users and send them to target systems as workloads. The availability assessment toolkit is designed for several tests, including reliability relationship test among functional subsystems, reliability relationship test among field replaceable units (FRUs), redundancy test of different kind of FRUs, mean time to recovery (MTTR) test, and availability validation test. The evaluation results of the tests on HP Superdome fault tolerant server accord with official documents, which proves the effectiveness of the assessment system. This research is important for computer manufacturers to predict availability metric and it is also important for end users to verify system availability. |
查看全文
查看/发表评论 下载PDF阅读器 |
关闭 |
|
|
|