
[1] Wang Yun, Wang Junling. Load balancing framework for actively replicated servers [J]. Journal of Southeast University (English Edition), 2005, 21 (4): 419-426. [doi:10.3969/j.issn.1003-7985.2005.04.009]

Load balancing framework for actively replicated servers
一种面向主动复制服务器的负载平衡框架

Journal of Southeast University (English Edition) [ISSN: 1003-7985 / CN: 32-1325/N]

Volume:
21
Issue:
4
Page:
419-426
Research Field:
Computer Science and Engineering
Publishing date:
2005-12-30

Info

Title:
Load balancing framework for actively replicated servers
一种面向主动复制服务器的负载平衡框架
Author(s):
Wang Yun (汪芸), Wang Junling (王俊岭)
Department of Computer Science and Engineering, Southeast University, Nanjing 210096, China
Keywords:
load balancing; fault tolerance; framework; task scheduler group
CLC number:
TP391
DOI:
10.3969/j.issn.1003-7985.2005.04.009
Abstract:
This paper addresses the problem of improving the robustness and the efficiency of a distributed system at the same time, using fault tolerance based on active replication together with load balancing. The strengths and weaknesses of both techniques are analyzed, and a novel load balancing framework for fault-tolerant systems with active replication is presented. Its hierarchical architecture is described in detail. The framework can dynamically adjust the number of fault-tolerant groups and the number of members in each group according to system load. Three candidate methods for selecting the task scheduler group are proposed and evaluated in simulation. Analysis of the simulation data yields observations useful for distributed system design, including the effects of task arrival intensity and task set size, and the relationship between the execution time of a single task and that of the whole task set.


Memo

Memo:
Biography: Wang Yun (1967-), female, Ph.D., professor, yunwang@seu.edu.cn.
Last Update: 2005-12-20