面向大规模矩阵乘法的编码计算性能研究
作者:
作者单位:

作者简介:

王艳(1982—),副教授,博士,研究方向为分布式存储、分布式机器学习。E-mail:313624307@qq.com。

通讯作者:

中图分类号:

TP391

基金项目:

国家自然科学基金项目(61402172);江西省自然科学基金项目(20192BAB217006)


Research on the Performance of Coding Calculation for Large-Scale Matrix Multiplication
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为了更好地使用编码计算提高分布式机器学习算法运行效率,需要对大规模矩阵乘法的编码计算方案性能开销进行充分的研究。 该文考察了面向大规模矩阵乘法的编码计算方案的任务完成时间,同时也考虑了所有参与分布式计算的节点总的计算开销,给出了各个工作节点完成计算任务的时间,均匀分布场景下总的任务完成时间和集群机器总的计算时间的表达式, 对比分析了 3 种编码方案的性能,并通过实验对比了不同情况对任务完成时间与计算节点总计算开销影响,提出了一个启发式算法,提供了不同编码计算方案的选择依据。

    Abstract:

    With the growth of machine learning algorithm models and data sets, a single node cannot effectively bear the computing and storage requirements required for large -scale training. A common solution is to run large-scale machine learning algorithms on distributed clusters. However, the performance of distributed clusters is significantly affected by stragglers. In recent studies, researchers have used coding calculations to solve the straggler problem, but the performance of coding calculation schemes for large-scale matrix multiplication has not been fully studied and analyzed. This paper examines the task completion time of the coding calculation scheme for large-scale matrix multiplication, and considers the total calculation overhead of all nodes participating in distributed computing. The expression of the task completion time for each working node to complete the calculation task according with the total time under the uniform distribution scenario and the total computing time of the cluster machines is given. The performance of the three coding schemes is compared and analyzed. The effects of different situations on the task completion time and the total computing cost of the computing node are compared through experiments, and a heuristic algorithm is proposed to provide the basis for the selection of different coding calculation schemes.

    参考文献
    相似文献
    引证文献
引用本文

王艳,王希龄,赖宏达,李念爽.面向大规模矩阵乘法的编码计算性能研究[J].华东交通大学学报,2021,38(3):41-51.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2021-08-02
  • 出版日期: