The Hidden Vulnerability of Distributed Learning in Byzantium

资源分类

2020-03-19 |

57 |

35 |

Abstract

While machine learning is going through an era of celebrated success, concerns have been raised about the vulnerability of its backbone: stochastic gradient descent (SGD). Recent approaches have been proposed to ensure the robustness of distributed SGD against adversarial (Byzantine) workers sending poisoned gradients during the training phase. Some of these approaches have been proven Byzantine–resilient: they ensure the convergence of SGD despite the presence of a minority of adversarial workers. We show in this paper that convergence is not enough. In high dimension d 图片.png 1, an adversary can build on the loss function’s non–convexity to make SGD converge to ineffective models. More precisely, we bring to light that existing Byzantine–resilien schemes leave a margin of poisoning of (f (d)) where f (d) increases at least like . Based on this leeway, we build a simple attack, and experimentally show its strong to utmost effectivity on CIFAR–10 and MNIST. We introduce Bulyan, and prove it significantly reduces the at- tacker’s leeway to a narrow 图片.png bound. We empirically show that Bulyan does not suffer the fragility of existing aggregation rules and, at a reasonable cost in terms of required batch size, achieves convergence as if only non–Byzantine gradients had been used to update the model.

上一篇：Accurate Uncertainties for Deep Learning Using Calibrated Regression

下一篇：Gradient Coding from Cyclic MDS Codes and Expander Graphs

用户评价

全部评价

还没有评论，说两句吧！

热门资源

Learning to learn...

The move from hand-designed features to learned...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
Rating-Boosted La...

The performance of a recommendation system reli...
Hierarchical Task...

We extend hierarchical task network planning wi...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com