Reducing Discretization Error in the Frank-Wolfe Method

Zhaoyue Chen; Yifan Sun

Reducing Discretization Error in the Frank-Wolfe Method

Zhaoyue Chen, Yifan Sun

Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, PMLR 206:9697-9727, 2023.

Abstract

The Frank-Wolfe algorithm is a popular method in structurally constrained machine learning applications, due to its fast per-iteration complexity. However, one major limitation of the method is a slow rate of convergence that is difficult to accelerate due to erratic, zig-zagging step directions, even asymptotically close to the solution. We view this as an artifact of discretization; that is to say, the Frank-Wolfe flow, which is its trajectory at asymptotically small step sizes, does not zig-zag, and reducing discretization error will go hand-in-hand in producing a more stabilized method, with better convergence properties. We propose two improvements: a multistep Frank-Wolfe method that directly applies optimized higher-order discretization schemes; and an LMO-averaging scheme with reduced discretization error, and whose local convergence rate over general convex sets accelerates from a rate of

$O(1/k)$ to up to

$O(1/k^{3/2})$ .

Cite this Paper

BibTeX


@InProceedings{pmlr-v206-chen23g,
  title = 	 {Reducing Discretization Error in the Frank-Wolfe Method},
  author =       {Chen, Zhaoyue and Sun, Yifan},
  booktitle = 	 {Proceedings of The 26th International Conference on Artificial Intelligence and Statistics},
  pages = 	 {9697--9727},
  year = 	 {2023},
  editor = 	 {Ruiz, Francisco and Dy, Jennifer and van de Meent, Jan-Willem},
  volume = 	 {206},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {25--27 Apr},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v206/chen23g/chen23g.pdf},
  url = 	 {https://proceedings.mlr.press/v206/chen23g.html},
  abstract = 	 {The Frank-Wolfe algorithm is a popular method in structurally constrained machine learning applications, due to its fast per-iteration complexity. However, one major limitation of the method is a slow rate of convergence that is difficult to accelerate due to erratic, zig-zagging step directions, even asymptotically close to the solution. We view this as an artifact of discretization; that is to say, the Frank-Wolfe flow, which is its trajectory at asymptotically small step sizes, does not zig-zag, and reducing discretization error will go hand-in-hand in producing a more stabilized method, with better convergence properties. We propose two improvements: a multistep Frank-Wolfe method that directly applies optimized higher-order discretization schemes; and an LMO-averaging scheme with reduced discretization error, and whose local convergence rate over general convex sets accelerates from a rate of $O(1/k)$ to up to $O(1/k^{3/2})$.}
}

Endnote

%0 Conference Paper
%T Reducing Discretization Error in the Frank-Wolfe Method
%A Zhaoyue Chen
%A Yifan Sun
%B Proceedings of The 26th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2023
%E Francisco Ruiz
%E Jennifer Dy
%E Jan-Willem van de Meent	
%F pmlr-v206-chen23g
%I PMLR
%P 9697--9727
%U https://proceedings.mlr.press/v206/chen23g.html
%V 206
%X The Frank-Wolfe algorithm is a popular method in structurally constrained machine learning applications, due to its fast per-iteration complexity. However, one major limitation of the method is a slow rate of convergence that is difficult to accelerate due to erratic, zig-zagging step directions, even asymptotically close to the solution. We view this as an artifact of discretization; that is to say, the Frank-Wolfe flow, which is its trajectory at asymptotically small step sizes, does not zig-zag, and reducing discretization error will go hand-in-hand in producing a more stabilized method, with better convergence properties. We propose two improvements: a multistep Frank-Wolfe method that directly applies optimized higher-order discretization schemes; and an LMO-averaging scheme with reduced discretization error, and whose local convergence rate over general convex sets accelerates from a rate of $O(1/k)$ to up to $O(1/k^{3/2})$.

APA


Chen, Z. & Sun, Y.. (2023). Reducing Discretization Error in the Frank-Wolfe Method. Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 206:9697-9727 Available from https://proceedings.mlr.press/v206/chen23g.html.

Reducing Discretization Error in the Frank-Wolfe Method

Abstract

Cite this Paper

Related Material