Bulletin of the American Physical Society
APS March Meeting 2022
Volume 67, Number 3
Monday–Friday, March 14–18, 2022; Chicago
Session K09: Physics of Machine Learning II
3:00 PM–5:48 PM,
Tuesday, March 15, 2022
Room: McCormick Place W-180
Sponsoring
Units:
GSNP GDS DCOMP DSOFT
Chair: Yuhai Tu, IBM T. J. Watson Research Center
Abstract: K09.00012 : Criticality in Deep Neural Networks using Jacobian(s)*
5:12 PM–5:24 PM
Presenter:
Darshil H Doshi
(Brown University)
Authors:
Darshil H Doshi
(Brown University)
Andrey Gromov
(Brown University)
Tianyu He
(Brown University)
Working in this limit, we look at the “propagation of signal” through the network to identify “phases” in the space of parameter-distributions (weights and biases). Specifically, we focus on the propagation of gradients, using the Jacobian(s) of the network function. The norm of the Jacobian matrices succinctly capture the converging and diverging behavior (phases) of the gradient propagation. Furthermore, we show that the network performs optimally at the boundary of these two phases. The analysis provides us with the optimal values of parameter-initializations for training DNNs.
*NSF CAREER Award DMR-2045181
Follow Us |
Engage
Become an APS Member |
My APS
Renew Membership |
Information for |
About APSThe American Physical Society (APS) is a non-profit membership organization working to advance the knowledge of physics. |
© 2024 American Physical Society
| All rights reserved | Terms of Use
| Contact Us
Headquarters
1 Physics Ellipse, College Park, MD 20740-3844
(301) 209-3200
Editorial Office
100 Motor Pkwy, Suite 110, Hauppauge, NY 11788
(631) 591-4000
Office of Public Affairs
529 14th St NW, Suite 1050, Washington, D.C. 20045-2001
(202) 662-8700