Ohad Shamir (Weizmann) – Training Neural Networks: The Bigger the Better?

Date & Time:

November 1, 2019 10:30 am – 11:30 am

Location:

Crerar 390, 5730 S. Ellis Ave., Chicago, IL,

11/01/2019 10:30 AM 11/01/2019 11:30 AM America/Chicago Ohad Shamir (Weizmann) – Training Neural Networks: The Bigger the Better? CS / Toyota Technological Institute of Chicago Machine Learning Seminar Series Crerar 390, 5730 S. Ellis Ave., Chicago, IL,

Training Neural Networks: The Bigger the Better?

Artificial neural networks are nowadays routinely trained to solve challenging learning tasks, but our theoretical understanding of this phenomenon remains quite limited. One increasingly popular approach, which is aligned with practice, is to study how making the network sufficiently large (a.k.a. “over-parameterized'') makes the associated training problem easier. In this talk, I'll describe some of the possibilities and challenges in understanding neural networks using this approach. Based on joint works with Itay Safran and Gilad Yehudai.

Ohad Shamir

Faculty Member, Department of Computer Science and Applied Mathematics, Weizmann Institute

Ohad Shamir is a faculty member at the Department of Computer Science and Applied Mathematics at the Weizmann Institute. He received his PhD in 2010 at the Hebrew University, and between 2010-2013 and 2017-2018 was a researcher at Microsoft Research in Boston. His research focuses on theoretical machine learning, in areas such as theory of deep learning, learning with information and communication constraints, and topics at the intersection of machine learning and optimization. He received several awards, and served as program co-chair of COLT as well as a member of its steering committee.

Resources

Community

Two UChicago CS Students Awarded NSF Graduate Research Fellowship

Non-Unital Noise Adds a New Wrinkle to the Quantum Supremacy Debate

The Science of Computer Security: An Interview with Grant Ho, Assistant Professor in Computer Science

Moon Duchin (Tufts University) – Design for Democracy

“Machine Learning Foundations Accelerate Innovation and Promote Trustworthiness” by Rebecca Willett

Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao

Ian Foster – Better Information Faster: Programming the Continuum

Training Neural Networks: The Bigger the Better?

Ohad Shamir

Two UChicago CS Students Awarded NSF Graduate Research Fellowship

Non-Unital Noise Adds a New Wrinkle to the Quantum Supremacy Debate

The Science of Computer Security: An Interview with Grant Ho, Assistant Professor in Computer Science

Four Students Receive Honorable Mention in CRA Undergraduate Research Awards

Navigating the Intersection of Technology and Public Policy: The Journey of Ranya Sharma at UChicago

Assistant Professor Aloni Cohen Receives Prestigious Award for Groundbreaking Research in Machine Learning Complexity

Haifeng Xu named a AI2050 Early Career Fellow

FabRobotics: The Fusion of 3D Printing and Mobile Robots

Professor Andrew A. Chien on the Environmental Impacts of Technology

Assistant Professor Yanjing Li Awarded NSF CAREER Grant for Innovative Computer Architecture and Deep Learning Research

Prof. Rebecca Willett awarded the SIAG DATA Career prize

Argonne scientists use AI to identify new materials for carbon capture