Representation Alignment in Neural Networks

Imani, Ehsan

doi:doi:10.7939/r3-d3ds-7758

This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

View

Download

Communities and Collections

Graduate and Postdoctoral Studies (GPS), Faculty of / Theses and Dissertations

Usage

67 views
139 downloads

Representation Alignment in Neural Networks

Author / Creator

Imani, Ehsan
Classical wisdom in machine learning advises controlling the complexity of the hypothesis space for achieving good generalization. Despite this, modern overparametrized neural networks demonstrate remarkably high generalization performance, oftentimes with larger and more expressive architectures outperforming smaller ones. Motivated by these observations and other studies that produced similar phenomena in kernel regression, we study generalization in high-dimensional linear models through the lens of representation alignment, a measure of how much the labels vary in directions where the data is more spread out. Understanding when this relationship between the features and the labels holds and its potential for refining theoretical analyses and algorithms underlie the contributions in this thesis. We formally describe representation alignment and show how it connects to optimization and generalization. We then evaluate neural network hidden representations with this measure and find that training neural networks increase representation alignment in their hidden representations under a wide range of architectures and design choices. Based on these observation, we derive a regularization method for domain adaptation and find that enforcing alignment between the predictions and the given representation can help in domain adaptation. Finally, we extend the insights to policy evaluation and study generalization with temporal-difference learning.
Subjects / Keywords
Graduation date

Fall 2024
Type of Item

Thesis
Degree

Doctor of Philosophy
DOI

https://doi.org/10.7939/r3-d3ds-7758
License

This thesis is made available by the University of Alberta Library with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.

Language

English
Institution

University of Alberta
Degree level

Doctoral
Department
- Department of Computing Science
Supervisor / co-supervisor and their department(s)
- White, Martha (Computing Science)