Title: Controllability and Uncertainty in Generative Models

 

Date: Friday, November 3, 2023

Time: 9-10am ET

Location: Coda C0915 Atlantic & Zoom

 

Cusuh Ham

Machine Learning PhD Student

School of Interactive Computing
Georgia Institute of Technology

 

Committee

Dr. James Hays (Advisor) - School of Interactive Computing, Georgia Tech

Dr. Judy Hoffman - School of Interactive Computing, Georgia Tech

Dr. Zsolt Kira - School of Interactive Computing, Georgia Tech

Dr. Humphrey Shi - School of Interactive Computing, Georgia Tech

Dr. Jun-Yan Zhu - School of Computer Science, Carnegie Mellon

 

Abstract

This dissertation describes methods for enhancing generative models with either added controllability or expressiveness of uncertainty, demonstrating how a strong prior enables both features. One general approach is to introduce new architectures or training objectives. However, current trends towards massive upscaling of model size, training data, and computational resources can make retraining or fine-tuning difficult and expensive. Thus, another approach is to build upon existing pre-trained models. We consider both types of approaches with an emphasis on the latter. We first tackle the tasks of controllable image synthesis and uncertainty estimation through training-based methods and then switch focus towards computationally-efficient methods that do not require direct updates to the base model's parameters. We conclude with an overview of our latest work on personalization of text-to-image diffusion models, which efficiently recontextualizes a target concept into new settings and configurations.