Search

Filter

Subject / Keyword

Show 4 more ...

Author / Creator / Contributor

Show 4 more ...

Year

Collections

Show 2 more ...

Languages

Item type

Departments

Supervisors

Show 4 more ...

On the Control of Electric Vehicle Charging in the Smart Grid
Download

Fall 2020

Zishan, Abdullah Al

Over the last decade, the demand for electric vehicles (EVs) has surged across the globe. This spurred an increase in the installation of public and private EV charging points which are typically connected to low-voltage power distribution feeders. A high penetration of plug-in EVs in...
Online Learning for Linearly Parametrized Control Problems
Download

Spring 2013

Abbasi-Yadkori, Yasin

In a discrete-time online control problem, a learner makes an effort to control the state of an initially unknown environment so as to minimize the sum of the losses he suffers, where the losses are assumed to depend on the individual state-transitions. Various models of control problems have...
Optimal Real-Time Battery Scheduling with Reinforcement Learning and Neural Networks
Download

Fall 2021

Quiroz Juarez, Carolina

Climate change concerns have raised awareness about the importance of decarbonizing the power sector. In achieving such a goal, energy storage is a critical operation that is currently done using mostly fossil fuels as chemical energy storage. The only viable alternative is battery energy storage...
Policy Gradient Reinforcement Learning Without Regret
Download

Spring 2015

Dick, Travis B

This thesis consists of two independent projects, each contributing to a central goal of artificial intelligence research: to build computer systems that are capable of performing tasks and solving problems without problem-specific direction from us, their designers. I focus on two formal...
Policy Selection for Transfer Learning in the Building Control Domain
Download

Fall 2023

Krishna Guruvayur Sasikumar, Aakash

The application of reinforcement learning (RL) to the optimal control of building systems has gained traction in recent years as it can reduce building energy consumption and improve human comfort, without requiring the knowledge of the building model. However, existing RL solutions for building...
Predictive Knowledge in Robots: An Empirical Comparison of Learning Algorithms
Download

Fall 2018

Banafsheh Rafiee

Knowledge is central to intelligence. Intelligence can be thought of as the ability to acquire knowledge and apply it effectively. Despite being a subject of intense interest in artificial intelligence, it is not yet clear what the best approach is for an intelligent system to acquire and...
Predictive Representation Learning for Language Modeling
Download

Fall 2020

Lan, Qingfeng

Language Modeling (LM) is often formulated as a next-word prediction problem over a large vocabulary, which makes it challenging. To effectively perform the task of next-word prediction, Long Short Term Memory networks (LSTMs) must keep track of many types of information. Some information is...
Primal-Dual Algorithms for Learning in Constrained Markov Decision Processes
Download

Fall 2023

Liu, Chang

Many real-world tasks in fields such as robotics and control can be formulated as constrained Markov decision processes (CMDPs). In CMDPs, the objective is usually to optimize the return while ensuring some constraints being satisfied at the same time. The primal-dual approach is a common...
Regret Minimization with Function Approximation in Extensive-Form Games
Download

Fall 2020

D'Orazio, Ryan

Computing a Nash equilibrium in zero-sum games, or more generally saddle point optimization, is a fundamental problem in game theory and machine learning, with applications spanning across a wide variety of domains, from generative modeling and computer vision to super-human AI in imperfect...
Regularization in reinforcement learning
Download

Fall 2011

Farahmand, Amir-massoud

This thesis studies the reinforcement learning and planning problems that are modeled by a discounted Markov Decision Process (MDP) with a large state space and finite action space. We follow the value-based approach in which a function approximator is used to estimate the optimal value function....