Search
Skip to Search Results- 2Abdi Oskouie, Mina
- 2Birkbeck, Neil Aylon Charles
- 2Cai, Zhipeng
- 2Chen, Jiyang
- 2Chowdhury, Md Solimul
- 2Chubak, Pirooz
- 74Machine Learning
- 70Reinforcement Learning
- 41Artificial Intelligence
- 36Machine learning
- 22Natural Language Processing
- 22Reinforcement learning
-
Fall 2023
Curiosity appears to motivate and guide effective learning in humans, which has led to high hopes in the machine learning community for machine analogues of curiosity. While a variety of machine curiosity algorithms have been introduced, they are rarely compared with other existing curiosity...
-
Spreadsheets for Legal Reasoning: The Continued Promise of Declarative Logic Programming in Law
DownloadFall 2020
The legal services market is one in which there is too much demand, and too little supply. One method of increasing supply in a market is to increase efficiency by automating. Automated legal services require the automation of legal reasoning. Declarative logic programming (DLP) has long been...
-
Fall 2016
Big data applications demand and consequently lead to developments of diverse scalable data management systems, ranging from NoSQL systems to the emerging NewSQL systems. In order to serve thousands of applications and their huge amounts of data, data management systems must be capable of...
-
Spring 2024
In reinforcement learning, the notion of state plays a central role. A reinforcement learning agent requires the state to evaluate its current situation, select actions, and construct a model of the environment. In the classic setting, it is assumed that the environment provides the agent with...
-
Fall 2014
Designing competitive Artificial Intelligence (AI) systems for Real-Time Strategy (RTS) games often requires a large amount of expert knowledge (resulting in hard-coded rules for the AI system to follow). However, aspects of an RTS agent can be learned from human replay data. In this thesis, we...
-
Spring 2015
In this thesis, I study the problem of Monte-Carlo Planning in deterministic do- mains with sparse rewards. A popular algorithm in this suite, UCT, is studied. A new algorithm to incorporate state generalization in UCT using estimates of sim- ilar nodes and a distance metric is presented. The...