Search
Skip to Search Results- 2Abdi Oskouie, Mina
- 2Birkbeck, Neil Aylon Charles
- 2Cai, Zhipeng
- 2Chen, Jiyang
- 2Chowdhury, Md Solimul
- 2Chubak, Pirooz
- 74Machine Learning
- 70Reinforcement Learning
- 41Artificial Intelligence
- 36Machine learning
- 22Natural Language Processing
- 22Reinforcement learning
-
Spring 2022
Policy gradient (PG) estimators are ineffective in dealing with softmax policies that are sub-optimally saturated, which refers to the situation when the policy concentrates its probability mass on sub-optimal actions. Sub-optimal policy saturation may arise from a bad policy initialization or a...
-
Fall 2013
The amount of information available on the Web is rapidly growing, and the need for extracting more useful and relevant data from this tremendously large source has become an interesting research challenge. Among various types of useful information that can be extracted, lists in particular are...
-
Spring 2019
Forouzandehmoghadam, Amirhosein
A biomarker is a feature (e.g., gene expression, SNP, etc.) that is significantly different between two classes of instances – typically case and control. Knowing these biomarkers can help us understand a biological condition or identify the appropriate treatment for a certain disease. Many...
-
Spring 2016
This thesis describes a novel controversy model that helps the current manual process in automatically identifying controversial Wikipedia articles and warning readers about disputable information contained in these articles. The model is based on identifying collaboration patterns among editors...