Search
Skip to Search Results- 6Energy consumption
- 6Mining software repositories
- 5Android
- 4LDA
- 4Machine learning
- 4Software energy consumption
- 95Computing Science, Department of
- 71Computing Science, Department of/Conference Papers (Computing Science)
- 23Computing Science, Department of/Journal Articles (Computing Science)
- 2WISEST Summer Research Program
- 2WISEST Summer Research Program/WISEST Research Posters
- 1Toolkit for Grant Success
-
Judging a commit by its cover: Correlating commit message entropy with build status on Travis-CI
Download2016
Developers summarize their changes to code in commit messages. When a message seems "unusual", however, this puts doubt into the quality of the code contained in the commit. We trained n-gram language models and used cross-entropy as an indicator of commit message "unusualness" of over 120,000...
-
2015
Chowdhury, S.A., Hindle, Abram
Internet Relay Chat (IRC) is a commonly used tool by Open Source developers. Developers use IRC channels to discuss programming related problems, but much of the discussion is irrelevant and off-topic. Essentially if we treat IRC discussions like email messages, and apply spam filtering, we can...
-
2013
Hindle, Abram, Wang, Jiemin, Nascimento, Mario A., Jiang, Feng
The open source development community consists of both paid and volunteer developers as well as new and experienced users. Previous work has applied social network analysis (SNA) to open source communities and has demonstrated value in expertise discovery and triaging. One problem with applying...
-
2011
Hindle, Abram, Godfrey, M.M., Holt, R.C.
Software development is difficult to model, particularly the noisy, non-stationary signals of changes per time unit, extracted from version control systems (VCSs). Currently researchers are utilizing timeseries analysis tools such as ARIMA to model these signals extracted from a project's VCS....
-
2021
Wong, A.W., Salimi, A., Hindle, Abram, Kalmady, S.V., Kaul, P.
The 12-lead electrocardiogram (ECG) measures the electrical activity of the heart for physicians to use in diagnosing cardiac disorders. This paper investigates the multi-label, multi-class classification of ECG records into one or more of 27 possible medical diagnoses. Our multi-step approach...
-
2020
Wong, A.W., Sun, W., Kalmady, S.V., Kaul, P., Hindle, Abram
The 12-lead electrocardiogram (ECG) is a commonly used tool for detecting cardiac abnormalities such as atrial fibrillation, blocks, and irregular complexes. For the PhysioNet/CinC 2020 Challenge, we built an algorithm using gradient boosted tree ensembles fitted on morphology and signal...
-
2014
Consumer demand for longer lasting battery life in mobile computers, as well as industry interest in energy efficient cloud infrastructure, creates a need for hardware and software energy efficiency improvements. One way to tackle this problem is from a software perspective. If it were known...
-
2011
Uddin, S., Roy, C.K., Schneider, K.A., Hindle, Abram
Clone detection techniques essentially cluster textually, syntactically and/or semantically similar code fragments in or across software systems. For large datasets, similarity identification is costly both in terms of time and memory, and especially so when detecting near-miss clones where lines...
-
2016
Hindle, Abram, Barr, E.T., Su, Z., Devanbu, P.T., Gabel, M.
Natural languages like English are rich, complex, and powerful. The highly creative and graceful use of languages like English and Tamil, by masters like Shakespeare and Avvaiyar, can certainly delight and inspire. But in practice, given cognitive constraints and the exigencies of daily life,...
-
2012
Hindle, Abram, Barr, E.T., Su, Z., Devanbu, P.T., Gabel, M.
Natural languages like English are rich, complex, and powerful. The highly creative and graceful use of languages like English and Tamil, by masters like Shakespeare and Avvaiyar, can certainly delight and inspire. But in practice, given cognitive constraints and the exigencies of daily life,...