This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.
Search
Skip to Search Results-
Fall 2011
Data is growing at an unforeseen rate, with TBs being generated daily. A large part of this data is unstructured in nature. This has pushed the traditional techniques of storing it in relational databases to its limit and new alternatives are necessary. Cloud databases have emerged as a viable...
-
2016
Polato, I., Barbosa, D., Hindle, Abram, Kon, F.
Apache Hadoop has evolved significantly over the last years, with more than 60 releases bringing new features. By implementing the MapReduce programming paradigm and leveraging HDFS, its distributed file system, Hadoop has become a reliable and fault tolerant middleware for parallel and...
-
Spring 2015
Cluster analysis plays a very important role for understanding various phenomena about data without any prior knowledge. However, hierarchical clustering algorithms, which are widely used for its representation of data, are computationally expensive. Recently large datasets are prevalent in many...