-
2011
Amaral, Jose Nelson, Garg, Rahul
Technical report TR11-05. A new compilation framework enables the execution of numerical-intensive applications in an execution environment that is formed by multi-core Central Processing Units (CPUs) and Graphics Processing Units (GPUs). A critical innovation is the use of a variation of Linear...
-
2008
Niewiadomski, Robert, Amaral, Jose Nelson, Holte, Robert
Technical report TR08-18. We present an advanced Bidirectional A* algorithm featuring an application of Frontier Search and a strategy for the performance-efficient utilization of External Memory. We present the results of an experimental evaluation demonstrating that this algorithm is capable of...
-
2014
Finkel, Hal, Bhattacharyya, Arnamoy, Amaral, Jose Nelson
Thread Level Speculation (TLS) is a hardware/software technique that guarantees correct parallel execution of loops even in the presence of dependences and has the potential to lead to performance gains through the parallelization of loops that cannot be proven to be free of dependencies at compile...
-
2019-01-01
Lloyd, Taylor, Ali, Karim, Amaral, Jose Nelson
Graphics Processing Units (GPUs) have been widely used to accelerate the performance of programs. However, such performance gains can be significantly degraded by irregular data accesses and by control-flow divergence. Both of these performance issues arise only in the presence of...