| |
| |
Revisiting graph coloring register allocation : a study of the Chaitin-Briggs and Callahan-Koblenz algorithms | |
| |
| |
| |
Register pressure in software-pipelined loop nests : fast computation and impact on architecture design | |
| |
| |
| |
Manipulating MAXLIVE for spill-free register allocation | |
| |
| |
| |
Optimizing packet accesses for a domain specific language on network processors | |
| |
| |
| |
Array replication to increase parallelism in applications mapped to configurable architectures | |
| |
| |
| |
Generation of control and data flow graphs from scheduled and pipelined assembly code | |
| |
| |
| |
Applying data copy to improve memory performance of general array computations | |
| |
| |
| |
A cache-conscious profitability model for empirical tuning of loop fusion | |
| |
| |
| |
Optimizing matrix multiplication with a classifier learning system | |
| |
| |
| |
A language for the compact representation of multiple program versions | |
| |
| |
| |
Efficient computation of may-happen-in-parallel information for concurrent Java programs | |
| |
| |
| |
Evaluating the impact of thread escape analysis on a memory consistency model-aware compiler | |
| |
| |
| |
Concurrency analysis for parallel programs with textually aligned barriers | |
| |
| |
| |
Titanium performance and potential : an NPB experimental study | |
| |
| |
| |
Efficient search-space pruning for integrated fusion and tiling transformations | |
| |
| |
| |
Automatic measurement of instruction cache capacity | |
| |
| |
| |
Combined ILP and register tiling : analytical model and optimization framework | |
| |
| |
| |
Analytic models and empirical search : a hybrid approach to code optimization | |
| |
| |
| |
Testing speculative work in a lazy/eager parallel functional language | |
| |
| |
| |
Loop selection for thread-level speculation | |
| |
| |
| |
Software thread level speculation for the Java language and virtual machine environment | |
| |
| |
| |
Lightweight monitoring of the progress of remotely executing computations | |
| |
| |
| |
Using platform-specific performance counters for dynamic compilation | |
| |
| |
| |
A domain-specific interpreter for parallelizing a large mixed-language visualisation application | |
| |
| |
| |
Compiler control power saving scheme for multi core processors | |
| |
| |
| |
Code transformations for one-pass analysis | |
| |
| |
| |
Scalable array SSA and array data flow analysis | |
| |
| |
| |
Interprocedural symbolic range propagation for optimizing compilers | |
| |
| |
| |
Parallelization of utility programs based on behavior phase analysis | |
| |
| |
| |
A systematic approach to model-guided empirical search for memory hierarchy optimization | |
| |
| |
| |
An efficient approach for self-scheduling parallel loops on multiprogrammed parallel computers | |
| |
| |
| |
Dynamic compilation for reducing energy consumption of I/O-intensive applications | |
| |
| |
| |
Supporting SELL for high-performance computing | |
| |
| |
| |
Compiler supports and optimizations for PAC VLIW DSP processors | |
| |