Some of the I-ACOMA Publications


Architecture and Systems:

  1. Executing Sequential Binaries on a Multithreaded Architecture with Speculation Support
    by Venkata Krishnan and Josep Torrellas,
    Workshop on Multi-Threaded Execution, Architecture and Compilation (MTEAC'98), January 1998.

  2. A Clustered Approach to Multithreaded Processors
    by Venkata Krishnan and Josep Torrellas,
    International Parallel Processing Symposium, March 1998.

  3. Cache-Only Memory Architectures
    by Fredrik Dahlgren and Josep Torrellas,
    Submitted for publication, January 1998.

  4. Hardware for Speculative Run-Time Parallelization in Distributed Shared-Memory Multiprocessors
    by Ye Zhang, Lawrence Rauchwerger, and Josep Torrellas,
    Fourth International Symposium on High-Performance Computer Architecture, February 1998.

  5. Enhancing Memory Use in Simple Coma: Multiplexed Simple Coma
    by Sujoy Basu and Josep Torrellas,
    Fourth International Symposium on High-Performance Computer Architecture, February 1998.

  6. How Processor-Memory Integration Affects the Design of DSMs
    by Liuxi Yang, Anthony-Trung Nguyen, and Josep Torrellas,
    Workshop on Mixing Logic and DRAM: Chips that Compute and Remember, June 1997.

  7. Efficient Use of Processing Transistors for Larger On-Chip Storage: Multithreading
    by Venkata Krishnan and Josep Torrellas,
    Workshop on Mixing Logic and DRAM: Chips that Compute and Remember, June 1997.

  8. The Memory Performance of DSS Commercial Workloads in Shared-Memory Multiprocessors
    by Pedro Trancoso, Josep-L. Larriba-Pey, Zheng Zhang, and Josep Torrellas,
    Third International Symposium on High-Performance Computer Architecture, January 1997.

  9. Reducing Remote Conflict Misses: NUMA with Remote Cache versus COMA
    by Zheng Zhang and Josep Torrellas,
    Third International Symposium on High-Performance Computer Architecture, January 1997.

  10. Speeding up the Memory Hierarchy in Flat COMA Multiprocessors
    by Liuxi Yang and Josep Torrellas,
    Third International Symposium on High-Performance Computer Architecture, January 1997.

  11. The Illinois Aggressive Coma Multiprocessor Project (I-ACOMA)
    by Josep Torrellas and David Padua,
    6th Symposium on the Frontiers of Massively Parallel Computing, October 1996.

  12. An Efficient Implementation of Tree-Based Multicast Routing for Distributed Shared-Memory Multiprocessors
    by Manuel Perez Malumbres(*), Jose Duato(*), and Josep Torrellas,
    (* Universidad Politecnica de Valencia). 1996 Symposium on Parallel and Distributed Processing, October 1996.

  13. Comprehensive Hardware and Software Support for Operating Systems to Exploit MP Memory Hierarchies
    by Chun Xia and Josep Torrellas,
    Submitted for publication, 1996.

  14. The Impact of Speeding up Critical Sections with Data Prefetching and Forwarding
    by Pedro Trancoso and Josep Torrellas,
    1996 International Conference on Parallel Processing, August 1996.

  15. Instruction Prefetching of Systems Codes With Layout Optimized for Reduced Cache Misses
    by Chun Xia and Josep Torrellas,
    23rd International Symposium on Computer Architecture, June 1996.

  16. Optimizing the Primary Cache for Parallel Scientific Applications: The Pool Buffer Approach
    by Liuxi Yang and Josep Torrellas,
    1996 International Conference on Supercomputing, June 1996.

  17. Distance-Adaptive Update Protocols for Scalable Shared-Memory Multiprocessors
    by Alain Raynaud, Zheng Zhang, and Josep Torrellas,
    Second International Symposium on High-Performance Computer Architecture, January 1996.

  18. Improving the Data Cache Performance of Multiprocessor Operating Systems,
    by Chun Xia and Josep Torrellas,
    2nd International Symposium on High Performance Computer Architecture, January 1996.

  19. Data Forwarding in Scalable Shared-Memory Multiprocessors,
    by D. A. Koufaty, X. Chen, D. K. Poulsen, and J. Torrellas,
    1995 International Conference on Supercomputing, July 1995.
    Extended Version in IEEE Transactions on Parallel and Distributed Systems, December 1996.

  20. Speeding up Irregular Applications in Shared-Memory Multiprocessors: Memory Binding and Group Prefetching,
    by Zheng Zhang and Josep Torrellas,
    22nd International Symposium on Computer Architecture, June 1995.

  21. Optimizing Instruction Cache Performance for Operating System Intensive Workloads,
    by Josep Torrellas, Chun Xia and Russell Daigle,
    1st International Symposium on High Performance Computer Architecture, January 1995.
    Extended Version To appear in IEEE Transactions on Computers, 1996.

  22. Low Perturbation Address Trace Collection with Simple Hardware Performance Monitors,
    by Russell Daigle, Chun Xia, and Josep Torrellas,
    Submitted for publication to a Journal, 1995.

  23. The Performance of the Cedar Multistage Switching Network,
    by Josep Torrellas and Zheng Zhang,
    Supercomputing'94, November 1994.
    Extended Version in IEEE Transactions on Parallel and Distributed Systems, April 1997.

  24. An Efficient Algorithm for the Run-time Parallelization of DOACROSS Loops,
    by Ding-Kai Chen, David Oesterreich, Josep Torrellas and Pen-Chung Yew,
    Supercomputing'94, November 1994.
    Extended Version Submitted to a Journal, 1995.

  25. Comparing the Performance and Programmibility of the DASH and Cedar Multiprocessors for Scientific Loads,
    by Josep Torrellas and David Koufaty, and David Padua,
    1994 International Conference on Parallel Processing, August 1994.
    Extended Version Submitted to a Journal, 1995.

Tools and Other:

  1. The Augmint Multiprocessor Simulation Toolkit for Intel x86 Architectures,
    by Anthony-Trung Nguyen, Maged Michael, Arun Sharma, and Josep Torrellas,
    1996 International Conference on Computer Design, October 1996.

  2. Computer Architecture Education at the University of Illinois: Current Status and Some Thoughts,
    by Josep Torrellas,
    IEEE Computer Architecture Technical Committee Newsletter, June 1996.

  3. Scalable Shared-Memory Architectures: Introduction to MiniTrack,
    by Josep Torrellas,
    28th Hawaii International Conference on System Sciences, January 1995.

  4. Evaluating the Performance of Cache-Affinity Scheduling in Shared-Memory Multiprocessors,
    by Josep Torrellas, Andrew Tucker and Anoop Gupta,
    Journal of Parallel and Distributed Computing, February 1995.

  5. False Sharing and Spatial Locality in Multiprocessor Caches,
    by Josep Torrellas, Monica S. Lam and John L. Hennessy,
    Transactions on Computers, June 1994.

  6. Characterizing the Caching and Synchronization Performance of a Multiprocessor Operating System,
    by Josep Torrellas, Anoop Gupta, and John Hennessy,
    ASPLOS V, October 1992.