ACM International Conference on Supercomputing, ICS 2016


Research Artifacts: A research artifact is any by-product of a research project that is not directly included in the published research paper. In computer science research this is often source code and data sets, but it could also be media, documentation, inputs to proof assistants, shell scripts to run experiments, etc.

Simulation and Analysis Engine for Scale-Out Workloads

Nadav Chachmon, Daniel Richins, Robert S. Cohn, Magnus Christensson, Wenzhi Cui, Vijay Janapa Reddi

Discussion Comments: 0
Verification: Authors have not verified information

Origami: Folding Warps for Energy Efficient GPUs

Mohammad Abdel-Majeed, Daniel Wong, Justin Kuang, Murali Annavaram

Discussion Comments: 0
Verification: Authors have not verified information

CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs

Mehmet E. Belviranli, Farzad Khorasani, Laxmi N. Bhuyan, Rajiv Gupta

Discussion Comments: 0
Verification: Authors have not verified information

Optimizing Sparse Matrix-Vector Multiplication for Large-Scale Data Analytics

Daniele Buono, Fabrizio Petrini, Fabio Checconi, Xing Liu, Xinyu Que, Chris Long, Tai-Ching Tuan

Discussion Comments: 0
Verification: Authors have not verified information

GCaR: Garbage Collection aware Cache Management with Improved Performance for Flash-based SSDs

Suzhen Wu, Yanping Lin, Bo Mao, Hong Jiang

Discussion Comments: 0
Verification: Authors have not verified information

GreenGear: Leveraging and Managing Server Heterogeneity for Improving Energy Efficiency in Green Data Centers

Xu Zhou, Haoran Cai, Qiang Cao, Hong Jiang, Lei Tian, Changsheng Xie

Discussion Comments: 0
Verification: Authors have not verified information

Fast Multiplication in Binary Fields on GPUs via Register Cache

Eli Ben-Sasson, Matan Hamilis, Mark Silberstein, Eran Tromer

Discussion Comments: 0
Verification: Authors have not verified information

TurboTiling: Leveraging Prefetching to Boost Performance of Tiled Codes

Sanyam Mehta, Rajat Garg, Nishad Trivedi, Pen-Chung Yew

Discussion Comments: 0
Verification: Authors have not verified information

Coherence-Free Multiview: Enabling Reference-Discerning Data Placement on GPU

Guoyang Chen, Xipeng Shen

Discussion Comments: 0
Verification: Authors have not verified information

DSMR: A Parallel Algorithm for Single-Source Shortest Path Problem

Saeed Maleki, Donald Nguyen, Andrew Lenharth, María Jesús Garzarán, David A. Padua, Keshav Pingali

Discussion Comments: 0
Verification: Authors have not verified information

Proteus: Exploiting Numerical Precision Variability in Deep Neural Networks

Patrick Judd, Jorge Albericio, Tayler H. Hetherington, Tor M. Aamodt, Natalie D. Enright Jerger, Andreas Moshovos

Discussion Comments: 0
Verification: Authors have not verified information

SARVAVID: A Domain Specific Language for Developing Scalable Computational Genomics Applications

Kanak Mahadik, Christopher Wright, Jinyi Zhang, Milind Kulkarni, Saurabh Bagchi, Somali Chaterji

Discussion Comments: 0
Verification: Authors have not verified information

Mini-Ckpts: Surviving OS Failures in Persistent Memory

David Fiala, Frank Mueller, Kurt B. Ferreira, Christian Engelmann

Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Barrier-Aware Warp Scheduling for Throughput Processors

Yuxi Liu, Zhibin Yu, Lieven Eeckhout, Vijay Janapa Reddi, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Cheng-Zhong Xu

Discussion Comments: 0
Verification: Authors have not verified information

Variation Among Processors Under Turbo Boost in HPC Systems

Bilge Acun, Phil Miller, Laxmikant V. Kalé

Discussion Comments: 0
Verification: Authors have not verified information

Fairness-oriented OS Scheduling Support for Multicore Systems

Changdae Kim, Jaehyuk Huh

Discussion Comments: 0
Verification: Authors have not verified information

Noise Aware Scheduling in Data Centers

Hameedah Sultan, Arpit Katiyar, Smruti R. Sarangi

Discussion Comments: 0
Verification: Authors have not verified information

Galaxyfly: A Novel Family of Flexible-Radix Low-Diameter Topologies for Large-Scales Interconnection Networks

Fei Lei, Dezun Dong, Xiangke Liao, Xing Su, Cunlu Li

Discussion Comments: 0
Verification: Authors have not verified information

Replichard: Towards Tradeoff between Consistency and Performance for Metadata

Zhiying Li, Ruini Xue, Lixiang Ao

Discussion Comments: 0
Verification: Authors have not verified information

Reusing Data Reorganization for Efficient SIMD Parallelization of Adaptive Irregular Applications

Peng Jiang, Linchuan Chen, Gagan Agrawal

Discussion Comments: 0
Verification: Authors have not verified information

Peruse and Profit: Estimating the Accelerability of Loops

Snehasish Kumar, Vijayalakshmi Srinivasan, Amirali Sharifian, Nick Sumner, Arrvindh Shriraman

Author Comments:
Discussion Comments: 0
Sharing: Not able to share produced artifacts
Verification: Authors have verified information

Runtime-Guided Mitigation of Manufacturing Variability in Power-Constrained Multi-Socket NUMA Nodes

Dimitrios Chasapis, Marc Casas, Miquel Moretó, Martin Schulz, Eduard Ayguadé, Jesús Labarta, Mateo Valero

Discussion Comments: 0
Verification: Authors have not verified information

Exploiting Private Local Memories to Reduce the Opportunity Cost of Accelerator Integration

Emilio G. Cota, Paolo Mantovani, Luca P. Carloni

Discussion Comments: 0
Verification: Authors have not verified information

Towards an Adaptive Multi-Power-Source Datacenter

Longjun Liu, Hongbin Sun, Chao Li, Yang Hu, Nanning Zheng, Tao Li

Discussion Comments: 0
Verification: Authors have not verified information

BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing

Linnan Wang, Wei Wu, Zenglin Xu, Jianxiong Xiao, Yi Yang

Discussion Comments: 0
Verification: Authors have not verified information

SReplay: Deterministic Sub-Group Replay for One-Sided Communication

Xuehai Qian, Koushik Sen, Paul Hargrove, Costin Iancu

Discussion Comments: 0
Verification: Authors have not verified information

Tag-Split Cache for Efficient GPGPU Cache Utilization

Lingda Li, Ari B. Hayes, Shuaiwen Leon Song, Eddy Z. Zhang

Discussion Comments: 0
Verification: Authors have not verified information

Efficient Timestamp-Based Cache Coherence Protocol for Many-Core Architectures

Yuan Yao, Guanhua Wang, Zhiguo Ge, Tulika Mitra, Wenzhi Chen, Naxin Zhang

Discussion Comments: 0
Verification: Authors have not verified information

Exploiting Dynamic Reuse Probability to Manage Shared Last-level Caches in CPU-GPU Heterogeneous Processors

Siddharth Rai, Mainak Chaudhuri

Discussion Comments: 0
Verification: Authors have not verified information

Polly-ACC: Transparent compilation to heterogeneous hardware

Tobias Grosser, Torsten Hoefler

Discussion Comments: 0
Verification: Authors have not verified information

Write-Aware Management of NVM-based Memory Extensions

Amro Awad, Sergey Blagodurov, Yan Solihin

Discussion Comments: 0
Verification: Authors have not verified information

SFU-Driven Transparent Approximation Acceleration on GPUs

Ang Li, Shuaiwen Leon Song, Mark Wijtvliet, Akash Kumar, Henk Corporaal

Discussion Comments: 0
Verification: Authors have not verified information

Prefetching Techniques for Near-memory Throughput Processors

Reena Panda, Yasuko Eckert, Nuwan Jayasena, Onur Kayiran, Michael Boyer, Lizy Kurian John

Discussion Comments: 0
Verification: Authors have not verified information

Balanced Hashing and Efficient GPU Sparse General Matrix-Matrix Multiplication

Pham Nguyen Quang Anh, Rui Fan, Yonggang Wen

Discussion Comments: 0
Verification: Authors have not verified information

Graph Prefetching Using Data Structure Knowledge

Sam Ainsworth, Timothy M. Jones

Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Parallel Transposition of Sparse Data Structures

Hao Wang, Weifeng Liu, Kaixi Hou, Wu-chun Feng

Discussion Comments: 0
Verification: Authors have not verified information

HOPE: Enabling Efficient Service Orchestration in Software-Defined Data Centers

Yang Hu, Chao Li, Longjun Liu, Tao Li

Discussion Comments: 0
Verification: Authors have not verified information

Lynx: Using OS and Hardware Support for Fast Fine-Grained Inter-Core Communication

Konstantina Mitropoulou, Vasileios Porpodas, Xiaochun Zhang, Timothy M. Jones

Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

AEQUITAS: Coordinated Energy Management Across Parallel Applications

Haris Ribic, Yu David Liu

Discussion Comments: 0
Verification: Authors have not verified information

Hybrid CPU-GPU scheduling and execution of tree traversals

Jianqiao Liu, Nikhil Hegde, Milind Kulkarni

Discussion Comments: 0
Verification: Authors have not verified information

Scheduling Tasks with Mixed Timing Constraints in GPU-Powered Real-Time Systems

Yunlong Xu, Rui Wang, Tao Li, Mingcong Song, Lan Gao, Zhongzhi Luan, Depei Qian

Discussion Comments: 0
Verification: Authors have not verified information

TokenTLB: A Token-Based Page Classification Approach

Albert Esteve, Alberto Ros, Antonio Robles, María Engracia Gómez, José Duato

Discussion Comments: 0
Verification: Authors have not verified information

High Performance Design for HDFS with Byte-Addressability of NVM and RDMA

Nusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dhabaleswar K. Panda

Discussion Comments: 0
Verification: Authors have not verified information