ACM International Conference on Supercomputing (ICS), 2016

Note: A research artifact is any by-product of a research project that is not directly included in the published research paper. In computer science research this is often source code and data sets, but it could also be media, documentation, inputs to proof assistants, shell scripts to run experiments, etc.

Title	Authors	Research Artifacts	Details
Simulation and Analysis Engine for Scale-Out Workloads	Nadav Chachmon, Daniel Richins, Robert S. Cohn, Magnus Christensson, Wenzhi Cui, Vijay Janapa Reddi		Discussion Comments: 0; Verification: Authors have not verified information
Origami: Folding Warps for Energy Efficient GPUs	Mohammad Abdel-Majeed, Daniel Wong, Justin Kuang, Murali Annavaram		Discussion Comments: 0; Verification: Authors have not verified information
CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs	Mehmet E. Belviranli, Farzad Khorasani, Laxmi N. Bhuyan, Rajiv Gupta		Discussion Comments: 0; Verification: Authors have not verified information
Optimizing Sparse Matrix-Vector Multiplication for Large-Scale Data Analytics	Daniele Buono, Fabrizio Petrini, Fabio Checconi, Xing Liu, Xinyu Que, Chris Long, Tai-Ching Tuan		Discussion Comments: 0; Verification: Authors have not verified information
GCaR: Garbage Collection aware Cache Management with Improved Performance for Flash-based SSDs	Suzhen Wu, Yanping Lin, Bo Mao, Hong Jiang		Discussion Comments: 0; Verification: Authors have not verified information
GreenGear: Leveraging and Managing Server Heterogeneity for Improving Energy Efficiency in Green Data Centers	Xu Zhou, Haoran Cai, Qiang Cao, Hong Jiang, Lei Tian, Changsheng Xie		Discussion Comments: 0; Verification: Authors have not verified information
Fast Multiplication in Binary Fields on GPUs via Register Cache	Eli Ben-Sasson, Matan Hamilis, Mark Silberstein, Eran Tromer	https://github.com/HamilM/GpuBinFieldMult	Discussion Comments: 0; Verification: Authors have not verified information
TurboTiling: Leveraging Prefetching to Boost Performance of Tiled Codes	Sanyam Mehta, Rajat Garg, Nishad Trivedi, Pen-Chung Yew		Discussion Comments: 0; Verification: Authors have not verified information
Coherence-Free Multiview: Enabling Reference-Discerning Data Placement on GPU	Guoyang Chen, Xipeng Shen		Discussion Comments: 0; Verification: Authors have not verified information
DSMR: A Parallel Algorithm for Single-Source Shortest Path Problem	Saeed Maleki, Donald Nguyen, Andrew Lenharth, María Jesús Garzarán, David A. Padua, Keshav Pingali		Discussion Comments: 0; Verification: Authors have not verified information
Proteus: Exploiting Numerical Precision Variability in Deep Neural Networks	Patrick Judd, Jorge Albericio, Tayler H. Hetherington, Tor M. Aamodt, Natalie D. Enright Jerger, Andreas Moshovos		Discussion Comments: 0; Verification: Authors have not verified information
SARVAVID: A Domain Specific Language for Developing Scalable Computational Genomics Applications	Kanak Mahadik, Christopher Wright, Jinyi Zhang, Milind Kulkarni, Saurabh Bagchi, Somali Chaterji		Discussion Comments: 0; Verification: Authors have not verified information
Mini-Ckpts: Surviving OS Failures in Persistent Memory	David Fiala, Frank Mueller, Kurt B. Ferreira, Christian Engelmann	mueller@acm.org	Discussion Comments: 0; Sharing: Research produced artifacts; Verification: Authors have verified information
Barrier-Aware Warp Scheduling for Throughput Processors	Yuxi Liu, Zhibin Yu, Lieven Eeckhout, Vijay Janapa Reddi, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Cheng-Zhong Xu		Discussion Comments: 0; Verification: Authors have not verified information
Variation Among Processors Under Turbo Boost in HPC Systems	Bilge Acun, Phil Miller, Laxmikant V. Kalé		Discussion Comments: 0; Verification: Authors have not verified information
Fairness-oriented OS Scheduling Support for Multicore Systems	Changdae Kim, Jaehyuk Huh		Discussion Comments: 0; Verification: Authors have not verified information
Noise Aware Scheduling in Data Centers	Hameedah Sultan, Arpit Katiyar, Smruti R. Sarangi		Discussion Comments: 0; Verification: Authors have not verified information
Galaxyfly: A Novel Family of Flexible-Radix Low-Diameter Topologies for Large-Scales Interconnection Networks	Fei Lei, Dezun Dong, Xiangke Liao, Xing Su, Cunlu Li		Discussion Comments: 0; Verification: Authors have not verified information
Replichard: Towards Tradeoff between Consistency and Performance for Metadata	Zhiying Li, Ruini Xue, Lixiang Ao		Discussion Comments: 0; Verification: Authors have not verified information
Reusing Data Reorganization for Efficient SIMD Parallelization of Adaptive Irregular Applications	Peng Jiang, Linchuan Chen, Gagan Agrawal		Discussion Comments: 0; Verification: Authors have not verified information
Peruse and Profit: Estimating the Accelerability of Loops	Snehasish Kumar, Vijayalakshmi Srinivasan, Amirali Sharifian, Nick Sumner, Arrvindh Shriraman		Discussion Comments: 0; Sharing: Not able to share produced artifacts; Verification: Authors have verified information
Runtime-Guided Mitigation of Manufacturing Variability in Power-Constrained Multi-Socket NUMA Nodes	Dimitrios Chasapis, Marc Casas, Miquel Moretó, Martin Schulz, Eduard Ayguadé, Jesús Labarta, Mateo Valero		Discussion Comments: 0; Verification: Authors have not verified information
Exploiting Private Local Memories to Reduce the Opportunity Cost of Accelerator Integration	Emilio G. Cota, Paolo Mantovani, Luca P. Carloni		Discussion Comments: 0; Verification: Authors have not verified information
Towards an Adaptive Multi-Power-Source Datacenter	Longjun Liu, Hongbin Sun, Chao Li, Yang Hu, Nanning Zheng, Tao Li		Discussion Comments: 0; Verification: Authors have not verified information
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing	Linnan Wang, Wei Wu, Zenglin Xu, Jianxiong Xiao, Yi Yang	https://github.com/linnanwang/BLASX	Discussion Comments: 0; Verification: Authors have not verified information
SReplay: Deterministic Sub-Group Replay for One-Sided Communication	Xuehai Qian, Koushik Sen, Paul Hargrove, Costin Iancu		Discussion Comments: 0; Verification: Authors have not verified information
Tag-Split Cache for Efficient GPGPU Cache Utilization	Lingda Li, Ari B. Hayes, Shuaiwen Leon Song, Eddy Z. Zhang		Discussion Comments: 0; Verification: Authors have not verified information
Efficient Timestamp-Based Cache Coherence Protocol for Many-Core Architectures	Yuan Yao, Guanhua Wang, Zhiguo Ge, Tulika Mitra, Wenzhi Chen, Naxin Zhang		Discussion Comments: 0; Verification: Authors have not verified information
Exploiting Dynamic Reuse Probability to Manage Shared Last-level Caches in CPU-GPU Heterogeneous Processors	Siddharth Rai, Mainak Chaudhuri		Discussion Comments: 0; Verification: Authors have not verified information
Polly-ACC: Transparent compilation to heterogeneous hardware	Tobias Grosser, Torsten Hoefler		Discussion Comments: 0; Verification: Authors have not verified information
Write-Aware Management of NVM-based Memory Extensions	Amro Awad, Sergey Blagodurov, Yan Solihin		Discussion Comments: 0; Verification: Authors have not verified information
SFU-Driven Transparent Approximation Acceleration on GPUs	Ang Li, Shuaiwen Leon Song, Mark Wijtvliet, Akash Kumar, Henk Corporaal		Discussion Comments: 0; Verification: Authors have not verified information
Prefetching Techniques for Near-memory Throughput Processors	Reena Panda, Yasuko Eckert, Nuwan Jayasena, Onur Kayiran, Michael Boyer, Lizy Kurian John		Discussion Comments: 0; Verification: Authors have not verified information
Balanced Hashing and Efficient GPU Sparse General Matrix-Matrix Multiplication	Pham Nguyen Quang Anh, Rui Fan, Yonggang Wen		Discussion Comments: 0; Verification: Authors have not verified information
Graph Prefetching Using Data Structure Knowledge	Sam Ainsworth, Timothy M. Jones	https://www.repository.cam.ac.uk/handle/1810/254642	Discussion Comments: 0; Sharing: Research produced artifacts; Verification: Authors have verified information
Parallel Transposition of Sparse Data Structures	Hao Wang, Weifeng Liu, Kaixi Hou, Wu-chun Feng		Discussion Comments: 0; Verification: Authors have not verified information
HOPE: Enabling Efficient Service Orchestration in Software-Defined Data Centers	Yang Hu, Chao Li, Longjun Liu, Tao Li		Discussion Comments: 0; Verification: Authors have not verified information
Lynx: Using OS and Hardware Support for Fast Fine-Grained Inter-Core Communication	Konstantina Mitropoulou, Vasileios Porpodas, Xiaochun Zhang, Timothy M. Jones	https://www.repository.cam.ac.uk/handle/1810/254651	Discussion Comments: 0; Sharing: Research produced artifacts; Verification: Authors have verified information
AEQUITAS: Coordinated Energy Management Across Parallel Applications	Haris Ribic, Yu David Liu		Discussion Comments: 0; Verification: Authors have not verified information
Hybrid CPU-GPU scheduling and execution of tree traversals	Jianqiao Liu, Nikhil Hegde, Milind Kulkarni		Discussion Comments: 0; Verification: Authors have not verified information
Scheduling Tasks with Mixed Timing Constraints in GPU-Powered Real-Time Systems	Yunlong Xu, Rui Wang, Tao Li, Mingcong Song, Lan Gao, Zhongzhi Luan, Depei Qian		Discussion Comments: 0; Verification: Authors have not verified information
TokenTLB: A Token-Based Page Classification Approach	Albert Esteve, Alberto Ros, Antonio Robles, María Engracia Gómez, José Duato		Discussion Comments: 0; Verification: Authors have not verified information
High Performance Design for HDFS with Byte-Addressability of NVM and RDMA	Nusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dhabaleswar K. Panda		Discussion Comments: 0; Verification: Authors have not verified information
