NSF CSR PI meeting 2017 in Orlando, FL (colocated with IPDPS)

Overview

The PI meeting will be a one-day event on Friday, June 2nd, 2017, in Orlando, Florida, co-located with the IPDPS conference. The meeting is held administratively as an IPDPS workshop, so attendees will need to register for the workshops in order to attend.

PIs should bring posters to foster discussion about their work. The meeting provides an opportunity for award PIs to inspire the community with their ideas and new developments. It also provides a platform for NSF personnel to present the current CSR program, share their vision of where the program is going, and hear ideas for future program changes from the CSR community. Time will be allotted for all participants to interact, share results and ideas, and foster collaboration.

Program

Abstracts

Mosharaf Chowdhury, “Rack-Scale Memory Disaggregation”

Memory-intensive applications suffer large performance losses when their working sets do not fully fit in memory. Yet they cannot leverage otherwise unused remote memory, paging out to disk instead, even in the presence of large imbalances in memory utilization across a cluster. Existing proposals for memory disaggregation call for new architectures, new hardware designs, and/or new programming models, making them infeasible to deploy today. In this talk, I'll describe the design and implementation of Infiniswap, which opportunistically harvests and transparently exposes unused memory to unmodified applications. We have implemented and deployed Infiniswap on an RDMA cluster without any modifications to user applications or the OS, and evaluated its effectiveness using multiple workloads running on unmodified VoltDB, Memcached, PowerGraph, GraphX, and Apache Spark. Using Infiniswap, throughputs of these applications improve between 4X and 15.4X over disk (0.94X to 7.8X over Mellanox nbdX), and median and tail latencies improve between 5.4X and 61X (2X to 2.3X over nbdX). Infiniswap achieves this with negligible remote CPU usage, increases the overall memory utilization of a cluster, and works well at scale.
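
The core idea of the abstract — falling back to a remote machine's unused memory before touching disk — can be illustrated with a toy pager. Everything below (the class names, the fixed slab pool) is invented for illustration; the real Infiniswap operates at the block-device layer over RDMA, beneath unmodified applications.

```python
# Toy sketch of opportunistic remote-memory paging (not the real Infiniswap).
# Evicted pages go to a remote peer's free memory when slabs are available,
# and only spill to the (slow) disk path when the remote pool is exhausted.

class RemotePool:
    """Models unused memory harvested from a remote machine."""
    def __init__(self, free_slabs):
        self.free_slabs = free_slabs
        self.store = {}

    def try_put(self, page_id, data):
        if self.free_slabs == 0:
            return False          # remote memory exhausted
        self.free_slabs -= 1
        self.store[page_id] = data
        return True

    def get(self, page_id):
        return self.store.get(page_id)

class Pager:
    def __init__(self, remote):
        self.remote = remote
        self.disk = {}            # slow fallback path

    def evict(self, page_id, data):
        if self.remote.try_put(page_id, data):
            return "remote"       # fast RDMA-like path
        self.disk[page_id] = data
        return "disk"

    def fetch(self, page_id):
        data = self.remote.get(page_id)
        return data if data is not None else self.disk.get(page_id)

pager = Pager(RemotePool(free_slabs=2))
print(pager.evict("p1", b"a"))  # remote
print(pager.evict("p2", b"b"))  # remote
print(pager.evict("p3", b"c"))  # disk (pool exhausted)
print(pager.fetch("p3"))        # b'c'
```

The transparency claim corresponds to the fact that callers of `evict` and `fetch` never see which path served them.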

David Kotz, “Computational Jewelry for Mobile Health”

Wearable technology enables a range of exciting new applications in health, commerce, and beyond. For many important applications, wearables must have battery life measured in weeks or months, not hours or days as in most current devices. Our vision of wearable platforms aims for long battery life together with the flexibility and security to support multiple applications. To achieve long battery life with a workload comprising apps from multiple developers, these platforms must have robust mechanisms for app isolation and developer tools for optimizing resource usage.

We introduce the Amulet Platform for constrained wearable devices, which includes an ultra-low-power hardware architecture and a companion software framework, including a highly efficient event-driven programming model, low-power operating system, and developer tools for profiling ultra-low-power applications at compile time. We present the design and evaluation of our prototype Amulet hardware and software, and show how the framework enables developers to write energy-efficient applications. Our prototype has battery lifetime lasting weeks or even months, depending on the application, and our interactive resource-profiling tool predicts battery lifetime within 6-10% of the measured lifetime.
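
A highly simplified sketch of an event-driven application model like the one described above: handlers run only in response to events, so the device can sleep between them. The decorator, event names, and queue-based loop below are illustrative, not the Amulet API.

```python
from collections import deque

# Minimal event-driven runtime sketch: apps register handlers for named
# events, and a dispatch loop delivers queued events to them.
handlers = {}

def on(event):
    """Register a handler for a named event (toy decorator)."""
    def register(fn):
        handlers.setdefault(event, []).append(fn)
        return fn
    return register

steps = 0

@on("accel_sample")
def count_step(magnitude):
    global steps
    if magnitude > 1.5:           # crude step-detection threshold
        steps += 1

def run(queue):
    while queue:
        event, payload = queue.popleft()
        for fn in handlers.get(event, []):
            fn(payload)
        # a real ultra-low-power runtime would sleep here until the
        # next hardware interrupt enqueues an event

run(deque([("accel_sample", 2.0), ("accel_sample", 0.3), ("accel_sample", 1.8)]))
print(steps)  # 2
```

Because every handler is short and runs to completion, a compile-time profiler can, in principle, bound the energy cost of each event — which is the property the resource-profiling tool exploits.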

Kirk Cameron, “VarSys: Managing Variability in High-Performance Computing Systems”

From the smallest mobile system in a pocket to the largest and fastest supercomputers in datacenters around the world, useful computing requires unrelenting advances in systems software design. These advances make computers faster, more reliable, more secure, better able to analyze large data sets, and ultimately essential to the lives of nearly everyone on the planet. Variability can wreak havoc on the performance of large-scale computer systems that support high-performance computing and e-commerce. In high-performance computing, variability threatens U.S. competitiveness and our ability to achieve exascale performance within the cost and energy constraints of supercomputers. In e-commerce (e.g., Amazon and Wall Street trading), variability threatens profit margins by requiring greater capital expenditures to compensate for potential swings in datacenter and cloud performance. This project develops first-principles techniques within the VarSys software framework to identify and control aspects of variability, improving the design and operational efficiency of both high-performance and cloud systems. The resulting framework is integrated into undergraduate systems courses, where students perform variability experiments and provide data and insights in service of the research goals. Early results indicate that the challenges of variability require, and await, a broad community of researchers from nearly every subdomain of computer science.

Wei Gao, “Designing Hierarchical Edge Cloud for Mobile Computing”

Cloud computing can be leveraged to bridge the gap between the increasing complexity of mobile applications and the limited capabilities of mobile devices, but traditional cloud computing based on data centers is incapable of efficiently executing mobile applications due to the excessive network latency of accessing data centers and the significant overhead of provisioning and managing Virtual Machines (VMs). Existing edge cloud solutions reduce the network latency of accessing data centers by deploying servers at the network edge, but cannot handle peak loads from mobile users that exceed the capacity of individual servers. The goal of this research is to design the edge cloud as a tree hierarchy of geo-distributed servers, so as to efficiently handle peak loads and satisfy the performance requirements of remote program execution.
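
The tree-hierarchy idea can be sketched as load overflow: each edge server handles requests up to its capacity and forwards the excess to its parent, so no single leaf server has to absorb a peak alone. The class, node names, and capacities below are hypothetical, not the project's design.

```python
class EdgeServer:
    """One node in a tree-structured edge cloud (toy model)."""
    def __init__(self, name, capacity, parent=None):
        self.name, self.capacity, self.parent = name, capacity, parent
        self.load = 0

    def offload(self, requests):
        """Accept up to `capacity` requests; push the overflow upward."""
        taken = min(requests, self.capacity - self.load)
        self.load += taken
        overflow = requests - taken
        if overflow and self.parent:
            self.parent.offload(overflow)   # deeper tier absorbs the peak

# A two-level hierarchy: a small curbside server under a larger regional one.
root = EdgeServer("regional", capacity=100)
leaf = EdgeServer("curbside", capacity=10, parent=root)

leaf.offload(25)                 # peak exceeds the leaf's capacity
print(leaf.load, root.load)      # 10 15
```

The trade-off the project studies falls out of this shape: requests served at the leaf see the lowest latency, while overflow absorbed higher in the tree pays extra network hops in exchange for capacity.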

Avinash Kodi, “Reconfigurable Photonic Network-on-Chip for Heterogeneous Multicores”

As the relentless quest for higher throughput and lower energy cost continues in heterogeneous multicores, there is a strong demand for energy-efficient and high-performance Network-on-Chip (NoC) architectures. Heterogeneous architectures that can simultaneously utilize both the serialized nature of the CPU and the thread-level parallelism of the GPU are gaining traction in industry. A critical issue with heterogeneous architectures is finding an optimal way to utilize shared resources such as the last-level cache and the NoC without hindering the performance of either the CPU or the GPU cores. Photonic interconnects are a disruptive technology with the potential to increase bandwidth, reduce latency, and improve energy efficiency over traditional metallic interconnects. In this poster/talk, we propose a CPU-GPU heterogeneous architecture called SHARP (Shared Heterogeneous Architecture with Reconfigurable Photonic Network-on-Chip) that clusters CPU and GPU cores around the same router and dynamically allocates bandwidth between the CPU and GPU cores based on application demands. The SHARP architecture is designed as a Single-Writer Multiple-Reader (SWMR) crossbar with reservation-assist to connect CPU/GPU cores, and it dynamically reallocates bandwidth using buffer utilization information at runtime. As network traffic exhibits temporal and spatial fluctuations due to application behavior, SHARP can dynamically reallocate bandwidth and thereby adapt to application demands. SHARP demonstrates a 34% performance (throughput) improvement over a baseline electrical CMESH while consuming 25% less energy per bit. Simulation results also show a 6.9% to 14.9% performance improvement over flavors of the proposed SHARP architecture without dynamic bandwidth allocation.
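
The reallocation policy — shifting link bandwidth toward whichever side (CPU or GPU) shows higher buffer occupancy — can be sketched in a few lines. The proportional rule, the floor parameter, and the numbers are illustrative only; the actual SHARP policy and its SWMR photonic implementation are more involved.

```python
def reallocate(total_bw, cpu_buf, gpu_buf, floor=0.1):
    """Split `total_bw` proportionally to buffer occupancy,
    guaranteeing each side at least a `floor` fraction of the link."""
    occupancy = cpu_buf + gpu_buf
    if occupancy == 0:
        return total_bw / 2, total_bw / 2          # idle: split evenly
    cpu_share = max(floor, min(1 - floor, cpu_buf / occupancy))
    return total_bw * cpu_share, total_bw * (1 - cpu_share)

# CPU buffers are 3x fuller than GPU buffers, so the CPU side gets 3/4 of
# the 64-unit link; an idle network splits it evenly.
print(reallocate(64, cpu_buf=30, gpu_buf=10))  # (48.0, 16.0)
print(reallocate(64, cpu_buf=0, gpu_buf=0))    # (32.0, 32.0)
```

The floor keeps either core type from being starved during a burst from the other — the kind of interference the abstract identifies as the critical issue in shared CPU/GPU resources.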

Abhishek Chandra, “Location, location, location (L3): Support for Geo-Centric Applications”

Recent years have seen the proliferation of a variety of sensors embedded in different environments, and the increasing availability of smart wearable devices. Together these trends have resulted in the growth of sensor data of interest to many communities across social, economic, health-care, and scientific domains. This has led to the emergence of geo-centric applications: a class of applications that can process and extract rich information from sensor data to provide novel services to users. However, these applications currently suffer from poor performance and failures due to the limited computing and storage resources available on the devices and their location dependency. This project will develop new computing abstractions, algorithms, and systems that enable a new frontier of geo-centric applications. The goal is to catalyze the role of computer systems in meeting the needs of emerging geo-centric applications in the mobile, sensor, and Internet-of-Things (IoT) areas. The project will build a system called L3 (Location, location, location: Support for Geo-Centric Applications) and develop a number of novel system and application abstractions to manage the dynamism that arises from location. First is the concept of a Resource cloud, a system-facing abstraction that is geo-aware and manages a set of changing resources based on publish-subscribe and matchmaking. Second is the concept of a Resource container, an application-facing abstraction that provides policy-based resource selection and allocation across a diverse set of resources, including storage, computation, and even data sources, to meet the specific requirements of an application.
The project will address specific research problems that arise in the design and implementation of the Resource cloud and Resource container, including: on-demand resource provisioning to the Resource cloud, collective matching of resource requests that scales to diverse resource types and to highly shared resources, and automated resource-policy generation and optimization based on application requirements.
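
The Resource cloud's publish-subscribe matchmaking can be sketched as matching published resource offers against application requests by attribute. The attribute names, the sample offers, and the "numeric means minimum, string means exact" rule are all hypothetical, chosen only to make the idea concrete.

```python
# Toy matchmaking: resources publish attribute dicts; applications request
# resources with per-attribute requirements (numeric = minimum needed,
# string = exact match).
offers = [
    {"id": "cam-3",  "type": "camera",  "fps": 30},
    {"id": "edge-1", "type": "compute", "cores": 4},
    {"id": "edge-2", "type": "compute", "cores": 16},
]

def satisfies(offer, request):
    for key, need in request.items():
        have = offer.get(key)
        if isinstance(need, (int, float)):
            if have is None or have < need:
                return False     # missing or below the required minimum
        elif have != need:
            return False         # exact-match attribute differs
    return True

def match(offers, request):
    """Return the ids of all published offers satisfying a request."""
    return [o["id"] for o in offers if satisfies(o, request)]

print(match(offers, {"type": "compute", "cores": 8}))  # ['edge-2']
print(match(offers, {"fps": 20}))                      # ['cam-3']
```

A geo-aware Resource cloud would additionally treat location as a first-class attribute and re-run matching as resources come and go, which is what makes the collective, scalable matching described above a research problem.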

Lucy Dunne, “One Shirt to Rule Them All: Pursuing the Vision of a Garment-Based Wearable Technology Platform”

The second wave of wearable technology hype saw a dramatic increase in commercially available consumer devices. However, this advancement came at a cost to the scope of applications, and the limitations of wristbands and activity trackers are becoming ever more evident to consumers. From an industry perspective, durability, scalability, and interaction constraints play a large part in the decision to develop discrete, stand-alone devices fabricated from hard goods. We see an urgent need for scalable manufacturing of textile and garment-based wearables, and an opportunity to develop a unified "platform" garment that could support a wide range of technology applications. But what would such a garment do, how would it look, and how would it be made? Our work pursues two thrusts: 1) an effort to map the application space of wearable technology in order to derive the requisite technological functionality needed in a platform; and 2) an effort to develop manufacturing methods for distributing and embedding electronic functionality into garments while leveraging the infrastructure and techniques of the cut-and-sewn apparel industry. In short, we seek to develop a preliminary proof-of-concept based on a standardized garment (currently: a button-up dress shirt), manufacturable in an apparel factory, that embeds versatile functionality.

Kate Keahey, “A Large Scale Reconfigurable Instrument for Computer Science Experimentation”

Computer Science experiments require a platform that is deeply reconfigurable, offers up-to-date resources, and provides sufficient scale. This talk will describe the Chameleon testbed, consisting of ~15,000 cores and 5PB of total storage distributed between two sites (UC and TACC) connected by a 100 Gbps network. The testbed contains a large homogeneous partition to support experiments at scale, as well as heterogeneous elements including InfiniBand networking, high-bandwidth I/O nodes, storage hierarchy nodes, multiple types of GPUs, FPGAs, and clusters of ARM and Atom processors. To support Computer Science experiments, ranging from operating system and virtualization research to innovative applications, Chameleon provides a configuration system giving users full control of the software stack: provisioning of bare metal, support for custom kernel reboot, and console access — but also a fully functioning cloud environment to support educational projects and cloud development. This talk will describe the capabilities of the system and give examples of a few research projects currently leveraging them.

Posters

Frequently Asked Questions

Do I need to register for IPDPS to attend the PI meeting?
Yes, for the most part: you only need to be registered for the Friday workshop day, when the meeting takes place. You can register through the IPDPS registration page.
The IPDPS hotel is full! Which hotel should I stay at?
There are a few hotels around the workshop venue. Some have shuttles to the venue. See information at the bottom of IPDPS' hotel page.
What should I do if my poster does not appear in the list?
Send it to Erik Saule by email. (esaule@uncc.edu)
I have been asked to prepare a poster, how big should it be?
The poster boards will be 4'x8', so anything smaller than that will fit.
I am attending the IPDPS workshops; can I come see the wonderful work done by NSF CSR?
Sure! Feel free to stop by and do not miss the poster session!
How do I get from the airport to the resort hotels?
You can take a cab or Uber, but MEARS runs shuttles from the airport to the hotels around IPDPS; they are much cheaper and take about the same time.
Where is the event precisely?
The event is in the conference center of the Hilton Buena Vista Palace Hotel, on the conference floor. The plenary sessions are in the Citron Center; the poster and breakout sessions will be in the Event Center.

Contact

The PI meeting is organized by Erik Saule and Geoffrey Brown.

For any information regarding the NSF PI meeting, contact Erik Saule by email (esaule@uncc.edu).

The PI meeting is supported by NSF under award CNS-1740398.