Research

My research interests are in the areas of real-time/embedded systems, cyber-physical systems, operating systems and Computer Science education.

Real-Time / Embedded Systems

An embedded system is a combination of computer hardware and software that is designed to achieve specific functionality. Most embedded systems contain tasks or programs that have both logical and temporal constraints. Embedded systems with temporal constraints are referred to as real-time embedded systems, or simply real-time systems. Real-time systems may be broadly classified into two types, namely hard-real-time systems and soft-real-time systems. Hard-real-time systems are those in which tasks must strictly adhere to deadlines and failure to meet deadlines have severe consequences (e.g., aircraft control systems, emergency life support systems, etc). On the other hand, soft-real-time systems are those in which tasks may occasionally miss deadlines and missed deadlines result in decreased Quality-of-Service or QoS (e.g., multimedia applications, game stations and video streaming applications). An embedded system that contains hard-real-time, soft-real-time and non-real-time tasks is referred to as a mixed-criticality system. Real-time systems must ensure robustness, safety and determinism. Failing to meet any one of these requirements may result in undesirable (in soft-real-time systems) and/or severe (in hard-real-time and mixed-criticality systems) consequences. To ensure determinism and schedulability, real-time systems require a-priori knowledge of the worst-case execution times (WCETs) of all tasks.

Static Timing Analysis

Static timing analysis is considered a viable approach for calculating upper bounds on the WCET of a task. Static timing analysis models the architectural components of a computing system in order to obtain the time taken by the longest path identified in a given task. Architectural features such as out-of-order pipelines and caches cause unpredictability in timing analysis, leading to overly pessimistic estimates of the WCET of a task. One such feature that is particularly hard to model is the data cache.

Single Task Analysis

In this work, a data cache analyzer framework known as the Cache Miss Equations framework was enhanced. The new framework provides three fundamental enhancements over the original one and thus relaxes certain constraints that the original framework imposed on tasks.

Forced loop fusion is employed to concatenate loop bodies of sequential loop nests in order to build one single loop nest that may then be analyzed by the framework. This technique also allows non-rectangular loops in tasks.
Certain types of data dependent conditionals (those with only if-then clauses, but no else clauses) are permitted. An upper bound on the number of data cache misses is provided in such a case.
While the original framework gave an upper bound on the number of data cache misses incurred by a task, the new one produces reference patterns that identify the positions of each of these misses as well.

This data cache analysis framework was integrated into an existing static timing analyzer framework that calculates the WCET of a task.

[Publications: RTAS 2005]

Preemptive systems

Real-time systems typically have multiple tasks, each with a unique priority, executing in a preemptive environment. In analyzing such systems, three important questions are answered.

What is the maximum number of preemptions a task may undergo? By considering ranges of execution times between the best and the worst-case execution times, infeasible preemption points are eliminated and realistic worst-case scenarios are constructed.
Where would each preemption be placed in the iteration space of the preempted task? A range of iteration points where a task might be executing when it is preempted is identified.
What is the additional delay incurred by the preempted task due to each of these preemptions? The delay at each point is calculated using access chains (reuse chains) and the maximum delay among them is considered to be the preemption delay at the given point.

[Publications: RTAS 2006, RTSS 2006, TECS 2010]

Tasks with Critical Sections

Some real-time tasks may have regions where they use shared resources that are non-premptable. Traditionally, such tasks have been scheduled completely non-preemptively to simplify timing analysis. This suffers from the downside of decreased schedulability. A methodology to make only critical sections non-preemptive regions (NPRs), but keep legal preemptions was proposed and implemented. This framework was enhanced to support resource-sharing policies such as the Priority Inheritance Protocol (PIP) instead of making critical sections non-preemptive. This framework may be used as a means to identify what method is suitable for what sets of tasks.

[Publications: RTAS 2008, RTAS 2009]

Aperiodic Job Handling in Cache-Based Real-Time Systems

While most of the normal operation in a real-time system is modeled using time-driven, hard-deadline sporadic tasks, event-driven behavior is modeled using aperiodic jobs with soft or no deadlines. To provide good Quality-of-Service for aperiodic jobs in the presence of sporadic tasks, aperiodic servers were introduced. Aperiodic servers act as a sporadic task and reserve a quota periodically to serve aperiodic jobs. The use of aperiodic servers in systems with caches is unsafe because aperiodic servers do not take into account, the indirect cache-related preemption delays that the execution of aperiodic jobs might impose on the lower-priority sporadic tasks, thus jeopardizing their safety. To solve this problem, we propose an enhancement to the aperiodic server that we call a Cache Delay Server. Here, each lower-priority sporadic task is assigned a delay quota to accommodate the cache-related preemption delay imposed by the execution of aperiodic jobs. Aperiodic jobs are allowed to execute at their assigned server priority only when all the active lower-priority sporadic tasks have a sufficient delay quota to accommodate it. Simulation results demonstrate that a Cache Delay Server ensures the safety of sporadic tasks while providing acceptable Quality-of-Service for aperiodic jobs.

[Publications: RTCSA 2013]

Real-Time Support for Multi-Core Architectures

Single core processor designs have reached a clock frequency wall due to area and power considerations, leading to designs with multiple processors on a single chip, known as chip multiprocessors or simply multicore processors. As real-time embedded systems integrate more and more functionality, they are demanding increasing amounts of computational power that can only be met by deploying multicore architectures. The use of multicore architectures with on-chip memory hierarchies and shared communication infrastructure in the context of real-time systems poses several research challenges. In general, there are two broad classes of multi-core real-time scheduling algorithms, namely global and partitioned approaches. A global policy assumes that tasks may be migrated among cores while a partitioned policy allocates tasks to cores statically. Semi-partitioned scheduling strategies form a middle ground between these two approaches. By making most tasks non-migrating (partitioned), runtime migration overhead is minimized. On the other hand, by allowing some tasks to migrate among cores, schedulability of task sets may be improved.

Mechanisms for Task Migration on Multi-Core Architectures

Migration of tasks among cores (necessary in global and semi-partitioned scheduling) reduces time predictability of tasks due to cache warm-up overheads while increasing traffic on the Network-on-Chip (NoC) interconnect. In order to improve time predictability of real-time tasks in the presence of task migrations, novel micro-architectural migration mechanisms with bounded migration overheads are proposed.

[Publications: LCTES 2009, LCTES 2011, CASES 2012, TECS 2014]

Policies for Task Migration on Multi-Core Architectures

A semi-partitioned policy for predictably scheduling periodic hard-real-time tasks using locked cache migration on a Network-on-Chip (NoC) based multicore architecture is proposed. As many tasks as possible are statically partitioned onto cores. The remaining tasks are allowed to migrate in a predetermined manner among a preselected subset of cores to improve task set schedulability. A dynamic-priority policy for scheduling memory accesses on the NoC and a location-aware partitioning policy that takes explicit advantage of this NoC scheduling policy to improve the efficiency of task allocation.

[Publications: ECRTS 2012, SIES 2014, RTS 2015]

Cyber-Physical Systems

A Cyber-Physical System (CPS) is composed of distributed computational elements, connected via a computer network, that monitor and control switched physical systems interconnected by physical infrastructures. A fundamental challenge in the design and analysis of a CPS is the lack of common semantics across the components. This challenge is addressed by employing a novel approach that composes the correctness of the components instead of their functionality using a conjunction of non-interfering logical invariants.

[Publications: ICCPS 2013, COMPSAC 2013, SEAA 2014]

Mixed-Criticality Systems

Embedded real-time applications such as Avionics and Automotive systems are increasingly using multi-core platforms. Such systems have strict certification requirements of subsets of their functionality, which result in strict temporal constraints on those subsets, while other subsets may have less strict requirements. Migrating such mixed criticality systems from single-core to multi-core platforms is challenging because application/component isolation and freedom from interference among them must be guaranteed. Safe and efficient, architecture-aware mapping and scheduling of system components (e.g., partitions, tasks, etc. as relevant to a particular domain) on the multiple cores is at the center of any scheme to migrate such systems from single-core to multi-core platforms. We propose, develop and evaluate a unified framework to automate the mapping and scheduling process with the consideration of several architectural and application level requirements/constraints (e.g., communication and cache conflicts among system components, constraints prohibiting the allocation of certain system components on the same core, etc.)

[Publications: RTNS 2018]

Real-Time Execution of Graphics Processing Units

Graphics Processing Units (GPUs) are computational powerhouses that were originally designed for accelerating graphics applications. In recent years, there has been a tremendous increase in support for general purpose computing on GPUs (GPGPU). GPU based architectures provide unprecedented magnitudes of computation at a fraction of the power used by traditional CPU based architectures. As real-time systems integrate more and more functionality, GPU based architectures are very attractive for their deployment. However, in a real-time system, predictability and meeting temporal requirements are much more important than raw performance. While some realtime jobs may benefit from the performance that all cores of the GPU can provide, most jobs may require only a subset of cores in order to successfully meet their temporal requirements. We propose to study concurrent scheduling of soft-real-time jobs on a GPU based platform.

[Publications: RTAS 2014]

Teaching

My teaching interests are in the areas of operating systems, computer networks, real-time systems, compiler construction, object-oriented programming, computer architecture, software systems design and implementation and database systems.

Teaching Methodology

I strongly believe that active learning improves student engagement and student learning. As such, I have adopted numerous active learning techniques in all my classes, resulting in fully flipped versions of the classes. I have worked on and continue to work on significantly adapting existing techniques to my own classroom setting and to the unique context of students here at UNC Charlotte.

Below are the key aspects of engaged pedagogy and active learning that I have incorporated into my teaching:

Pre-class lecture videos: Due to the passive nature of listening to someone else talk, students often zone-out during long in-class lectures. Since there is no rewind in real life, this means that they may not have a good learning experience. Lecture videos allow students to go over lecture material at their own pace, make notes, go back & forth as needed, etc. I have created my own lecture videos for all my classes. This helps me provide content specifically relevant to my classes and lends the course material a personal touch. I strive to keep my lecture videos short (on an average, 10-15 minutes long) and to the point.

Forcing functions to assess preparation: Experience has taught me that simply flipping a class does not automatically guarantee success, especially since students may not (yet) be receptive to taking increased, proactive responsibility for their learning. It is essential to ensure that students complete pre-class preparation (e.g., watching lecture videos, reading textbook, etc.) by using forcing functions. In collaboration with two other colleagues, I have extensively researched and tried different formats of forcing functions. In-class observations suggest that there is a significant increase in the level of student preparation because of our forcing functions.

Interactive quizzes: Even though students are expected to be prepared for class, jumping directly into application-oriented activities can leave students at a loss. I use interactive tools such as Clickers/Poll Everywhere for in-class quizzes at the beginning of class. Students discuss concepts they learned through pre-class lecture material in groups and answer quiz questions. The interactive nature of these quizzes provides a great way to initiate discussions.

In-class activities: I have designed numerous in-class activities that give students hands-on application-oriented experience on relevant course concepts. Students work on these activities in lightweight teams [Latulipe et al., SIGCSE 2015].

Low-stakes tests: Research has shown that having one or two comprehensive, high-stakes tests can result in significant stress and grade pressure for students. To alleviate this, all my courses have 3-4 tests with lower individual stakes.

Professional skills development: Apart from learning technical content, it is vital that students learn team-work/collaboration skills, social skills of engagement with their peers, timeliness, etc. In all my classes, a small portion of the overall course grade is associated with the demonstration of these professional skills.

Recent courses

Software Systems Design and Implementation

Database Systems

Operating Systems and Networks

Real-Time Embedded Systems

Publications

Thesis

H. Ramaprasad. Analytically Bounding Data Cache Behavior for Real-Time Systems. PhD Thesis, North Carolina State University, July 2008.

Refereed journal publications

[RTS 2015] M. Shekhar, H. Ramaprasad, A. Sarkar, F. Mueller. Architecture Aware Semi Partitioned Real-Time Scheduling on Multicore Platforms. In Springer Real-Time Systems Journal (RTS), February 2015, accepted.
[TECS 2014] A. Sarkar, F. Mueller and H. Ramaprasad. Static Task Partitioning for Locked Caches in Multi-Core Real-Time Systems. In Transactions on Embedded Computing Systems (TECS), Volume 5, Number 1, June 2014, accepted.
[TECS 2010] H. Ramaprasad and F. Mueller. Tightening the Bounds on Feasible Preemptions. In Transactions on Embedded Computing Systems (TECS), Volume 10, Number 2, Article 27, 2010.

Refereed conference publications

[RTNS 2018] A. Vasu, H. Ramaprasad. “Architecture-Aware Mapping and Scheduling of IMA partitions on Multicore platforms” In Proc. of the International Conference on Real-Time Networks and Systems (RTNS), 2018.
[MCSoC 2017] S. Dutta, D. Kagaris and H. Ramaprasad. “Hypervisor-Induced Negative Interference in Virtualized Multi-core Platforms: The P4080 Case” In Proc. of the International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC), 2017.
[CASES 2014] B. Holton, A. Shrivastava, K. Bai and H. Ramaprasad. GCCFG: A New Graphical Representation for Inter-procedural Optimization for Software Managed Manycore (SMM) Architectures. In Proc. of the International Conference on Compilers, Architectures and Synthesis of Embedded Systems (CASES), 2014.
[SEAA 2014] A. Choudhari, H. Ramaprasad, S. Chellappan, B. McMillin, J. Kimball, M. Zawodniok. Adaptive Scheduling with Explicit Congestion Notification in a Cyber-Physical Smart Grid System In Proc. of the Euromicro Conference series on Software Engineering and Advanced Applications (SEAA), 2014.
[SIES 2014] M. Shekhar, H. Ramaprasad, F. Mueller. Network-on-Chip Aware Scheduling of Hard-Real-Time Tasks. In Proc. of the International Symposium on Industrial Embedded Systems (SIES), 2014.
[RTCSA 2013] S. Motakpalli, V.P. Jain, H. Ramaprasad. Aperiodic Job Handling in Cache-Based Real-Time Systems. In Proc. of the International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA), 2013.
[COMPSAC 2013] A. Choudhari, H. Ramaprasad, T. Paul, J. Kimball, M. Zawodniok, B. McMillin, S. Chellappan. Stability of a Cyber-Physical Smart Grid System using Cooperating Invariants. Full paper in Proc. of the International Computers, Software and Applications Conference (COMPSAC), 2013.
O. Acevedo, D. Kagaris, K. Poluri, H. Ramaprasad and S. Warner. Towards Optimal Design of Avionics Networking Infrastructures. In Proc. of the Digital Avionics Systems Conference (DASC), 2012.
[CASES 2012] A. Sarkar, F. Mueller and H. Ramaprasad. Static Task Partitioning for Locked Caches in Multi-Core Real-Time Systems. To appear in Proc. of the Conference on Compiler, Architecture and Synthesis for Embedded Systems (CASES), 2012.
[ECRTS 2012] M. Shekhar, A. Sarkar, H. Ramaprasad and F. Mueller. Semi-Partitioned Hard-Real-Time Scheduling Under Locked Cache Migration in Multicore Systems. In Proc. of the Euromicro Conference on Real-Time Systems (ECRTS), 2012.
[LCTES 2011] A. Sarkar, F. Mueller and H. Ramaprasad. Predictable Task Migration for Locked Caches in Multi-Core Systems. In Proc. of the ACM SIGPLAN Conference on Languages, Compilers and Tools for Embedded Systems (LCTES), 2011.
[LCTES 2009] A. Sarkar, F. Mueller, H. Ramaprasad and S. Mohan. Push-Assisted Migration of Real-Time Tasks in Multi-Core Processors. In Proc. of the ACM SIGPLAN Conference on Languages, Compilers and Tools for Embedded Systems (LCTES), 2009.
[RTAS 2009] H. Ramaprasad and F. Mueller. Bounding Worst-Case Response Time for Tasks under PIP. In Proc. of the Real-Time and Embedded Technology and Applications Symposium (RTAS), 2009.
[RTAS 2008] H. Ramaprasad and F. Mueller. Bounding Worst-Case Response Time for Tasks With Non-Preemptive Regions. In Proc. of the Real-Time and Embedded Technology and Applications Symposium (RTAS), 2008.
[RTSS 2006] H. Ramaprasad and F. Mueller. Tightening the Bounds on Feasible Preemption Points. In Proc. of the Real-Time Systems Symposium (RTSS), December 2006, pages 212-222.
[RTAS 2006] H. Ramaprasad and F. Mueller. Bounding Preemption Delay within Data Cache Reference Patterns for Real-Time Tasks. In Proc. of the Real-Time and Embedded Technology and Applications Symposium (RTAS), April 2006, pages 71-80.
[RTAS 2005] H. Ramaprasad and F. Mueller. Bounding Worst-Case Data Cache Behavior by Analytically Deriving Cache Reference Patterns. In Proc. of the Real-Time and Embedded Technology and Applications Symposium (RTAS), March 2005, pages 148-157.

Refereed posters / work-in-progress papers

[SIGCSE 2018] S. Watson, J. Bahamón, H. Ramaprasad, H. Lipford. Developing Soft Skills with a Classroom Behavior Management Game. Presented at the ACM Technical Symposium on Computer Science Education (SIGCSE), February 2018.
[RTAS 2014] K. N. Gowda, H. Ramaprasad. Concurrent Soft-real-time Execution on GPUs. In Work-in-Progress (WiP) session of the IEEE Symposium on Real-Time and Embedded Technology and Applications (RTAS), 2014.
[ICCPS 2013] A. Choudhari, H. Ramaprasad, T. Paul, J. Kimball, M. Zawodniok, B. McMillin and S. Chellappan. Stability of a Cyber-Physical Smart Grid System using Cooperating Invariants. In Work-in-Progress (WiP) session of the International Conference on Cyber-Physical Systems (ICCPS), 2013. Recipient of Best Work-in-Progress paper award.
[LCTES 2010] V.P. Jain and H. Ramaprasad. Cache-Related Delay Server for Aperiodic Job Handling in Embedded Real-Time Systems . In Work-in-Progress (WiP) session of the Conference on Languages, Compilers and Tools for Embedded Systems (LCTES), 2010.
[RTSS 2009] K. Katre, H. Ramaprasad, A. Sarkar and F. Mueller. Policies for Migration of Real-Time Tasks in Embedded Multi-Core Systems. In Work-in-Progress (WiP) session of the Real-Time Systems Symposium (RTSS), December 2009.
[PLOSA 2009] H. Ramaprasad and F. Mueller.Analytically Bounding Data Cache Behavior for Real-Time Systems. CRA-W/CDC Programming Languages, Operating Systems and Architecture Workshop (PLOSA), March 2009.
[LCTES 2006] H. Ramaprasad and F. Mueller. Feasible Preemption Point Analysis for Data Cache Reference Patterns. ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), June 2006.
[LCTES 2004] H. Ramaprasad and F. Mueller. Deriving Data Cache Reference Patterns From Cache Miss Equations. ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), June 2004.

Non-refereed / invited conference papers

[ICESS 2015] M. Shekhar, H. Ramaprasad and F. Mueller. Evaluation of Memory Access Arbitration Algorithm on Tilera's TILEPro64 platform. Invited Paper in Proc. of the International Conference on Embedded Software and Systems (ICESS), 2015.

Harini Ramaprasad

Curriculum Vitae

Education

Doctor of Philosophy

Computer Science

Master of Science

Computer Science

Bachelor of Science

Computer Science

Professional Experience

Teaching Full Professor

Department of Computer Science

Director of Undergrad Programs

College of Computing and Informatics

Assistant Chair

Department of Computer Science

Teaching Associate Professor

Department of Computer Science

Assistant Professor

Department of Electrical and Computer Engineering

Research

Real-Time / Embedded Systems

Static Timing Analysis

Single Task Analysis

Preemptive systems

Tasks with Critical Sections

Aperiodic Job Handling in Cache-Based Real-Time Systems

Real-Time Support for Multi-Core Architectures

Mechanisms for Task Migration on Multi-Core Architectures

Policies for Task Migration on Multi-Core Architectures

Cyber-Physical Systems

Mixed-Criticality Systems

Real-Time Execution of Graphics Processing Units

Teaching

Teaching Methodology

Recent courses

Software Systems Design and Implementation

Database Systems

Operating Systems and Networks

Real-Time Embedded Systems

Software Systems Design and Implementation

Topics covered

Database Systems

Topics covered

Operating Systems and Networks

Topics covered

Operating Systems

Computer Networks

System Administration

Real-Time Embedded Systems

Topics covered

Publications

Thesis

Refereed journal publications

Refereed conference publications

Refereed posters / work-in-progress papers

Non-refereed / invited conference papers

Contact Information

Address

Email

Phone