A&S Graduate Studies
[PAST EVENT] Feng Yan, Computer Science - Ph.D. Defense
April 8, 2016
10:30am - 12pm
Abstract:
In the era of global, large-scale data centers residing in the cloud, many applications and users share the same pool of resources to reduce energy and operating costs and to improve availability and reliability. Along with these benefits, resource sharing also introduces performance challenges: when multiple workloads access the same resources concurrently, contention may occur and delay individual workloads. Providing performance isolation to individual workloads requires effective management methodologies. The challenge in deriving such methodologies lies in finding accurate, robust, and compact metrics and models to drive algorithms that can meet different performance objectives while using resources efficiently. This dissertation proposes a set of methodologies for solving the challenging problem of performance isolation under workload interleaving in data centers, focusing on both storage and computing components.
At the storage node level, we focus on methodologies for better interleaving user traffic with background workloads, such as tasks for improving reliability, availability, and power savings. More specifically, we develop a scheduling policy for background workloads based on the statistical characteristics of the system's busy periods, and a methodology that quantitatively estimates the performance impact of power savings. At the storage cluster level, we consider how to efficiently consolidate work and schedule asynchronous updates without violating user performance targets. More specifically, we develop a framework that estimates beforehand the benefits and overheads of each option in order to automate intelligent consolidation decisions while achieving faster eventual consistency.
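To make the busy-period idea above concrete, here is a minimal, illustrative sketch (not taken from the dissertation) of an idle-time-aware background scheduler: it predicts how long the current idle period is likely to last from recent history and dispatches a background task only when that prediction comfortably covers the task's duration. The function names, the percentile-based predictor, and the safety margin are assumptions made purely for illustration.

```python
import random
import statistics

# Illustrative sketch only: dispatch a background task (e.g., a scrub or
# rebuild step) during an idle period only when the idle time predicted
# from recent history is long enough to absorb it without delaying user I/O.

def predict_idle_time(recent_idle_periods, percentile=0.25):
    """Conservative idle-time estimate: a low percentile of recent idle lengths."""
    ordered = sorted(recent_idle_periods)
    index = max(0, int(len(ordered) * percentile) - 1)
    return ordered[index]

def should_run_background(recent_idle_periods, task_duration, safety_margin=1.2):
    """Run the background task only if the predicted idle window covers it."""
    return predict_idle_time(recent_idle_periods) >= task_duration * safety_margin

# Example: idle periods observed between user I/O bursts (in milliseconds).
history = [random.expovariate(1 / 80.0) for _ in range(200)]
print("mean idle:", round(statistics.mean(history), 1), "ms")
print("run 20 ms background task now?", should_run_background(history, 20.0))
```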
At the computing node level, we focus on improving workload interleaving at off-the-shelf servers as they are the basic building blocks of large-scale data centers. We develop priority scheduling middleware that employs different policies to schedule background tasks based on the instantaneous resource requirements of the high priority applications running on the server node. Finally, at the computing cluster level, we investigate popular computing frameworks for large-scale data intensive distributed processing, such as MapReduce and its Hadoop implementation. We develop a new Hadoop scheduler called DyScale to exploit capabilities offered by heterogeneous cores in order to achieve a variety of performance objectives.
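As a rough illustration of scheduling over heterogeneous cores, the sketch below keeps separate pools of fast and slow cores and steers latency-sensitive tasks toward fast cores while batch tasks default to slow ones, spilling over when the preferred pool is empty. This is only a toy model of the general idea; the class, policy, and task names are assumptions and do not reflect the actual DyScale implementation.

```python
from collections import deque

# Illustrative sketch only: a scheduler with two core pools (fast and slow)
# that prefers fast cores for latency-sensitive tasks and slow cores for
# batch tasks, falling back to the other pool when the preferred one is empty.

class HeterogeneousScheduler:
    def __init__(self, fast_cores, slow_cores):
        self.free = {"fast": deque(fast_cores), "slow": deque(slow_cores)}
        self.queues = {"interactive": deque(), "batch": deque()}

    def submit(self, task, job_class):
        self.queues[job_class].append(task)
        return self._dispatch()

    def _dispatch(self):
        assignments = []
        # Interactive tasks prefer fast cores; batch tasks prefer slow cores.
        prefs = {"interactive": ("fast", "slow"), "batch": ("slow", "fast")}
        for job_class, (first, second) in prefs.items():
            while self.queues[job_class]:
                pool = first if self.free[first] else second if self.free[second] else None
                if pool is None:
                    break
                core = self.free[pool].popleft()
                assignments.append((self.queues[job_class].popleft(), pool, core))
        return assignments

sched = HeterogeneousScheduler(fast_cores=["f0", "f1"], slow_cores=["s0", "s1", "s2"])
print(sched.submit("map-001", "interactive"))  # lands on a fast core
print(sched.submit("map-002", "batch"))        # lands on a slow core
```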
Bio:
Feng Yan has been working toward his Ph.D. in Computer Science at William & Mary since Fall 2011 under Dr. Evgenia Smirni. His research interests include big data, cloud computing, performance modeling and evaluation, resource management, and scheduling. He interned at Microsoft Research in 2014 and at HP Labs in 2013. He has served as a TPC member for ESEC/FSE (artifact evaluation track), IEEE BigData, ALLDATA, and DATA ANALYTICS, and as a reviewer for about 20 journals and conferences, including IEEE TCC, ACM TOS, IEEE TII, ACM SIGMETRICS, IFIP Performance, IEEE ICDCS, IEEE/IFIP DSN, USENIX ICAC, ACM/SPEC ICPE, IEEE/ACM CCGrid, and IEEE IC2E. He received his M.S. from William & Mary in 2011 and his B.S. from Northeastern in 2008.