University of Washington

1. Wang, Jingjing. Runtime Optimizations for Large-Scale Data Analytics.

Degree: PhD, 2018, University of Washington

Large-scale data analytics is key to modern science, technology, and business development. Big data systems have emerged rapidly in recent years, but modern data analytics remains challenging due to application requirements. First, users need to analyze vast amounts of data generated from various sources. Second, analysis efficiency is critical in many scenarios. Third, most big data systems are built on top of new operating environments, such as clusters and cloud services, where resource management is important. Fourth, high-level, feature-rich programming interfaces are required to support a wide variety of data and workload types. In this dissertation, we present methods to improve system efficiency for large-scale data analytics. We investigate the problem in the context of three research projects, which address the same key problem, optimization of analytical query execution, in different ways. Specifically, these projects focus on runtime optimization, which considers not only static information that is available prior to the actual execution but also, more importantly, runtime information. We demonstrate through these projects that runtime optimization can significantly improve overall system performance: it can lower query execution times, improve resource utilization, and reduce application failures.

We first present a full-stack solution for recursive relational query evaluation in shared-nothing engines. Users express their analysis using a high-level declarative language (a subset of Datalog with aggregate functions). Queries are then compiled into distributed query plans with termination guarantees. Multiple execution models for iterative queries are supported, including synchronous, asynchronous, and different processing priorities. Our evaluation shows that application properties determine which model yields the fastest query execution time.

Next, we present ElasticMem, an approach for automatic and elastic memory management for cloud data analytics applications. In clouds or clusters, a resource manager schedules applications in containers with hard memory limits, which requires accurate estimation of application memory usage before launching containers. However, memory estimation for large-scale analytical applications is difficult, and inaccurate estimates can lead to failures and performance degradation. ElasticMem avoids pre-execution memory usage estimation by elastically allocating memory across containers at runtime. Experiments show that ElasticMem outperforms static memory allocation, leading to fewer query failures, lower garbage collection overheads, and lower query times.

Lastly, we present Deluceva, a system that dynamically optimizes neural network inference for video analytics. Many video analysis approaches apply neural network models trained on images directly to each video frame. While easy to develop, these approaches do not leverage the rich temporal redundancy in videos, which can be used to further reduce model inference time. Deluceva accelerates model inference by dynamically…

Advisors/Committee Members: Balazinska, Magdalena (advisor).

APA (6th Edition):

Wang, J. (2018). Runtime Optimizations for Large-Scale Data Analytics. (Doctoral Dissertation). University of Washington. Retrieved from http://hdl.handle.net/1773/43015

Chicago Manual of Style (16th Edition):

Wang, Jingjing. “Runtime Optimizations for Large-Scale Data Analytics.” 2018. Doctoral Dissertation, University of Washington. Accessed January 23, 2019. http://hdl.handle.net/1773/43015.

MLA Handbook (7th Edition):

Wang, Jingjing. “Runtime Optimizations for Large-Scale Data Analytics.” 2018. Web. 23 Jan 2019.

Vancouver:

Wang J. Runtime Optimizations for Large-Scale Data Analytics. [Internet] [Doctoral dissertation]. University of Washington; 2018. [cited 2019 Jan 23]. Available from: http://hdl.handle.net/1773/43015.

Council of Science Editors:

Wang J. Runtime Optimizations for Large-Scale Data Analytics. [Doctoral Dissertation]. University of Washington; 2018. Available from: http://hdl.handle.net/1773/43015
