Advanced search options

Advanced Search Options 🞨

Browse by author name (“Author name starts with…”).

Find ETDs with:

in
/  
in
/  
in
/  
in

Written in Published in Earliest date Latest date

Sorted by

Results per page:

Sorted by: relevance · author · university · dateNew search

You searched for +publisher:"Delft University of Technology" +contributor:("Meenderinck, C."). Showing records 1 – 2 of 2 total matches.

Search Limiters

Last 2 Years | English Only

No search limiters apply to these results.

▼ Search Limiters


Delft University of Technology

1. Chi, C.C. Parallel H.264 Decoding Strategies for Cell Broadband Engine:.

Degree: 2010, Delft University of Technology

How to develop effcient and scalable parallel applications is the key challenge for emerging many-core architectures. We investigate this question by implementing and comparing two parallel H.264 decoders on the Cell architecture. It is expected that future many-cores will use a Cell-like local store memory hierarchy, rather than a non-scalable shared memory. The two implemented parallel algo-rithms, the Task Pool (TP) and the novel Ring-Line (RL) approach, both exploit macroblock-level parallelism. The TP implementation follows the master-slave paradigm and is very dynamic so that in theory perfect load balancing can be achieved. The RL approach is distributed and more predictable in the sense that the mapping of macroblocks to processing elements is fixed. This allows to better exploit data locality, to overlap communication with computation, and to reduce communication and synchronization overhead. While TP is more scalable in theory, the actual scalability favors RL. Using 16 SPEs, RL obtains a scalability of 12x, while the TP implementation only 10.3x. More importantly, the absolute performance of RL is much higher. Using 16 SPEs, RL achieves a throughput of 139.6 frames per second (fps) while TP achieves only 76.6 fps. A large part of the additional performance advantage is due to hiding the memory latency. From the results we conclude that in order to fully leverage the performance of future many-cores, a centralized master should be avoided and the mapping of tasks to cores should be predictable in order to be able to hide the memory latency. Advisors/Committee Members: Meenderinck, C., Juurlink, B..

Subjects/Keywords: H.264; Cell Broadband Engine; parallel; decoding

Record DetailsSimilar RecordsGoogle PlusoneFacebookTwitterCiteULikeMendeleyreddit

APA · Chicago · MLA · Vancouver · CSE | Export to Zotero / EndNote / Reference Manager

APA (6th Edition):

Chi, C. C. (2010). Parallel H.264 Decoding Strategies for Cell Broadband Engine:. (Masters Thesis). Delft University of Technology. Retrieved from http://resolver.tudelft.nl/uuid:7761537c-0549-4744-8d59-fae2c3398b9e

Chicago Manual of Style (16th Edition):

Chi, C C. “Parallel H.264 Decoding Strategies for Cell Broadband Engine:.” 2010. Masters Thesis, Delft University of Technology. Accessed April 02, 2020. http://resolver.tudelft.nl/uuid:7761537c-0549-4744-8d59-fae2c3398b9e.

MLA Handbook (7th Edition):

Chi, C C. “Parallel H.264 Decoding Strategies for Cell Broadband Engine:.” 2010. Web. 02 Apr 2020.

Vancouver:

Chi CC. Parallel H.264 Decoding Strategies for Cell Broadband Engine:. [Internet] [Masters thesis]. Delft University of Technology; 2010. [cited 2020 Apr 02]. Available from: http://resolver.tudelft.nl/uuid:7761537c-0549-4744-8d59-fae2c3398b9e.

Council of Science Editors:

Chi CC. Parallel H.264 Decoding Strategies for Cell Broadband Engine:. [Masters Thesis]. Delft University of Technology; 2010. Available from: http://resolver.tudelft.nl/uuid:7761537c-0549-4744-8d59-fae2c3398b9e

2. Fernández Abeledo, E. Implementation of Nexus: Dynamic Hardware Management Support for Multicore Platforms:.

Degree: 2010, Delft University of Technology

Current trends in computer architecture focus on multicore platforms. The target of these new platforms is to scale the performance of the system with the number of cores. However, the performance of current archictectures is limited due to thread-level parallelism overhead and programmability. StarSS is a task-based programming model that eases the programmability of multicores and tries to exploit functional parallelism within applications. However, the performance of StarSS does not scale efficiently for fine-grained tasks, as for such tasks the task management overhead becomes significant in comparison to the execution of the tasks. Nexus is a dynamic hardware support system that aims to alleviate the current overhead of StarSS, by offloading the dependency resolution process and the synchronization with the cores to hardware. In this work, we implement Nexus by defining and connecting the new hardware in a Cell archictecture simulator. The scalability, performance, and throughput of the implementation are evaluated for different task sizes and number of cores, using several dependency patterns. Furthermore, different configuration parameters are evaluated, such as the dimension of the new hardware inserted in the existing architecture. Results show a large improvement of the scalability offered by Nexus in comparison with StarSS, especially for fine-grained tasks. Nexus succeeds at alleviating the overhead of StarSS by accelerating the dependency resolution process and the synchronization with the worker cores. Furthermore, the evaluation of the Nexus system dimensions has shown that its scalability decreases slightly with its area. Advisors/Committee Members: Juurlink, B., Meenderinck, C., Van Genderen, A..

Subjects/Keywords: H.264; multicore; scalability; cell; simulator

Page 1 Page 2 Page 3 Page 4 Page 5 Page 6 Page 7

Record DetailsSimilar RecordsGoogle PlusoneFacebookTwitterCiteULikeMendeleyreddit

APA · Chicago · MLA · Vancouver · CSE | Export to Zotero / EndNote / Reference Manager

APA (6th Edition):

Fernández Abeledo, E. (2010). Implementation of Nexus: Dynamic Hardware Management Support for Multicore Platforms:. (Masters Thesis). Delft University of Technology. Retrieved from http://resolver.tudelft.nl/uuid:e74500fe-1e84-40c4-b2ac-25cac7324942

Chicago Manual of Style (16th Edition):

Fernández Abeledo, E. “Implementation of Nexus: Dynamic Hardware Management Support for Multicore Platforms:.” 2010. Masters Thesis, Delft University of Technology. Accessed April 02, 2020. http://resolver.tudelft.nl/uuid:e74500fe-1e84-40c4-b2ac-25cac7324942.

MLA Handbook (7th Edition):

Fernández Abeledo, E. “Implementation of Nexus: Dynamic Hardware Management Support for Multicore Platforms:.” 2010. Web. 02 Apr 2020.

Vancouver:

Fernández Abeledo E. Implementation of Nexus: Dynamic Hardware Management Support for Multicore Platforms:. [Internet] [Masters thesis]. Delft University of Technology; 2010. [cited 2020 Apr 02]. Available from: http://resolver.tudelft.nl/uuid:e74500fe-1e84-40c4-b2ac-25cac7324942.

Council of Science Editors:

Fernández Abeledo E. Implementation of Nexus: Dynamic Hardware Management Support for Multicore Platforms:. [Masters Thesis]. Delft University of Technology; 2010. Available from: http://resolver.tudelft.nl/uuid:e74500fe-1e84-40c4-b2ac-25cac7324942

.