Publicado

2009-01-01

PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE

Palabras clave:

Video codec parallelization, multicore architectures, synchronization, H.264, multiprocessor architectures (es)

Descargas

Autores/as

  • MAURICIO ALVAREZ Ing. Universitat Politècnica de Catalunya, Barcelona, Spain
  • ALEX RAMIREZ PhD. Universitat Politècnica de Catalunya, Barcelona, Spain - Barcelona Supercomputing Center, Barcelona
  • MATEO VALERO PhD. Universitat Politècnica de Catalunya, Barcelona, Spain - Barcelona Supercomputing Center, Barcelona
  • ARNALDO AZEVEDO MSc. Delft University of Technology, Delft, The Netherlands
  • COR MEENDERINCK MSc Delft University of Technology, Delft, The Netherlands
  • BEN JUURLINK PhD. Delft University of Technology, Delft, The Netherlands
This paper presents a study of the performance scalability of a macroblock-level parallelization of the H.264 decoder for High Definition (HD) applications on a multiprocessor architecture. We have implemented this parallelization on a cache coherent Non-uniform Memory Access (cc-NUMA) shared memory multiprocessor (SMP) and compared the results with the theoretical expectations. The study includes the evaluation of three different scheduling techniques: static, dynamic and dynamic with tail-submit. A dynamic scheduling approach with a tail-submit optimization presents the best performance obtaining a maximum speedup of 9.5 with 24 processors. A detailed profiling analysis showed that thread synchronization is one of the limiting factors for achieving a better scalability. The paper includes an evaluation of the impact of using blocking synchronization APIs like POSIX threads and POSIX real-time extensions. Results showed that macroblock-level parallelism as a very fine-grain form of Thread-Level Parallelism (TLP) is highly affected by the thread synchronization overhead generated by these APIs. Other synchronization methods, possibly with hardware support, are required in order to make MB-level parallelization more scalable.

Cómo citar

APA

ALVAREZ, M., RAMIREZ, A., VALERO, M., AZEVEDO, A., MEENDERINCK, C. y JUURLINK, B. (2009). PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE. Avances en Sistemas e Informática, 6(1), 219–228. https://revistas.unal.edu.co/index.php/avances/article/view/14512

ACM

[1]
ALVAREZ, M., RAMIREZ, A., VALERO, M., AZEVEDO, A., MEENDERINCK, C. y JUURLINK, B. 2009. PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE. Avances en Sistemas e Informática. 6, 1 (ene. 2009), 219–228.

ACS

(1)
ALVAREZ, M.; RAMIREZ, A.; VALERO, M.; AZEVEDO, A.; MEENDERINCK, C.; JUURLINK, B. PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE. ava. sis. inf 2009, 6, 219-228.

ABNT

ALVAREZ, M.; RAMIREZ, A.; VALERO, M.; AZEVEDO, A.; MEENDERINCK, C.; JUURLINK, B. PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE. Avances en Sistemas e Informática, [S. l.], v. 6, n. 1, p. 219–228, 2009. Disponível em: https://revistas.unal.edu.co/index.php/avances/article/view/14512. Acesso em: 7 mar. 2025.

Chicago

ALVAREZ, MAURICIO, ALEX RAMIREZ, MATEO VALERO, ARNALDO AZEVEDO, COR MEENDERINCK, y BEN JUURLINK. 2009. «PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE». Avances En Sistemas E Informática 6 (1):219-28. https://revistas.unal.edu.co/index.php/avances/article/view/14512.

Harvard

ALVAREZ, M., RAMIREZ, A., VALERO, M., AZEVEDO, A., MEENDERINCK, C. y JUURLINK, B. (2009) «PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE», Avances en Sistemas e Informática, 6(1), pp. 219–228. Disponible en: https://revistas.unal.edu.co/index.php/avances/article/view/14512 (Accedido: 7 marzo 2025).

IEEE

[1]
M. ALVAREZ, A. RAMIREZ, M. VALERO, A. AZEVEDO, C. MEENDERINCK, y B. JUURLINK, «PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE», ava. sis. inf, vol. 6, n.º 1, pp. 219–228, ene. 2009.

MLA

ALVAREZ, M., A. RAMIREZ, M. VALERO, A. AZEVEDO, C. MEENDERINCK, y B. JUURLINK. «PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE». Avances en Sistemas e Informática, vol. 6, n.º 1, enero de 2009, pp. 219-28, https://revistas.unal.edu.co/index.php/avances/article/view/14512.

Turabian

ALVAREZ, MAURICIO, ALEX RAMIREZ, MATEO VALERO, ARNALDO AZEVEDO, COR MEENDERINCK, y BEN JUURLINK. «PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE». Avances en Sistemas e Informática 6, no. 1 (enero 1, 2009): 219–228. Accedido marzo 7, 2025. https://revistas.unal.edu.co/index.php/avances/article/view/14512.

Vancouver

1.
ALVAREZ M, RAMIREZ A, VALERO M, AZEVEDO A, MEENDERINCK C, JUURLINK B. PERFORMANCE EVALUATION OF MACROBLOCK-LEVEL PARALLELIZATION OF H.264 DECODING ON A CC-NUMA MULTIPROCESSOR ARCHITECTURE. ava. sis. inf [Internet]. 1 de enero de 2009 [citado 7 de marzo de 2025];6(1):219-28. Disponible en: https://revistas.unal.edu.co/index.php/avances/article/view/14512

Descargar cita

Visitas a la página del resumen del artículo

120

Descargas