ListarAC - Contribuciones a congresos científicos por tema "GPU"
Mostrando ítems 1-12 de 12
-
Acelerando los momentos de Zernike sobre Kepler
(2014-05-02)Este trabajo analiza las características más avanzadas de la arquitectura Kepler de Nvidia, principalmente el paralelismo dinámico para el lanzamiento de kernels desde la GPU y la planificación de hilos con Hyper-Q. ... -
C++ for Heterogeneous Programming: oneAPI (DPC++ and oneTBB)
(2020-11-19)This tutorial provides hands-on experience programming CPUs, GPUs and FPGAs using a unified, standards-based programming model: oneAPI. oneAPI includes a cross-architecture language: Data Parallel C++ (DPC++). DPC++ is an ... -
CUVLE: Variable-Length Encoding on CUDA
(2014-10-14)Data compression is the process of representing information in a compact form, in order to reduce the storage requirements and, hence, communication bandwidth. It has been one of the critical enabling technologies for ... -
Entropy-based High Performance Computation of Boolean SNP-SNP Interactions Using GPUs
(2014-05-02)It is being increasingly accepted that traditional statistical Single Nucleotide Polymorphism (SNP) analysis of Genome-Wide Association Studies (GWAS) reveals just a small part of the heritability in complex diseases. ... -
Explotando el nuevo módulo OpenCL de Intel TBB
(2018-07-20)Este artículo tiene como objetivo contribuir al desarrollo de la programación paralela trabajando en una de las herramientas desarrolladas por Intel: Intel Threading Building Blocks (Intel TBB). Hemos implementado una ... -
Expressing Heterogeneous Parallelism in C++ with Threading Building Blocks
(2017-12-18)Due to energy constraints, high performance computing platforms are becoming increasingly heterogeneous, achieving greater performance per watt through the use of hardware that is tuned to specific computational kernels ... -
GPUs para HPC: Logros y perspectivas futuras
(2013-10-18)Hace una década comenzábamos a mejorar las primeras aplicaciones científicas en GPUs utilizando Cg y OpenGL. Ahora CUDA y OpenCL han tomado el relevo, marcando un ritmo vertiginoso en la aceleración de códigos procedentes ... -
Hardware support for Local Memory Transactions on GPU Architectures
(2015-06-26)Graphics Processing Units (GPUs) are popular hardware accelerators for data-parallel applications, enabling the execution of thousands of threads in a Single Instruction - Multiple Thread (SIMT) fashion. However, the SIMT ... -
HPC Accelerators with 3D Memory
(2016-09-13)After a decade evolving in the High Performance Computing arena, GPU-equipped supercomputers have con- quered the top500 and green500 lists, providing us unprecedented levels of computational power and memory bandwidth. ... -
Improvements in Hardware Transactional Memory for GPU Architectures
(2016-07-20)In the multi-core CPU world, transactional memory (TM)has emerged as an alternative to lock-based programming for thread synchronization. Recent research proposes the use of TM in GPU architectures, where a high number of ... -
A scheduling theory framework for GPU tasks efficient execution
(2018-07-16)Concurrent execution of tasks in GPUs can reduce the computation time of a workload by overlapping data transfer and execution commands. However it is difficult to implement an efficient run- time scheduler ... -
Time Series Heterogeneous Co-execution on CPU+GPU
(2019-07-10)Time series motif (similarities) and discords discovery is one of the most important and challenging problems nowadays for time series analytics. We use an algorithm called “scrimp” that excels in collecting the relevant ...