Listar AC - Contribuciones a congresos científicos por título
Mostrando ítems 89-108 de 109
-
Scalability Analysis of Signatures in Transactional Memory Systems
(2014-10-29)Signatures have been proposed in transactional memory systems to represent read and write sets and to decouple transaction conflict detection from private caches or to accelerate it. Generally, signatures are implemented ... -
A scheduling theory framework for GPU tasks efficient execution
(2018-07-16)Concurrent execution of tasks in GPUs can reduce the computation time of a workload by overlapping data transfer and execution commands. However it is difficult to implement an efficient run- time scheduler ... -
Simplified Floating-Point Units for High Dynamic Range Image and Video Systems
(2015-06-29)The upcoming arrival of high dynamic range image and video applications to consumer electronics will force the utilization of floating-point numbers on them. This paper shows that introducing a slight modification on ... -
Siting Multiple Observers for Maximum Coverage: An Accurate Approach
(2015-06-05)The selection of the minimal number of observers that ensures the maximum visual coverage over an area represented by a digital elevation model (DEM) have great interest in many elds, e.g., telecommunications, environment ... -
SkewEngine: enhancing performance of intensive calculations on regular meshes
(2023)In various applications such as hyperspectral data manipulation, MRI data exploration, or visual basin identification in digital elevation models, performing arithmetic operations on each point of a data mesh that involve ... -
SkewEngine: reorganización de mallas regulares para cálculo intensivo
(2023)En muchas aplicaciones, tales como la manipulación de datos hiperespectrales, la exploración de datos de resonancia magnética o la identificación de cuencas visuales en modelos digitales de elevación, es necesario realizar ... -
Smith-Waterman Acceleration in Multi-GPUs: A Performance per Watt Analysis
(Springer, 2017)We present a performance per watt analysis of CUDAlign 4.0, a parallel strategy to obtain the optimal alignment of huge DNA se- quences in multi-GPU platforms using the exact Smith-Waterman method. Speed-up factors and ... -
Solución de múltiples sistemas lineales en GPUs
(2013-11-05)Este trabajo se centra en el calculo, de forma concurrente, de múltiples sistemas lineales definidos por matrices densas de una dimensión media. Se considera una solución basada en la factorización de Cholesky y su ... -
Solving Large-Scale Markov Decision Processes on Low-Power Heterogeneous Platforms
(2019-07-11)Markov Decision Processes (MDPs) provide a framework for a machine to act autonomously and intelligently in environments where the effects of its actions are not deterministic. MDPs have numerous applications. We focus ... -
Tasks Fairness Scheduler for GPU
(2019-09-24)Nowadays GPU clusters are available in almost every data processing center. Their GPUs are typically shared by different applications that might have different processing needs and/or different levels of priority. As current ... -
Three is not a crowd: ACPU-GPU-FPGA K-means implementation
(2017-06-15)Clustering is the task of assigning a set of objects into groups (clusters) so that objects in the same group are more similar to each other than to those in other groups. In particular, K-means is a clustering algorithm ... -
Time Series Analysis Using Transprecision Computing
(2019-09-11)This work presents results using transprecision techniques for reducing the precision of the computation of time series analysis. The developed benchmark allows to explore how the accuracy of the results is affected by ... -
Time Series Heterogeneous Co-execution on CPU+GPU
(2019-07-10)Time series motif (similarities) and discords discovery is one of the most important and challenging problems nowadays for time series analytics. We use an algorithm called “scrimp” that excels in collecting the relevant ... -
TMbarrier: speculative barriers using hardware transactional memory
(2018-11-15)Barrier is a very common synchronization method used in parallel programming. Barriers are used typically to enforce a partial thread execution order, since there may be dependences between code sections before and after ... -
Towards a Software Transactional Memory for heterogeneous CPU-GPU processors
(2017-09-15)The heterogeneous Accelerated Processing Units (APUs) integrate a multi-core CPU and a GPU within the same chip. Modern APUs provide the programmer with platform atomics, used to communicate the CPU cores with the GPU using ... -
Towards the intelligent diagnosis of hematological diseases
(2019-11-18)In traditional medicine, patient diagnosis usually implies an in depth study of its state and symptoms that a specialist has to carry out. The adaptation and customization of the medical treatment to those individual ... -
Unified Locality-Sensitive Signatures for Transactional Memory
(Springer, 2011)Transactional memory systems coordinate the execution of concurrent transactions by committing non-conflicting ones. Transaction conflicts are detected by recording on-the-fly the memory locations issued by the threads. ... -
A weakly-supervised approach for discovering common objects in airport video surveillance footage
(2019-07-22)Object detection in video is a relevant task in computer vision. Standard and current detectors are typically trained in a strongly supervised way, what requires a huge amount of labelled data. In contrast, in this paper ... -
Weather Projections and Dynamical Downscaling for the Republic of Panama: Evaluation of Implementation Methods via GPGPU Acceleration
(2020-09)Climate change could have a critical impact on the Republic of Panama where a major segment of the economy is dependent on the operation of the Panama Canal. New capabilities to do targeted research around climate change ... -
Whole-Genome Assembly: An Experimental Study of Computational Costs and Architectural Opportunities
(2022)Whole-genome sequencing (WGS) pro- vides a huge amount of reads from which a comple- te genome could be assembled. The recent advent of long read sequencing technologies, such as PacBio and Oxford Nanopore, and the subsequent ...