Publications

Francisco Fernández Rivera

2024
A new thread-level speculative automatic parallelization model and library based on duplicate code execution
Powerline Detection and Characterization in General-Purpose Airborne LiDAR Surveys
Assessing Intel OneAPI capabilities and cloud-performance for heterogeneous computing
2022
Virtual LiDAR simulation as a high performance computing challenge: Towards HPC HELIOS++
CIMAR, NIMAR, and LMMA: Novel algorithms for thread and memory migrations in user space on NUMA systems using hardware counters
A fast and optimal pathfinder using airborne LiDAR data
2021
Load balanced heterogeneous parallelism for finite difference problems on image denoising
LBMA and IMAR2: Weighted lottery based migration strategies for NUMA multiprocessing servers
IHP: a dynamic heterogeneous parallel scheme for iterative or time-step methods-- image denoising as case study
2019
Fast Ground Filtering of Airborne LiDAR Data Based on Iterative Scan-Line Spline Interpolation
Implementación de un algoritmo de filtrado de terreno a partir de datos LiDAR sobre SoC Zynq
Caracterización vial en base a nubes de puntos LiDAR terrestre con MPI
A new hardware counters based thread migration strategy for NUMA systems
Influence of Architectural Features of the SNC-4 Mode of the Intel Xeon Phi KNL on Matrix Multiplication
Automatic Detection and Characterization of Power Lines and their Surroundings Using LiDAR Data
Procesamiento eficiente de nubes de puntos LiDAR aéreo para aplicaciones de caracterización del terreno
2017
Comparative study of building footprint estimation methods from LiDAR point clouds
Caracterización de aplicaciones mediante información de contadores hardware en sistemas NUMA
Euro-Par 2017: Parallel Processing. Lecture Notes in Computer Science
Graph-based approach for airborne light detection and ranging segmentation
Landing sites detection using LiDAR data on manycore systems
2014
3DyRM: a dynamic roofline model including memory latency information
Multiobjective Optimization Technique Based on Monitoring Information to Increase the Performance of Thread Migration on Multicores
Thread migration techniques based on dynamic Roofline models and latency information
Performance Prediction and Evaluation
A hardware counter-based toolkit for the analysis of memory accesses in SMPs
Using an extended Roofline Model to understand data and thread affinities on NUMA systems
Study of data locality and thread affinity on multicore systems using the Roofline Model
Using sampled information: is it enough for the sparse matrix-vector product locality optimization?
Modeling the performance of parallel applications using model selection techniques
2013
Sparse matrix-vector multiplication on the Single-Chip Cloud Computer many-core processor
Extensión del modelo Roofline y herramientas para su uso
A Flexible and Dynamic Page Migration Infrastructure based on Hardware Counters
DyRM: A Dynamic Roofline Model Based on Runtime Information
High performance genetic algorithm for land use planning
2012
Uso de algoritmos genéticos para la obtención de modelos estadísticos de rendimiento
A Graphical Tool for Performance Analysis of Multicore Systems Based on the Roofline Model
Hardware Counters Based Analysis of Memory Accesses in SMPs
Model Selection to Characterize Performance using Genetic Algorithms
Experiences with the Sparse Matrix-Vector Multiplication on a Many-core Processor
Optimization of Sparse Matrix-Vector Multiplication Using Reordering Techniques on GPUs
2011
Estimating the Effect of Cache Misses on the Performance of Parallel Applications Using Analytical Models
Using accurate AIC-based performance models to improve the scheduling of parallel applications
Herramientas para la monitorización de los accesos a memoria de códigos paralelos mediante contadores hardware
Estimación del efecto de los fallos cache en el rendimiento de aplicaciones paralelas
A parallel algorithm based on simulated annealing for land use zoning plans
Study of Performance Issues on a SMP-NUMA System Using the Roofline Model
A Java-based Parallel Genetic Algorithm for the Land Use Planning Problem
2009
On the Influence of Thread Allocation for Irregular Codes in NUMA Systems
Increasing data reuse of sparse algebra codes on simultaneous multithreading architectures
El Criterio de Información de Akaike en la Obtención de Modelos Estadísticos de Rendimiento
Accurate Analytical Performance Model of Communications in MPI Applications
Greedy Performance Metrics for Grid Schedulers
2007
Simulación de códigos de N-cuerpos en sistemas de memoria distribuida mediante un algoritmo paralelo por etapas
An Inspector/Executor Based Strategy to Efficiently Parallelize N-Body Simulation Programs on shared memory systems
Simulation of parallel applications in GridSim
Software Tools for Performance Modeling of Parallel Programs