Fault Tolerance Techniques For High Performance Computing



Fault-Tolerance Techniques for High-Performance Computing



Author: Thomas Herault

Language: en

Publisher: Springer

Release Date: 2015-07-01







This text presents a comprehensive overview of fault-tolerance techniques for high-performance computing (HPC). It opens with a detailed introduction to checkpoint protocols and scheduling algorithms, prediction, replication, and silent error detection and correction, together with application-specific techniques such as algorithm-based fault tolerance (ABFT). Emphasis is placed on analytical performance models. A review of general-purpose techniques follows, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources of errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.

Innovative Research and Applications in Next-Generation High Performance Computing



Author: Hassan, Qusay F.

Language: en

Publisher: IGI Global

Release Date: 2016-07-05







High-performance computing (HPC) describes the use of connected computing units to perform complex tasks. It relies on parallelization techniques and algorithms to synchronize these disparate units so that together they perform faster than a single processor could alone. Used in industries from medicine and research to military and higher education, this method of computing allows users to complete complex, data-intensive tasks. The field has undergone many changes over the past decade and will continue to grow in popularity in the coming years. Innovative Research and Applications in Next-Generation High Performance Computing aims to address the future challenges, advances, and applications of HPC and related technologies. As the need for such processors increases, so does the importance of developing new ways to optimize the performance of these supercomputers. This publication provides comprehensive information for researchers, students in ICT, program developers, military and government organizations, and business professionals.

High Performance Computing in Science and Engineering



Author: Tomáš Kozubek

Language: en

Publisher: Springer Nature

Release Date: 2021-01-07







This book constitutes the thoroughly refereed post-conference proceedings of the 4th International Conference on High Performance Computing in Science and Engineering, HPCSE 2019, held in Karolinka, Czech Republic, in May 2019. The 9 papers presented in this volume were carefully reviewed and selected from 13 submissions. The conference provides an international forum for exchanging ideas among researchers involved in scientific and parallel computing, including theory and applications, as well as applied and computational mathematics. The focus of HPCSE 2019 was on models, algorithms, and software tools that facilitate efficient and convenient utilization of modern parallel and distributed computing architectures, as well as on large-scale applications.