81,690 results for "scientific computing"
Search Results
2. Exploring and Exploiting Runtime Reconfigurable Floating Point Precision in Scientific Computing: a Case Study for Solving PDEs
- Author
-
Hao, Cong "Callie"
- Subjects
Computer Science - Hardware Architecture - Abstract
Scientific computing applications, such as computational fluid dynamics and climate modeling, typically rely on 64-bit double-precision floating-point operations, which are extremely costly in terms of computation, memory, and energy. While the machine learning community has successfully adopted low-precision computation, scientific computing remains cautious due to concerns about numerical stability. To tackle this long-standing challenge, we propose a novel approach that dynamically adjusts floating-point precision at runtime, maintaining computational fidelity at lower bit widths. We first conduct a thorough analysis of data range distributions during scientific simulations to identify opportunities and challenges for dynamic precision adjustment. We then propose a runtime reconfigurable, flexible floating-point multiplier (R2F2), which automatically and dynamically adjusts multiplication precision based on the current operands, ensuring accurate results with lower bit widths. Our evaluation shows that 16-bit R2F2 reduces error rates by 70.2% compared with standard half precision, with resource overhead ranging from a 5% reduction to a 7% increase and no latency overhead. In two representative scientific computing applications, R2F2, using 16 or fewer bits, achieves the same simulation results as 32-bit precision, while standard half precision fails. This study pioneers runtime reconfigurable arithmetic, demonstrating great potential to enhance scientific computing efficiency. Code available at https://github.com/sharc-lab/R2F2., Comment: Accepted by ASP-DAC 2025
- Published
- 2024
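The operand-aware precision idea behind R2F2 can be illustrated with a toy emulation (this is not the actual hardware design; `adaptive_multiply` and its bit-allocation rule are illustrative assumptions): spend only as many exponent bits as the current operands need, and give the remaining bits of a 16-bit budget to the mantissa.

```python
import math

def round_mantissa(x, bits):
    """Round x to a float with `bits` explicit mantissa bits."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)              # x = m * 2**e with 0.5 <= |m| < 1
    scale = 2.0 ** bits
    return math.ldexp(round(m * scale) / scale, e)

def adaptive_multiply(a, b, total_bits=16):
    """Emulate an operand-aware multiply: spend only as many exponent bits
    as the current operands need; the rest of the budget goes to the
    mantissa (1 bit is reserved for the sign)."""
    exp = 0
    if a != 0.0 and b != 0.0:
        exp = max(abs(math.frexp(a)[1]), abs(math.frexp(b)[1]))
    exp_bits = max(1, exp.bit_length())
    mant_bits = total_bits - 1 - exp_bits
    prod = round_mantissa(a, mant_bits) * round_mantissa(b, mant_bits)
    return round_mantissa(prod, mant_bits)
```

For operands near 1, the sketch keeps 13 mantissa bits, three more than IEEE half precision, which is the kind of accuracy headroom the paper exploits.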
3. Generic and Scalable Differential Equation Solver for Quantum Scientific Computing
- Author
-
Sul, Jinhwan and Wang, Yan
- Subjects
Quantum Physics ,Mathematical Physics - Abstract
One of the most important topics in quantum scientific computing is solving differential equations. In this paper, a generalized quantum functional expansion (QFE) framework is proposed. In the QFE framework, a functional expansion of the solution is encoded into a quantum state, and the time evolution of the quantum state is solved with variational quantum simulation (VQS). The quantum functional encoding supports different numerical schemes of functional expansions. In the QFE framework, the lower bound on the required number of qubits is the double logarithm of the inverse error bound. Furthermore, a new parallel Pauli operation strategy is proposed that significantly improves the scalability of VQS: the number of circuits in VQS is exponentially reduced, to only quadratic order in the number of ansatz parameters. Four example differential equations are solved to demonstrate the generic QFE framework.
- Published
- 2024
4. Operator Learning Using Random Features: A Tool for Scientific Computing
- Author
-
Nelsen, Nicholas H. and Stuart, Andrew M.
- Subjects
Computer Science - Machine Learning ,Mathematics - Numerical Analysis ,Statistics - Machine Learning ,68T05, 65D40, 62J07, 62M45, 68W20, 35R60 - Abstract
Supervised operator learning centers on the use of training data, in the form of input-output pairs, to estimate maps between infinite-dimensional spaces. It is emerging as a powerful tool to complement traditional scientific computing, which may often be framed in terms of operators mapping between spaces of functions. Building on the classical random features methodology for scalar regression, this paper introduces the function-valued random features method. This leads to a supervised operator learning architecture that is practical for nonlinear problems yet is structured enough to facilitate efficient training through the optimization of a convex, quadratic cost. Due to the quadratic structure, the trained model is equipped with convergence guarantees and error and complexity bounds, properties that are not readily available for most other operator learning architectures. At its core, the proposed approach builds a linear combination of random operators. This turns out to be a low-rank approximation of an operator-valued kernel ridge regression algorithm, and hence the method also has strong connections to Gaussian process regression. The paper designs function-valued random features that are tailored to the structure of two nonlinear operator learning benchmark problems arising from parametric partial differential equations. Numerical results demonstrate the scalability, discretization invariance, and transferability of the function-valued random features method., Comment: 36 pages, 1 table, 9 figures. SIGEST version of SIAM J. Sci. Comput. Vol. 43 No. 5 (2021) pp. A3212-A3243, hence text overlap with arXiv:2005.10224
- Published
- 2024
- Full Text
- View/download PDF
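The classical scalar random features methodology that the paper builds on can be sketched as follows (a minimal NumPy sketch, not the function-valued method itself; all parameter choices are illustrative): random Fourier features are drawn once and frozen, so training reduces to a convex, quadratic ridge-regression problem in the linear coefficients.

```python
import numpy as np

rng = np.random.default_rng(0)

# Training data: noisy samples of a smooth target function on [0, 1].
x = rng.uniform(0.0, 1.0, size=(200, 1))
y = np.sin(2 * np.pi * x[:, 0]) + 0.01 * rng.normal(size=200)

# Random Fourier features, drawn once and then frozen; only the linear
# coefficients alpha are trained.
n_features = 100
w = rng.normal(scale=10.0, size=(1, n_features))
b = rng.uniform(0.0, 2 * np.pi, size=n_features)
phi = np.cos(x @ w + b)                    # feature matrix, shape (200, 100)

# Training is a convex, quadratic (ridge regression) problem in alpha.
lam = 1e-3
alpha = np.linalg.solve(phi.T @ phi + lam * np.eye(n_features), phi.T @ y)

# Evaluate the trained surrogate on a fresh grid.
xt = np.linspace(0.0, 1.0, 50)[:, None]
pred = np.cos(xt @ w + b) @ alpha
err = np.max(np.abs(pred - np.sin(2 * np.pi * xt[:, 0])))
```

Because the objective is quadratic, the fit is a single linear solve; the paper lifts this construction from scalar outputs to operators between function spaces.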
5. A Review of Quantum Scientific Computing Algorithms for Engineering Problems
- Author
-
Raisuddin, Osama Muhammad and De, Suvranu
- Subjects
Quantum Physics ,Mathematical Physics - Abstract
Quantum computing, leveraging quantum phenomena like superposition and entanglement, is emerging as a transformative force in computing technology, promising unparalleled computational speed and efficiency crucial for engineering applications. This advancement presents both opportunities and challenges, requiring engineers to familiarize themselves with quantum principles, applications, and complexities. This paper systematically explores the foundational concepts of quantum mechanics and their implications for computational advancements, emphasizing the superiority of quantum algorithms in solving engineering problems. It identifies areas where gate-based quantum computing has the potential to outperform classical methods despite facing scalability and coherence issues. By offering clear examples with minimal reliance on in-depth quantum physics or hardware specifics, the aim is to make quantum computing accessible to engineers, addressing the steep learning curve and fostering its practical adoption for complex problem-solving and technological advancement as quantum hardware becomes more robust and reliable.
- Published
- 2024
6. Scientific Computing with Large Language Models
- Author
-
Culver, Christopher, Hicks, Peter, Milenkovic, Mihailo, Shanmugavelu, Sanjif, and Becker, Tobias
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
We provide an overview of the emergence of large language models for scientific computing applications. We highlight use cases that involve natural language processing of scientific documents and specialized languages designed to describe physical systems. For the former, chatbot-style applications appear in medicine, mathematics, and physics, and can be used iteratively with domain experts for problem solving. We also review specialized languages within molecular biology, the languages of molecules, proteins, and DNA, where language models are being used to predict properties and even create novel physical systems at much faster rates than traditional computing methods., Comment: 13 pages
- Published
- 2024
7. Accelerating imaging research at large-scale scientific facilities through scientific computing
- Author
-
Chunpeng Wang, Xiaoyun Li, Rongzheng Wan, Jige Chen, Jing Ye, Ke Li, Aiguo Li, Renzhong Tai, and Alessandro Sepe
- Subjects
scientific computing ,synchrotron ,imaging ,automation ,tomography ,Nuclear and particle physics. Atomic energy. Radioactivity ,QC770-798 ,Crystallography ,QD901-999 - Abstract
To date, computed tomography experiments carried out at synchrotron radiation facilities worldwide pose a tremendous challenge in terms of the breadth and complexity of the experimental datasets produced. Furthermore, near real-time three-dimensional reconstruction capabilities are becoming a crucial requirement for performing high-quality, result-informed synchrotron imaging experiments, where a large amount of data is collected and processed within a short time window. To address these challenges, we have developed and deployed a synchrotron computed tomography framework that automatically processes the experimental data from the synchrotron imaging beamlines online, while leveraging high-performance computing cluster capabilities to accelerate real-time feedback to users on their experimental results. We have further integrated it within a modern unified national authentication and data management framework, which we have developed and deployed, spanning the entire data lifecycle of a large-scale scientific facility. In this study, the overall architecture, functional modules, and workflow design of our synchrotron computed tomography framework are presented in detail. Moreover, the successful integration of the imaging beamlines at the Shanghai Synchrotron Radiation Facility into our scientific computing framework is also detailed, which ultimately accelerated and fully automated their entire data processing pipelines. Compared with the original three-dimensional tomography reconstruction approaches, our framework accelerated the experimental data processing capabilities while maintaining a high level of integration with all the beamline processing software and systems.
- Published
- 2024
- Full Text
- View/download PDF
8. Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing
- Author
-
Pan, Haochen, Chard, Ryan, Zhou, Sicheng, Kamatar, Alok, Vescovi, Rafael, Hayot-Sasson, Valérie, Bauer, André, Gonthier, Maxime, Chard, Kyle, and Foster, Ian
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Scientific research increasingly relies on distributed computational resources, storage systems, networks, and instruments, ranging from HPC and cloud systems to edge devices. Event-driven architecture (EDA) benefits applications targeting distributed research infrastructures by enabling the organization, communication, processing, reliability, and security of events generated from many sources. To support the development of scientific EDA, we introduce Octopus, a hybrid, cloud-to-edge event fabric designed to link many local event producers and consumers with cloud-hosted brokers. Octopus can be scaled to meet demand, permits the deployment of highly available Triggers for automatic event processing, and enforces fine-grained access control. We identify requirements in self-driving laboratories, scientific data automation, online task scheduling, epidemic modeling, and dynamic workflow management use cases, and present results demonstrating Octopus' ability to meet those requirements. Octopus supports producing and consuming events at a rate of over 4.2 M and 9.6 M events per second, respectively, from distributed clients., Comment: 12 pages and 8 figures. Camera-ready version for FTXS'24 (https://sites.google.com/view/ftxs2024)
- Published
- 2024
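The Trigger pattern described in the Octopus abstract can be illustrated with a toy in-process broker (this is not the Octopus API; `Broker` and the topic names are invented for illustration): consumers register triggers that run automatically for every event published to a topic.

```python
from collections import defaultdict

class Broker:
    """Toy event broker: topics, subscribers, and automatic triggers."""
    def __init__(self):
        self.triggers = defaultdict(list)

    def register_trigger(self, topic, action):
        # A trigger runs automatically for every event published to `topic`.
        self.triggers[topic].append(action)

    def publish(self, topic, event):
        for action in self.triggers[topic]:
            action(event)

broker = Broker()
processed = []
broker.register_trigger("instrument/scan-complete",
                        lambda ev: processed.append(ev["file"]))
broker.publish("instrument/scan-complete", {"file": "scan_001.h5"})
```

A real deployment replaces the in-process dictionary with cloud-hosted brokers and adds access control, but the producer/trigger contract is the same.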
9. Bridging Worlds: Achieving Language Interoperability between Julia and Python in Scientific Computing
- Author
-
Osborne, Ianna, Pivarski, Jim, and Ling, Jerry
- Subjects
Computer Science - Programming Languages ,Physics - Data Analysis, Statistics and Probability - Abstract
In the realm of scientific computing, both Julia and Python have established themselves as powerful tools. Within the context of High Energy Physics (HEP) data analysis, Python has been traditionally favored, yet there exists a compelling case for migrating legacy software to Julia. This article focuses on language interoperability, specifically exploring how Awkward Array data structures can bridge the gap between Julia and Python. The talk offers insights into key considerations such as memory management, data buffer copies, and dependency handling. It delves into the performance enhancements achieved by invoking Julia from Python and vice versa, particularly for intensive array-oriented calculations involving large-scale, though not excessively dimensional, arrays of HEP data. The advantages and challenges inherent in achieving interoperability between Julia and Python in the domain of scientific computing are discussed., Comment: 8 pages, 1 figure, ACAT2024 workshop
- Published
- 2024
10. Building Flexible Machine Learning Models for Scientific Computing at Scale
- Author
-
Chen, Tianyu, Zhou, Haoyi, Li, Ying, Wang, Hao, Gao, Chonghan, Zhang, Shanghang, and Li, Jianxin
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Foundation models have revolutionized knowledge acquisition across domains, and our study introduces OmniArch, a paradigm-shifting approach designed for building foundation models in multi-physics scientific computing. OmniArch's pre-training involves a versatile pipeline that processes multi-physics spatio-temporal data, casting forward problem learning into scalable auto-regressive tasks, while our novel Physics-Informed Reinforcement Learning (PIRL) technique during fine-tuning ensures alignment with physical laws. Pre-trained on the comprehensive PDEBench dataset, OmniArch not only sets new performance benchmarks for 1D, 2D and 3D PDEs but also demonstrates exceptional adaptability to new physics via few-shot and zero-shot learning approaches. The model's representations further extend to inverse problem-solving, highlighting the transformative potential of AI-enabled Scientific Computing (AI4SC) foundation models for engineering applications and physics discovery., Comment: Work in Progress
- Published
- 2024
11. Report of the DOE/NSF Workshop on Correctness in Scientific Computing, June 2023, Orlando, FL
- Author
-
Gokhale, Maya, Gopalakrishnan, Ganesh, Mayo, Jackson, Nagarakatte, Santosh, Rubio-González, Cindy, and Siegel, Stephen F.
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Software Engineering ,B.8.1 ,C.1.4 ,D.0.3 ,D.0.4 ,D.1.3 ,D.2.1 ,D.2.5 ,D.3.1 ,G.1.2 ,J.2 - Abstract
This report is a digest of the DOE/NSF Workshop on Correctness in Scientific Computing (CSC'23) held on June 17, 2023, as part of the Federated Computing Research Conference (FCRC) 2023. CSC was conceived by DOE and NSF to address the growing concerns about correctness among those who employ computational methods to perform large-scale scientific simulations. These concerns have escalated given the complexity, scale, and heterogeneity of today's HPC software and hardware. If correctness is not proactively addressed, there is the risk of producing flawed science on top of unacceptable productivity losses faced by computational scientists and engineers. HPC systems are beginning to include data-driven methods, including machine learning and surrogate models, and their impact on overall HPC system correctness was also deemed urgent to discuss. Stakeholders of correctness in this space were identified as belonging to several sub-disciplines of computer science: from computer architecture researchers who design special-purpose hardware that offers high energy efficiency; to numerical algorithm designers who develop efficient computational schemes based on reduced precision as well as reduced data movement; all the way to researchers in programming languages and formal methods who seek methodologies for correct compilation and verification. To include attendees with such a diverse set of backgrounds, CSC was held during the Federated Computing Research Conference (FCRC) 2023., Comment: 36 pages. DOE/NSF Workshop on Correctness in Scientific Computing (CSC 2023) was a PLDI 2023 workshop
- Published
- 2023
12. Merging Traditional Scientific Computing with Data Science to Develop New Class of Prediction Engines
- Author
-
Terasaki, Gbocho Masato
- Published
- 2024
13. Quantum algorithms for scientific computing
- Author
-
Au-Yeung, R., Camino, B., Rathore, O., and Kendon, V.
- Subjects
Quantum Physics ,Physics - Computational Physics - Abstract
Quantum computing promises to provide the next step up in computational power for diverse application areas. In this review, we examine the science behind the quantum hype, and the breakthroughs required to achieve true quantum advantage in real world applications. Areas that are likely to have the greatest impact on high performance computing (HPC) include simulation of quantum systems, optimization, and machine learning. We draw our examples from electronic structure calculations and computational fluid dynamics which account for a large fraction of current scientific and engineering use of HPC. Potential challenges include encoding and decoding classical data for quantum devices, and mismatched clock speeds between classical and quantum processors. Even a modest quantum enhancement to current classical techniques would have far-reaching impacts in areas such as weather forecasting, aerospace engineering, and the design of "green" materials for sustainable development. This requires significant effort from the computational science, engineering and quantum computing communities working together., Comment: Accepted by Reports on Progress in Physics. 43 pages + 29 pages references
- Published
- 2023
14. Scientific Computing with Open SageMath not only for Physics Education
- Author
-
Borovský, Dominik, Hanč, Jozef, and Hančová, Martina
- Subjects
Physics - Physics Education - Abstract
Nowadays, interactive digital scientific environments have become an integral part of scientific computing, used to solve a variety of tasks in research as well as in STEM education. We introduce SageMath, or shortly Sage -- a free, open, Python-based alternative to well-known commercial software -- within our course Methods of Physical Problems Solving for future scientists and science teachers. In the first illustrative example, from the Physics Olympiad, we present Sage as an open data-source, symbolic, numerical, and visualization tool. The second example, from the Young Physicists' Tournament, shows Sage as a multimedia, modeling, and programming tool. By employing SageMath as an open digital environment for scientific computing in the education of all STEM disciplines, teachers and students are empowered not only with a universal educational tool but also with a real research tool, enabling them to engage in interactive visualization, modeling, programming, and the solving of authentic, complex interdisciplinary problems, thus naturally enhancing their motivation to pursue science in alignment with the core mission of STEM education., Comment: 9 pages, 3 figures, 1 table, conference DIDSCI2022
- Published
- 2023
15. Leveraging DevOps for Scientific Computing
- Author
-
Nuyujukian, Paul
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Critical goals of scientific computing are to increase scientific rigor, reproducibility, and transparency while keeping up with ever-increasing computational demands. This work presents an integrated framework well-suited for data processing and analysis spanning individual, on-premises, and cloud environments. This framework leverages three well-established DevOps tools: 1) Git repositories linked to 2) CI/CD engines operating on 3) containers. It supports the full life-cycle of scientific data workflows with minimal friction between stages--including solutions for researchers who generate data. This is achieved by leveraging a single container that supports local, interactive user sessions and deployment in HPC or Kubernetes clusters. Combined with Git repositories integrated with CI/CD, this approach enables decentralized data pipelines across multiple, arbitrary computational environments. This framework has been successfully deployed and validated within our research group, spanning experimental acquisition systems and computational clusters with open-source, purpose-built GitLab CI/CD executors for slurm and Google Kubernetes Engine Autopilot. Taken together, this framework can increase the rigor, reproducibility, and transparency of compute-dependent scientific research.
- Published
- 2023
16. Bandicoot: C++ Library for GPU Linear Algebra and Scientific Computing
- Author
-
Curtin, Ryan R., Edel, Marcus, and Sanderson, Conrad
- Subjects
Computer Science - Mathematical Software ,65Y05, 65F45, 15-04 ,G.1.3 ,G.4 ,I.3.1 ,I.3.6 - Abstract
This report provides an introduction to the Bandicoot C++ library for linear algebra and scientific computing on GPUs, overviewing its user interface and performance characteristics, as well as the technical details of its internal design. Bandicoot is the GPU-enabled counterpart to the well-known Armadillo C++ linear algebra library, aiming to allow users to take advantage of GPU-accelerated computation for their existing codebases without significant changes. Adapting the same internal template meta-programming techniques that Armadillo uses, Bandicoot is able to provide compile-time optimisation of mathematical expressions within user code. The library is ready for production use and is available at https://coot.sourceforge.io. Bandicoot is distributed under the Apache 2.0 License.
- Published
- 2023
17. Energy Estimates Across Layers of Computing: From Devices to Large-Scale Applications in Machine Learning for Natural Language Processing, Scientific Computing, and Cryptocurrency Mining
- Author
-
Shankar, Sadasivan
- Subjects
Computer Science - Computers and Society ,Computer Science - Artificial Intelligence ,C.3 ,C.4 ,I.2 ,J.2 - Abstract
Estimates of energy usage across layers of computing, from devices to algorithms, have been determined and analyzed. Building on the previous analysis [3], the energy needed by single devices and systems has been estimated for three large-scale computing applications: Artificial Intelligence (AI)/Machine Learning for Natural Language Processing, Scientific Simulations, and Cryptocurrency Mining. In contrast to bit-level switching, where transistors achieved energy efficiency through geometrical scaling, higher energy is expended at both the instruction and simulation levels of an application. Additionally, the analysis based on AI/ML accelerators indicates that an architectural change on an older semiconductor technology node can match the energy efficiency of a different architecture on a newer technology. Further comparisons of the energy in computing systems with the thermodynamic and biological limits indicate that total simulation of an application requires 27-36 orders of magnitude more energy. These energy estimates underscore the need for serious consideration of energy efficiency in computing by including energy as a design parameter, enabling the growing needs of compute-intensive applications in a digital world., Comment: 6 pages, 5 figures
- Published
- 2023
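The thermodynamic limit invoked in such comparisons can be made concrete with a short calculation (the 1 fJ per-switch figure below is an assumed order-of-magnitude value, not taken from the paper; the paper's 27-36 orders of magnitude refer to full applications rather than single switching events):

```python
import math

k_B = 1.380649e-23          # Boltzmann constant, J/K (exact SI value)
T = 300.0                   # room temperature, K

# Landauer limit: minimum energy to erase one bit of information.
landauer_j = k_B * T * math.log(2)     # on the order of 3e-21 J

# Assumed, illustrative figure: a modern logic switching event
# dissipates roughly a femtojoule.
switch_j = 1e-15
gap_orders = math.log10(switch_j / landauer_j)
```

Even a single present-day switching event sits several orders of magnitude above the Landauer bound; accumulating instructions, memory traffic, and full simulations widens the gap to the ranges the paper reports.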
18. A method for automated regression test in scientific computing libraries: illustration with SPHinXsys
- Author
-
Zhang, Bo, Zhang, Chi, and Hu, Xiangyu
- Subjects
Physics - Computational Physics - Abstract
Scientific computing libraries, whether in-house or open-source, have experienced enormous progress in both engineering and scientific research. It is therefore essential to ensure that modifications to the source code arising from bug fixing or new feature development do not compromise the accuracy and functionality that have already been validated and verified. With this in mind, this paper introduces a method for developing and implementing an automated regression test environment and takes the open-source multi-physics library SPHinXsys [zhang2021sphinxsys] as an example. First, the reference database for each benchmark test is generated from monitored data over multiple executions. This database contains the maximum variation range of metrics for different types of strategies, i.e., the time-averaged method, the ensemble-averaged method, and the dynamic time warping method, covering the uncertainty arising from parallel computing, particle relaxation, physical instabilities, etc. Then, new results obtained after source-code modification are tested against them using a curve-similarity-based comparison. Whenever the source code is updated, the regression test is carried out automatically for all test cases and used to report the validity of the current results. This regression test environment has already been implemented in all dynamics test cases released in SPHinXsys, including fluid dynamics, solid mechanics, fluid-structure interaction, thermal and mass diffusion, reaction-diffusion, and their multi-physics coupling, and shows good capability for testing various problems. It is worth noting that while the present test environment is built and implemented for a specific scientific computing library, its underlying principle is generic and can be applied to many others., Comment: 39 pages, 13 figures
- Published
- 2023
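The time-averaged strategy described in this abstract can be sketched in a few lines (a simplified illustration, not SPHinXsys code; the tolerance handling is an assumption): reference runs define a variation band for a metric's time average, and a new run passes if its average falls inside the band.

```python
import statistics

def reference_band(runs, tol=0.05):
    """Build a [lo, hi] acceptance band from the time-averaged metric of
    several reference executions, widened by a relative tolerance to
    absorb run-to-run variation (parallelism, physical instabilities)."""
    means = [statistics.fmean(run) for run in runs]
    lo, hi = min(means), max(means)
    pad = tol * max(abs(lo), abs(hi))
    return lo - pad, hi + pad

def regression_test(new_run, band):
    """Pass iff the new run's time-averaged metric lies inside the band."""
    lo, hi = band
    return lo <= statistics.fmean(new_run) <= hi

# Two reference executions of the same benchmark, one monitored metric each.
band = reference_band([[1.00, 1.02, 0.98], [1.01, 0.99, 1.03]])
```

The ensemble-averaged and dynamic-time-warping strategies mentioned in the paper replace the time average with other curve summaries, but the pass/fail structure is the same.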
19. Big-PERCIVAL: Exploring the Native Use of 64-Bit Posit Arithmetic in Scientific Computing
- Author
-
Mallasén, David, Del Barrio, Alberto A., and Prieto-Matias, Manuel
- Subjects
Computer Science - Hardware Architecture - Abstract
The accuracy requirements in many scientific computing workloads result in the use of double-precision floating-point arithmetic in the execution kernels. Nevertheless, emerging real-number representations, such as posit arithmetic, show promise in delivering even higher accuracy in such computations. In this work, we explore the native use of 64-bit posits in a series of numerical benchmarks and compare their timing performance, accuracy and hardware cost to IEEE 754 doubles. In addition, we also study the conjugate gradient method for numerically solving systems of linear equations in real-world applications. For this, we extend the PERCIVAL RISC-V core and the Xposit custom RISC-V extension with posit64 and quire operations. Results show that posit64 can obtain up to 4 orders of magnitude lower mean square error than doubles. This leads to a reduction in the number of iterations required for convergence in some iterative solvers. However, leveraging the quire accumulator register can limit the order of some operations such as matrix multiplications. Furthermore, detailed FPGA and ASIC synthesis results highlight the significant hardware cost of 64-bit posit arithmetic and quire. Despite this, the large accuracy improvements achieved with the same memory bandwidth suggest that posit arithmetic may provide a potential alternative representation for scientific computing., Comment: 12 pages. Code available at https://github.com/artecs-group/PERCIVAL. Reviewed version to be published in IEEE TC
- Published
- 2023
- Full Text
- View/download PDF
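The precision sensitivity of iterative solvers such as the conjugate gradient method can be demonstrated with plain IEEE types (a NumPy sketch using float32 versus float64 as a stand-in; it does not use posit arithmetic, which would require a posit library or the PERCIVAL hardware):

```python
import numpy as np

def conjugate_gradient(A, b, dtype, iters=50):
    """Plain conjugate gradient on A x = b, with all arithmetic in `dtype`."""
    A = A.astype(dtype)
    b = b.astype(dtype)
    x = np.zeros_like(b)
    r = b - A @ x
    p = r.copy()
    rs = r @ r
    for _ in range(iters):
        Ap = A @ p
        step = rs / (p @ Ap)
        x = x + step * p
        r = r - step * Ap
        rs_new = r @ r
        if rs_new < 1e-28 * (b @ b):   # converged to working precision
            break
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

rng = np.random.default_rng(1)
M = rng.normal(size=(20, 20))
A = M @ M.T + 20.0 * np.eye(20)        # well-conditioned SPD test system
b = rng.normal(size=20)

x64 = conjugate_gradient(A, b, np.float64)   # converges near machine precision
x32 = conjugate_gradient(A, b, np.float32)   # stagnates at single-precision level
```

The gap between the two residuals is the effect the paper targets: a representation with lower rounding error per operation (such as posit64 with a quire) can cut the attainable error and the iteration count.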
21. IterLara: A Turing Complete Algebra for Big Data, AI, Scientific Computing, and Database
- Author
-
Li, Hongxiao, Gao, Wanling, Wang, Lei, and Zhan, Jianfeng
- Subjects
Computer Science - Databases ,Computer Science - Computation and Language ,Computer Science - Data Structures and Algorithms - Abstract
Lara is a key-value algebra that aims at unifying linear and relational algebra with three types of operation abstraction. The study of Lara's expressive ability reports that it can represent relational algebra and most linear algebra operations. However, several essential computations, such as matrix inversion and the determinant, cannot be expressed in Lara; nor can Lara represent global or iterative computation. This article proposes IterLara, which extends Lara with iterative operators, to provide an algebraic model that unifies operations in general-purpose computing, including big data, AI, scientific computing, and databases. We study the expressive ability of Lara and IterLara and prove that IterLara with aggregation functions can represent matrix inversion and the determinant. Besides, we demonstrate that IterLara with no limitation on function utility is Turing complete. We also propose the Operation Count (OP) as a metric of computation amount for IterLara and ensure that the OP metric is consistent with existing computation metrics.
- Published
- 2023
22. Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing
- Author
-
Li, Kun, Li, Zhichun, Chen, Yuetao, Wang, Zixuan, Zhang, Yiwei, Yuan, Liang, Jia, Haipeng, Zhang, Yunquan, Cao, Ting, and Yang, Mao
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Stencil computation is one of the most important kernels in scientific computing. Nowadays, most Stencil-driven scientific computing still relies heavily on supercomputers, suffering from expensive access, poor scalability, and duplicated optimizations. This paper proposes Tetris, the first system for high-performance Stencil on heterogeneous CPU+GPU platforms, towards democratizing Stencil-driven scientific computing on the cloud. In Tetris, polymorphic tiling tetrominoes are proposed to bridge different hardware architectures and various application contexts with a perfect spatial and temporal tessellation, automatically. Tetris comprises three main components: (1) underlying hardware characteristics are captured to achieve sophisticated Pattern Mapping via register-level tetrominoes; (2) an efficient Locality Enhancer provides data reuse on the spatial and temporal dimensions simultaneously via cache/SMEM-level tetrominoes; (3) a novel Concurrent Scheduler exploits the full potential of on-cloud memory and computing power via memory-level tetrominoes. Tetris is orthogonal to (and complements) the optimizations and deployments of a wide variety of emerging and legacy scientific computing applications. Results on a thermal diffusion simulation demonstrate a 29.6x performance improvement, reducing time cost from a day to an hour while preserving the original accuracy.
- Published
- 2023
23. Scientific Computing with Diffractive Optical Neural Networks
- Author
-
Chen, Ruiyang, Tang, Yingheng, Ma, Jianzhu, and Gao, Weilu
- Subjects
Computer Science - Machine Learning ,Computer Science - Emerging Technologies ,Physics - Optics - Abstract
Diffractive optical neural networks (DONNs) have emerged as a high-throughput, energy-efficient hardware platform for performing all-optical machine learning (ML) in machine vision systems. However, the applications of DONNs demonstrated so far are largely straightforward image classification tasks, which undermines the prospect of developing and utilizing such hardware for other ML applications. Here, we numerically and experimentally demonstrate the deployment of an all-optical reconfigurable DONNs system for scientific computing, including guiding two-dimensional quantum material synthesis, predicting the properties of nanomaterials and small-molecule cancer drugs, predicting the device response of nanopatterned integrated photonic power splitters, and dynamically stabilizing an inverted pendulum with reinforcement learning. Despite a large variety of input data structures, we develop a universal feature engineering approach to convert categorical input features into images that can be processed in the DONNs system. Our results open up new opportunities for employing DONNs systems in a broad range of ML applications.
- Published
- 2023
24. High Energy Physics Exascale Requirements Review. An Office of Science review sponsored jointly by Advanced Scientific Computing Research and High Energy Physics, June 10-12, 2015, Bethesda, Maryland
- Author
-
Habib, Salman, Roser, Robert, Gerber, Richard, Antypas, Katie, Dart, Eli, Dosanjh, Sudip, Hack, James, Monga, Inder, Papka, Michael E, Riley, Katherine, Rotman, Lauren, Straatsma, Tjerk, Wells, Jack, Williams, Tim, Almgren, A, Amundson, J, Bailey, Stephen, Bard, Deborah, Bloom, Ken, Bockelman, Brian, Borgland, Anders, Borrill, Julian, Boughezal, Radja, Brower, Richard, Cowan, Benjamin, Finkel, Hal, Frontiere, Nicholas, Fuess, Stuart, Ge, Lixin, Gnedin, Nick, Gottlieb, Steven, Gutsche, Oliver, Han, T, Heitmann, Katrin, Hoeche, Stefan, Ko, Kwok, Kononenko, Oleksiy, LeCompte, Thomas, Li, Zheng, Lukic, Zarija, Mori, Warren, Ng, Cho-Kuen, Nugent, Peter, Oleynik, Gene, O’Shea, Brian, Padmanabhan, Nikhil, Petravick, Donald, Petriello, Frank J, Pope, Adrian, Power, John, Qiang, Ji, Reina, Laura, Rizzo, Thomas Gerard, Ryne, Robert, Schram, Malachi, Spentzouris, P, Toussaint, Doug, Vay, Jean Luc, Viren, B, Wuerthwein, Frank, Xiao, Liling, and Coffey, Richard
- Published
- 2023
25. Basic Energy Sciences Exascale Requirements Review. An Office of Science review sponsored jointly by Advanced Scientific Computing Research and Basic Energy Sciences, November 3-5, 2015, Rockville, Maryland
- Author
-
Windus, Theresa, Banda, Michael, Devereaux, Thomas, White, Julia C, Antypas, Katie, Coffey, Richard, Dart, Eli, Dosanjh, Sudip, Gerber, Richard, Hack, James, Monga, Inder, Papka, Michael E, Riley, Katherine, Rotman, Lauren, Straatsma, Tjerk, Wells, Jack, Baruah, Tunna, Benali, Anouar, Borland, Michael, Brabec, Jiri, Carter, Emily, Ceperley, David, Chan, Maria, Chelikowsky, James, Chen, Jackie, Cheng, Hai-Ping, Clark, Aurora, Darancet, Pierre, DeJong, Wibe, Deslippe, Jack, Dixon, David, Donatelli, Jeffrey, Dunning, Thomas, Fernandez-Serra, Marivi, Freericks, James, Gagliardi, Laura, Galli, Giulia, Garrett, Bruce, Glezakou, Vassiliki-Alexandra, Gordon, Mark, Govind, Niri, Gray, Stephen, Gull, Emanuel, Gygi, Francois, Hexemer, Alexander, Isborn, Christine, Jarrell, Mark, Kalia, Rajiv K, Kent, Paul, Klippenstein, Stephen, Kowalski, Karol, Krishnamurthy, Hulikal, Kumar, Dinesh, Lena, Charles, Li, Xiaosong, Maier, Thomas, Markland, Thomas, McNulty, Ian, Millis, Andrew, Mundy, Chris, Nakano, Aiichiro, Niklasson, AMN, Panagiotopoulos, Thanos, Pandolfi, Ron, Parkinson, Dula, Pask, John, Perazzo, Amedeo, Rehr, John, Rousseau, Roger, Sankaranarayanan, Subramanian, Schenter, Greg, Selloni, Annabella, Sethian, Jamie, Siepmann, Ilja, Slipchenko, Lyudmila, Sternberg, Michael, Stevens, Mark, Summers, Michael, Sumpter, Bobby, Sushko, Peter, Thayer, Jana, Toby, Brian, Tull, Craig, Valeev, Edward, Vashishta, Priya, Venkatakrishnan, V, Yang, C, Yang, Ping, and Zwart, Peter H
- Published
- 2023
26. Deep Active Learning for Scientific Computing in the Wild
- Author
-
Ren, Simiao, Deng, Yang, Padilla, Willie J., Collins, Leslie, and Malof, Jordan
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Deep learning (DL) is revolutionizing the scientific computing community. To reduce the data gap caused by usually expensive simulations or experimentation, active learning has been identified as a promising solution for the scientific computing community. However, the deep active learning (DAL) literature is currently dominated by image classification problems and pool-based methods, which are not directly transferable to scientific computing problems, a field dominated by regression problems with no pre-defined 'pool' of unlabeled data. Here, for the first time, we investigate the robustness of DAL methods for scientific computing problems using ten state-of-the-art DAL methods and eight benchmark problems. We show that, to our surprise, the majority of the DAL methods are not robust even compared to random sampling when the ideal pool size is unknown. We further analyze the effectiveness and robustness of DAL methods and suggest that diversity is necessary for robust DAL in scientific computing problems.
- Published
- 2023
27. Physics-Guided, Physics-Informed, and Physics-Encoded Neural Networks in Scientific Computing
- Author
-
Faroughi, Salah A, Pawar, Nikhil, Fernandes, Celio, Raissi, Maziar, Das, Subasish, Kalantari, Nima K., and Mahjour, Seyed Kourosh
- Subjects
Computer Science - Machine Learning - Abstract
Recent breakthroughs in computing power have made it feasible to use machine learning and deep learning to advance scientific computing in many fields, including fluid mechanics, solid mechanics, materials science, etc. Neural networks, in particular, play a central role in this hybridization. Due to their intrinsic architecture, conventional neural networks cannot be successfully trained and scoped when data is sparse, which is the case in many scientific and engineering domains. Nonetheless, neural networks provide a solid foundation to respect physics-driven or knowledge-based constraints during training. Generally speaking, there are three distinct neural network frameworks to enforce the underlying physics: (i) physics-guided neural networks (PgNNs), (ii) physics-informed neural networks (PiNNs), and (iii) physics-encoded neural networks (PeNNs). These methods provide distinct advantages for accelerating the numerical modeling of complex multiscale multi-physics phenomena. In addition, the recent developments in neural operators (NOs) add another dimension to these new simulation paradigms, especially when the real-time prediction of complex multi-physics systems is required. All these models also come with their own unique drawbacks and limitations that call for further fundamental research. This study aims to present a review of the four neural network frameworks (i.e., PgNNs, PiNNs, PeNNs, and NOs) used in scientific computing research. The state-of-the-art architectures and their applications are reviewed, limitations are discussed, and future research opportunities in terms of improving algorithms, considering causalities, expanding applications, and coupling scientific and deep learning solvers are presented. This critical review provides researchers and engineers with a solid starting point to comprehend how to integrate different layers of physics into neural networks.
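The PiNN idea of training against a PDE residual at collocation points, rather than against labeled data, can be shown in miniature. The sketch below is illustrative only: it uses a one-parameter trial function instead of a neural network, fitting u(x) = a*x*(1-x) to u''(x) = -2 on (0,1) with u(0) = u(1) = 0, for which the exact answer is a = 1:

```python
import numpy as np

# Minimize the PDE residual of u'' + 2 = 0 at collocation points.
# The trial function u(x) = a * x * (1 - x) satisfies the boundary
# conditions by construction, and u'' = -2a, so the residual is 2 - 2a.
xs = np.linspace(0.1, 0.9, 9)          # collocation points
a, lr = 0.0, 0.05                      # trainable parameter, learning rate
for _ in range(500):
    residual = np.full_like(xs, -2.0 * a + 2.0)    # u'' + 2 with u'' = -2a
    grad = np.mean(2.0 * residual * (-2.0))        # d(mean residual^2)/da
    a -= lr * grad
```

Replacing the one-parameter trial function with a neural network (and the hand-written derivative with automatic differentiation) gives the PiNN training loop proper.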
- Published
- 2022
28. FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs
- Author
-
Zhang, Boyuan, Tian, Jiannan, Di, Sheng, Yu, Xiaodong, Feng, Yunhe, Liang, Xin, Tao, Dingwen, and Cappello, Franck
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Today's large-scale scientific applications running on high-performance computing (HPC) systems generate vast data volumes. Thus, data compression is becoming a critical technique to mitigate the storage burden and data-movement cost. However, existing lossy compressors for scientific data cannot achieve a high compression ratio and throughput simultaneously, hindering their adoption in many applications requiring fast compression, such as in-memory compression. To this end, in this work, we develop a fast and high-ratio error-bounded lossy compressor on GPUs for scientific data (called FZ-GPU). Specifically, we first design a new compression pipeline that consists of fully parallelized quantization, bitshuffle, and our newly designed fast encoding. Then, we propose a series of deep architectural optimizations for each kernel in the pipeline to take full advantage of CUDA architectures. We propose a warp-level optimization to avoid data conflicts for bit-wise operations in bitshuffle, maximize shared memory utilization, and eliminate unnecessary data movements by fusing different compression kernels. Finally, we evaluate FZ-GPU on two NVIDIA GPUs (i.e., A100 and RTX A4000) using six representative scientific datasets from SDRBench. Results on the A100 GPU show that FZ-GPU achieves an average speedup of 4.2X over cuSZ and an average speedup of 37.0X over a multi-threaded CPU implementation of our algorithm under the same error bound. FZ-GPU also achieves an average speedup of 2.3X and an average compression ratio improvement of 2.0X over cuZFP under the same data distortion., Comment: 14 pages, 12 figures, accepted by ACM HPDC '23
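The first two pipeline stages can be sketched on a CPU: an error-bounded linear-scaling quantizer followed by a bit-plane transpose (bitshuffle). This is an illustrative NumPy analogue, not FZ-GPU's CUDA implementation:

```python
import numpy as np

def quantize(data, err_bound):
    """Linear-scaling quantization: reconstruction error <= err_bound."""
    return np.round(data / (2.0 * err_bound)).astype(np.int32)

def dequantize(codes, err_bound):
    return codes * (2.0 * err_bound)

def bitshuffle(codes):
    """Bit-plane transpose: gather bit i of every code together, so smooth
    data produces long zero runs that a later encoder compresses well."""
    bits = np.unpackbits(codes.view(np.uint8).reshape(codes.size, 4), axis=1)
    return np.packbits(bits.T)

data = np.sin(np.linspace(0.0, 10.0, 1000))    # smooth "scientific" field
eb = 1e-3                                      # absolute error bound
codes = quantize(data, eb)
recon = dequantize(codes, eb)
shuffled = bitshuffle(codes)       # same bytes, reordered by bit plane
```

On the GPU, the contribution of FZ-GPU is to fully parallelize and fuse these stages with a fast encoder; the sketch only shows what each stage computes.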
- Published
- 2023
- Full Text
- View/download PDF
29. Scientific Computing Algorithms to Learn Enhanced Scalable Surrogates for Mesh Physics
- Author
-
Bartoldson, Brian R., Hu, Yeping, Saini, Amar, Cadena, Jose, Fu, Yucheng, Bao, Jie, Xu, Zhijie, Ng, Brenda, and Nguyen, Phan
- Subjects
Computer Science - Machine Learning ,Mathematics - Numerical Analysis - Abstract
Data-driven modeling approaches can produce fast surrogates to study large-scale physics problems. Among them, graph neural networks (GNNs) that operate on mesh-based data are desirable because they possess inductive biases that promote physical faithfulness, but hardware limitations have precluded their application to large computational domains. We show that it is \textit{possible} to train a class of GNN surrogates on 3D meshes. We scale MeshGraphNets (MGN), a subclass of GNNs for mesh-based physics modeling, via our domain decomposition approach to facilitate training that is mathematically equivalent to training on the whole domain under certain conditions. With this, we were able to train MGN on meshes with \textit{millions} of nodes to generate computational fluid dynamics (CFD) simulations. Furthermore, we show how to enhance MGN via higher-order numerical integration, which can reduce MGN's error and training time. We validated our methods on an accompanying dataset of 3D $\text{CO}_2$-capture CFD simulations on a 3.1M-node mesh. This work presents a practical path to scaling MGN for real-world applications., Comment: ICLR 2023 Workshop on Physics for Machine Learning
- Published
- 2023
30. AI chatbots in programming education: Students’ use in a scientific computing course and consequences for learning
- Author
-
Groothuijsen, Suzanne, van den Beemt, Antoine, Remmers, Joris C., and van Meeuwen, Ludo W.
- Published
- 2024
- Full Text
- View/download PDF
31. Apple Silicon Performance in Scientific Computing
- Author
-
Kenyon, Connor and Capano, Collin
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Physics - Computational Physics - Abstract
With the release of the Apple Silicon System-on-a-Chip processors, and the impressive performance shown in general use by both the M1 and M1 Ultra, the potential use for Apple Silicon processors in scientific computing is explored. Both the M1 and M1 Ultra are compared to current state-of-the-art data-center GPUs, including an NVIDIA V100 with PCIe, an NVIDIA V100 with NVLink, and an NVIDIA A100 with PCIe. The scientific performance is measured using the Scalable Heterogeneous Computing (SHOC) benchmark suite using OpenCL benchmarks. We find that both M1 processors outperform the GPUs in all benchmarks., Comment: 10 pages, 3 figures, IEEE HPEC
- Published
- 2022
32. Numerical Methods in Scientific Computing
- Author
-
van Kan, Jos, author, Segal, Guus, author, and Vermolen, Fred, author
- Subjects
Mathematics ,Applied mathematics ,Textbooks - Abstract
This is a book about numerically solving partial differential equations occurring in technical and physical contexts, and the authors have set themselves a more ambitious target than to just talk about the numerics. Their aim is to show the place of numerical solutions in the general modeling process, and this must inevitably lead to considerations about modeling itself. Partial differential equations usually are a consequence of applying first principles to a technical or physical problem at hand. That means that most of the time the physics also has to be taken into account, especially for validation of the numerical solution obtained. This book aims especially at engineers and scientists who have 'real world' problems. It will concern itself less with pesky mathematical detail. For the interested reader, though, we have included sections on mathematical theory to provide the necessary mathematical background. Since this treatment had to be on the superficial side, we have provided further references to the literature where necessary.
- Published
- 2023
33. Classical Numerical Methods in Scientific Computing
- Author
-
van Kan, Jos, author, Segal, Guus, author, and Vermolen, Fred, author
- Subjects
Mathematics ,Applied mathematics ,Textbooks - Abstract
Partial differential equations are paramount in mathematical modelling with applications in engineering and science. The book starts with a crash course on partial differential equations in order to familiarize the reader with fundamental properties such as existence, uniqueness and possibly existing maximum principles. The main topic of the book entails the description of classical numerical methods that are used to approximate the solution of partial differential equations. The focus is on discretization methods such as the finite difference, finite volume and finite element method. The manuscript also makes a short excursion to the solution of large sets of (non)linear algebraic equations that result after application of discretization method to partial differential equations. The book treats the construction of such discretization methods, as well as some error analysis, where it is noted that the error analysis for the finite element method is merely descriptive, rather than rigorous from a mathematical point of view. The last chapters focus on time integration issues for classical time-dependent partial differential equations. After reading the book, the reader should be able to derive finite element methods, to implement the methods and to judge whether the obtained approximations are consistent with the solution to the partial differential equations. The reader will also obtain these skills for the other classical discretization methods. Acquiring such fundamental knowledge will allow the reader to continue studying more advanced methods like meshfree methods, discontinuous Galerkin methods and spectral methods for the approximation of solutions to partial differential equations.
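As a taste of the discretization methods such books cover, the 3-point finite-difference stencil turns the two-point boundary-value problem -u'' = f, u(0) = u(1) = 0 into a tridiagonal linear system. An illustrative sketch (the right-hand side is chosen so the exact solution is sin(pi*x)):

```python
import numpy as np

# Discretize -u''(x) = f(x) on (0,1), u(0) = u(1) = 0, with the 3-point
# stencil. With f(x) = pi^2 sin(pi x) the exact solution is u = sin(pi x).
n = 99                                   # interior grid points
h = 1.0 / (n + 1)
x = np.linspace(h, 1.0 - h, n)
f = np.pi**2 * np.sin(np.pi * x)

# Tridiagonal system from the stencil (-1, 2, -1) / h^2
A = (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h**2
u = np.linalg.solve(A, f)
err = np.max(np.abs(u - np.sin(np.pi * x)))   # second-order accurate: O(h^2)
```

Comparing `err` at two grid resolutions is the standard way to confirm the second-order convergence that the error analysis in such texts predicts.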
- Published
- 2023
34. Modeling, scientific computing and optimal control for renewable energy systems with storage
- Author
-
Cantisani, Nicola, Ritschel, Tobias K. S., Thilker, Christian A., Madsen, Henrik, and Jørgensen, John Bagterp
- Subjects
Mathematics - Optimization and Control ,Electrical Engineering and Systems Science - Systems and Control - Abstract
This paper presents models for renewable energy systems with storage, and considers its optimal operation. We model and simulate wind and solar power production using stochastic differential equations as well as storage of the produced power using batteries, thermal storage, and water electrolysis. We formulate an economic optimal control problem, with the scope of controlling the system in the most efficient way, while satisfying the power demand from the electric grid. Deploying multiple storage systems allows flexibility and higher reliability of the renewable energy system., Comment: Submitted to European Control Conference 2023
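A standard building block for such stochastic production models is an Euler-Maruyama discretization of a mean-reverting SDE. The sketch below uses an Ornstein-Uhlenbeck process with illustrative parameter values that are not taken from the paper:

```python
import numpy as np

# Euler-Maruyama discretization of the Ornstein-Uhlenbeck SDE
#   dP = theta * (mu - P) dt + sigma dW,
# a common mean-reverting building block for wind/solar power models.
rng = np.random.default_rng(0)
theta, mu, sigma = 2.0, 0.5, 0.1     # reversion rate, mean level, volatility
dt, n_steps = 1e-3, 10_000
p = np.empty(n_steps + 1)
p[0] = 0.0                           # start away from the mean level
for k in range(n_steps):
    dW = rng.normal(0.0, np.sqrt(dt))          # Brownian increment
    p[k + 1] = p[k] + theta * (mu - p[k]) * dt + sigma * dW
```

Simulated paths like `p` then feed the optimal control layer as scenarios of available renewable production.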
- Published
- 2022
35. Scientific computing of radiative heat transfer with thermal slip effects near stagnation point by artificial neural network
- Author
-
Shahzad, Hasan, Sadiq, M.N., Li, Zhiyong, Algarni, Salem, Alqahtani, Talal, and Irshad, Kashif
- Published
- 2024
- Full Text
- View/download PDF
36. INFN and the evolution of distributed scientific computing in Italy.
- Author
-
Salomoni, Davide, Alkhansa, Ahmad, Antonacci, Marica, Belluomo, Patrizia, Biasotto, Massimo, Carbone, Luca Giovanni, Cesini, Daniele, Ciangottini, Diego, Ciaschini, Vincenzo, Costantini, Alessandro, Doria, Alessandra, Donvito, Giacinto, Duma, Doina Cristina, Fanzago, Federica, Foggetti, Nadina, Fornari, Federico, Giorgio, Emidio Maria, Italiano, Alessandro, Malatesta, Giada, and Martelli, Barbara
- Subjects
SCIENTIFIC computing, GRID computing, CLOUD computing, SOFTWARE as a service - Abstract
INFN has been running a distributed infrastructure (the Tier-1 at Bologna-CNAF and 9 Tier-2 centres) for more than 20 years, which currently offers about 150,000 CPU cores and 120 PB of space in both tape and disk storage, serving more than 40 international scientific collaborations. This Grid-based infrastructure was augmented in 2019 with the INFN Cloud: a production-quality, multi-site federated Cloud infrastructure, composed of a core backbone and able to integrate other INFN sites as well as public or private Clouds. The INFN Cloud provides a customizable and extensible portfolio offering computing and storage services spanning the IaaS, PaaS and SaaS layers, with dedicated solutions to serve special purposes, such as ISO-certified regions for the handling of sensitive data. INFN is now revising and expanding its infrastructure to tackle the challenges expected in the next 10 years of scientific computing by adopting a "cloud-first" approach, through which all the INFN data centres will be federated via the INFN Cloud middleware and integrated with key HPC centres, such as the pre-exascale Leonardo machine at CINECA. In this process, which involves both the infrastructures and the higher-level services, initiatives and projects such as the "Italian National Centre on HPC, Big Data and Quantum Computing" (funded in the context of the Italian "National Recovery and Resilience Plan") and the Bologna Technopole are precious opportunities that will be exploited to offer advanced resources and services to universities, research institutions and industry. In this paper we describe how INFN is evolving its computing infrastructure, with the ambition to create and operate a national vendor-neutral, open, scalable, and flexible "datalake" able to serve much more than just INFN users and experiments.
- Published
- 2024
- Full Text
- View/download PDF
37. Advanced Scientific Computing Research Exascale Requirements Review. An Office of Science review sponsored by Advanced Scientific Computing Research, September 27-29, 2016, Rockville, Maryland
- Author
-
Almgren, Ann, DeMar, Phil, Vetter, Jeffrey, Riley, Katherine, Antypas, Katie, Bard, Deborah, Coffey, Richard, Dart, Eli, Dosanjh, Sudip, Gerber, Richard, Hack, James, Monga, Inder, Papka, Michael E, Rotman, Lauren, Straatsma, Tjerk, Wells, Jack, Bernholdt, David E, Bethel, Wes, Bosilca, George, Cappello, Frank, Gamblin, Todd, Habib, Salman, Hill, Judy, Hollingsworth, Jeffrey K, McInnes, Lois Curfman, Mohror, Kathryn, Moore, Shirley, Moreland, Ken, Roser, Rob, Shende, Sameer, Shipman, Galen, and Williams, Samuel
- Published
- 2022
38. Sea: A lightweight data-placement library for Big Data scientific computing
- Author
-
Hayot-Sasson, Valérie, Dugré, Mathieu, and Glatard, Tristan
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
The recent influx of open scientific data has contributed to the transitioning of scientific computing from compute-intensive to data-intensive. Whereas many Big Data frameworks exist that minimize the cost of data transfers, few scientific applications integrate these frameworks or adopt data-placement strategies to mitigate those costs. Scientific applications commonly rely on well-established command-line tools that would require complete reinstrumentation in order to incorporate existing frameworks. We developed Sea as a means to enable data-placement strategies for scientific applications executing on HPC clusters without the need to reinstrument workflows. Sea leverages GNU C library interception to intercept POSIX-compliant file system calls made by the applications. We designed a performance model and evaluated the performance of Sea on a synthetic data-intensive application processing a representative neuroimaging dataset (the Big Brain). Our results demonstrate that Sea significantly improves performance, up to a factor of 3$\times$.
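Sea's interception happens at the glibc level (e.g. via preloading), so unmodified binaries are redirected transparently. The same idea can be mimicked, at a much higher level, by interposing on Python's built-in open() to redirect reads to a fast storage tier. This is purely a conceptual sketch; the fast-tier layout and names are hypothetical and unrelated to Sea's actual mechanism:

```python
import builtins
import os
import tempfile

FAST_TIER = tempfile.mkdtemp()     # stand-in for a node-local SSD/tmpfs tier
_real_open = builtins.open

def placed_open(path, mode="r", *args, **kwargs):
    """Serve reads from a fast-tier copy when one exists, mimicking (in
    spirit only) interception of POSIX open() beneath an application."""
    cached = os.path.join(FAST_TIER, os.path.basename(str(path)))
    if mode.startswith("r") and os.path.exists(cached):
        path = cached
    return _real_open(path, mode, *args, **kwargs)

builtins.open = placed_open

# Stage a copy of "data.txt" on the fast tier, then open it by its
# original name: the interposed open() silently serves the fast copy.
with _real_open(os.path.join(FAST_TIER, "data.txt"), "w") as fh:
    fh.write("fast-tier copy")
content = open("data.txt").read()
builtins.open = _real_open         # restore the real open()
```

The appeal of doing this beneath the C library, as Sea does, is that command-line tools need no such language-level hook at all.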
- Published
- 2022
39. The Cavendish Computors: The women working in scientific computing for Radio Astronomy
- Author
-
Allan, Verity
- Subjects
Astrophysics - Instrumentation and Methods for Astrophysics ,Computer Science - Computers and Society ,Physics - History and Philosophy of Physics - Abstract
A discussion of the history of scientific computing for Radio Astronomy in the Cavendish Laboratory of the University of Cambridge in the decades after the Second World War. This covers the development of the aperture synthesis technique for Radio Astronomy and how that required using the new computing technology developed by the University's Mathematical Laboratory: the EDSAC, EDSAC 2 and TITAN computers. It looks at the scientific advances made by the Radio Astronomy group, particularly the assembling of evidence which contradicted the Steady State Hypothesis. It also examines the software advances that allowed bigger telescopes to be built: the Fast Fourier Transform (FFT) and the degridding algorithm. Throughout, the contribution of women is uncovered, from the diagrams they drew for scientific publications, through programming and operating computers, to writing scientific papers., Comment: First presented at the Joint BSHM CSHPM/SCHPM Conference People, Places, Practices at St Andrews, July 2021
- Published
- 2022
- Full Text
- View/download PDF
40. Construction of a specialized integrated simulation platform for molecule screening based on scientific computing workflow engine
- Author
-
Gou, Chengqiu, Li, Jifeng, Li, Yufeng, Liu, Jian, Zhao, Shicao, Xiao, Yonghao, and Duan, Bowen
- Published
- 2023
- Full Text
- View/download PDF
41. AI chatbots in programming education: Students’ use in a scientific computing course and consequences for learning
- Author
-
Suzanne Groothuijsen, Antoine van den Beemt, Joris C. Remmers, and Ludo W. van Meeuwen
- Subjects
AI chatbots ,ChatGPT ,Programming education ,Pair programming ,Student learning ,Engineering education ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Teaching and learning in higher education require adaptation following students' inevitable use of AI chatbots. This study contributes to the empirical literature on students' use of AI chatbots and how they influence learning. The aim of this study is to identify how to adapt programming education in higher engineering education. A mixed-methods case study was conducted of a scientific computing course in a Mechanical Engineering Master's program at Eindhoven University of Technology in the Netherlands. Data consisted of 29 student questionnaires, a semi-structured group interview with three students, a semi-structured interview with the teacher, and 29 students' grades. Results show that students used ChatGPT for error checking and debugging of code, increasing conceptual understanding, generating and optimizing solution code, explaining code, and solving mathematical problems. While students reported advantages of using ChatGPT, the teacher expressed concerns over declining code quality and student learning. Furthermore, both students and the teacher perceived a negative influence of ChatGPT usage on pair programming, and consequently on student collaboration. The findings suggest that learning objectives should be formulated in more detail, to highlight essential programming skills, and be expanded to include the use of AI tools. Complex programming assignments remain appropriate in programming education, but pair programming as a didactic approach should be reconsidered in light of the growing use of AI chatbots.
- Published
- 2024
- Full Text
- View/download PDF
42. No installation required: how WebAssembly is changing scientific computing
- Author
-
Perkel, Jeffrey M.
- Published
- 2024
- Full Text
- View/download PDF
43. Anderson acceleration with approximate calculations: applications to scientific computing
- Author
-
Pasini, Massimiliano Lupo and Laiu, M. Paul
- Subjects
Mathematics - Numerical Analysis ,Astrophysics - Instrumentation and Methods for Astrophysics ,Mathematical Physics ,65F10, 65F50, 65G30, 65G50, 65N12, 65N15, 65Y20, 65Z05, 68T01, 68W20, 68W40 ,G.1.3 ,G.1.10 - Abstract
We provide rigorous theoretical bounds for Anderson acceleration (AA) that allow for approximate calculations when applied to solve linear problems. We show that, when the approximate calculations satisfy the provided error bounds, the convergence of AA is maintained while the computational time can be reduced. We also provide computable heuristic quantities, guided by the theoretical error bounds, which can be used to automate the tuning of accuracy while performing approximate calculations. For linear problems, the use of heuristics to monitor the error introduced by approximate calculations, combined with a check on the monotonicity of the residual, ensures the convergence of the numerical scheme within a prescribed residual tolerance. Motivated by the theoretical studies, we propose a reduced variant of AA, which consists of projecting the least-squares problem used to compute the Anderson mixing onto a subspace of reduced dimension. The dimensionality of this subspace adapts dynamically at each iteration as prescribed by the computable heuristic quantities. We numerically assess the performance of AA with approximate calculations on: (i) linear deterministic fixed-point iterations arising from Richardson's scheme to solve linear systems with open-source benchmark matrices and various preconditioners and (ii) non-linear deterministic fixed-point iterations arising from non-linear time-dependent Boltzmann equations., Comment: 22 pages, 4 figures, 3 tables
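For reference, the least-squares Anderson mixing that the paper's reduced variant projects onto a smaller subspace can be written in a few lines. This is the standard windowed AA in its difference formulation, not the authors' approximate-calculation variant:

```python
import numpy as np

def anderson(g, x0, m=5, tol=1e-10, max_iter=100):
    """Windowed Anderson acceleration for the fixed-point problem x = g(x)."""
    xs, gs = [x0], [g(x0)]
    for _ in range(max_iter):
        # residuals f_k = g(x_k) - x_k for the current history
        fs = [gk - xk for xk, gk in zip(xs, gs)]
        if np.linalg.norm(fs[-1]) < tol:
            break
        n = min(m, len(fs))
        if n == 1:
            x_new = gs[-1]                        # plain Picard step
        else:
            F = np.column_stack(fs[-n:])
            dF = F[:, 1:] - F[:, :-1]
            # least-squares Anderson mixing (difference formulation)
            gamma, *_ = np.linalg.lstsq(dF, fs[-1], rcond=None)
            G = np.column_stack(gs[-n:])
            dG = G[:, 1:] - G[:, :-1]
            x_new = gs[-1] - dG @ gamma
        xs.append(x_new)
        gs.append(g(x_new))
    return xs[-1]

# Linear fixed-point iteration x = M x + b (spectral radius < 1)
M = np.array([[0.5, 0.2], [0.1, 0.4]])
b = np.ones(2)
x = anderson(lambda v: M @ v + b, np.zeros(2))
```

The `lstsq` call is exactly the least-squares problem whose dimension the paper's reduced variant shrinks adaptively.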
- Published
- 2022
44. Scientific Computing Plan for the ECCE Detector at the Electron Ion Collider
- Author
-
Bernauer, J. C., Dean, C. T., Fanelli, C., Huang, J., Kauder, K., Lawrence, D., Osborn, J. D., Paus, C., Adkins, J. K., Akiba, Y., Albataineh, A., Amaryan, M., Arsene, I. C., Gayoso, C. Ayerbe, Bae, J., Bai, X., Baker, M. D., Bashkanov, M., Bellwied, R., Benmokhtar, F., Berdnikov, V., Bock, F., Boeglin, W., Borysova, M., Brash, E., Brindza, P., Briscoe, W. J., Brooks, M., Bueltmann, S., Bukhari, M. H. S., Bylinkin, A., Capobianco, R., Chang, W. -C., Cheon, Y., Chen, K., Chen, K. -F., Cheng, K. -Y., Chiu, M., Chujo, T., Citron, Z., Cline, E., Cohen, E., Cormier, T., Morales, Y. Corrales, Cotton, C., Crafts, J., Crawford, C., Creekmore, S., Cuevas, C., Cunningham, J., David, G., Demarteau, M., Diehl, S., Doshita, N., Dupré, R., Durham, J. M., Dzhygadlo, R., Ehlers, R., Fassi, L. El, Emmert, A., Ent, R., Fatemi, R., Fegan, S., Finger, M., Finger Jr., M., Frantz, J., Friedman, M., Friscic, I., Gangadharan, D., Gardner, S., Gates, K., Geurts, F., Gilman, R., Glazier, D., Glimos, E., Goto, Y., Grau, N., Greene, S. V., Guo, A. Q., Guo, L., Ha, S. K., Haggerty, J., Hayward, T., He, X., Hen, O., Higinbotham, D. W., Hoballah, M., Horn, T., Hoghmrtsyan, A., Hsu, P. -h. J., Huber, G., Hutson, A., Hwang, K. Y., Hyde, C., Inaba, M., Iwata, T., Jo, H. S., Joo, K., Kalantarians, N., Kalicy, G., Kawade, K., Kay, S. J. D., Kim, A., Kim, B., Kim, C., Kim, M., Kim, Y., Kistenev, E., Klimenko, V., Ko, S. H., Korover, I., Korsch, W., Krintiras, G., Kuhn, S., Kuo, C. -M., Kutz, T., Lajoie, J., Lebedev, S., Lee, H., Lee, J. S. H., Lee, S. W., Lee, Y. -J., Li, W., Li, X., Liang, Y. T., Lim, S., Lin, C. -h., Lin, D. X., Liu, K., Liu, M. X., Livingston, K., Liyanage, N., Llope, W. J., Loizides, C., Long, E., Lu, R. -S., Lu, Z., Lynch, W., Marchand, D., Marcisovsky, M., Markowitz, P., Marukyan, H., McGaughey, P., Mihovilovic, M., Milner, R. G., Milov, A., Miyachi, Y., Mkrtchyan, A., Monaghan, P., Montgomery, R., Morrison, D., Movsisyan, A., Mkrtchyan, H., Camacho, C. 
Munoz, Murray, M., Nagai, K., Nagle, J., Nakagawa, I., Nattrass, C., Nguyen, D., Niccolai, S., Nouicer, R., Nukazuka, G., Nycz, M., Okorokov, V. A., Orešić, S., O'Shaughnessy, C., Paganis, S., Papandreou, Z, Pate, S. F., Patel, M., Penman, G., Perdekamp, M. G., Perepelitsa, D. V., da Costa, H. Periera, Peters, K., Phelps, W., Piasetzky, E., Pinkenburg, C., Prochazka, I., Protzman, T., Purschke, M. L., Putschke, J., Pybus, J. R., Rajput-Ghoshal, R., Rasson, J., Raue, B., Read, K., Røed, K., Reed, R., Reinhold, J., Renner, E. L., Richards, J., Riedl, C., Rinn, T., Roche, J., Roland, G. M., Ron, G., Rosati, M., Royon, C., Ryu, J., Salur, S., Santiesteban, N., Santos, R., Sarsour, M., Schambach, J., Schmidt, A., Schmidt, N., Schwarz, C., Schwiening, J., Seidl, R., Sickles, A., Simmerling, P., Sirca, S., Sharma, D., Shi, Z., Shibata, T. -A., Shih, C. -W., Shimizu, S., Shrestha, U., Slifer, K., Smith, K., Sokhan, D., Soltz, R., Sondheim, W., Song, J., Strakovsky, I. I., Steinberg, P., Stepanov, P., Stevens, J., Strube, J., Sun, P., Sun, X., Suresh, K., Tadevosyan, V., Tang, W. -C., Araya, S. Tapia, Tarafdar, S., Teodorescu, L., Timmins, A., Tomasek, L., Trotta, N., Trotta, R., Tveter, T. S., Umaka, E., Usman, A., van Hecke, H. W., Van Hulse, C., Velkovska, J., Voutier, E., Wang, P. K., Wang, Q., Wang, Y., Watts, D. P., Wickramaarachchi, N., Weinstein, L., Williams, M., Wong, C. -P., Wood, L., Wood, M. H., Woody, C., Wyslouch, B., Xiao, Z., Yamazaki, Y., Yang, Y., Ye, Z., Yoo, H. D., Yurov, M., Zachariou, N., Zajc, W. A., Zhang, J., Zhang, Y., Zhao, Y. X., Zheng, X., and Zhuang, P.
- Subjects
Physics - Instrumentation and Detectors ,High Energy Physics - Experiment ,Nuclear Experiment ,Physics - Computational Physics - Abstract
The Electron Ion Collider (EIC) is the next generation of precision QCD facility to be built at Brookhaven National Laboratory in conjunction with Thomas Jefferson National Laboratory. There are a significant number of software and computing challenges that need to be overcome at the EIC. During the EIC detector proposal development period, the ECCE consortium began identifying and addressing these challenges in the process of producing a complete detector proposal based upon detailed detector and physics simulations. In this document, the software and computing efforts to produce this proposal are discussed; furthermore, the computing and software model and resources required for the future of ECCE are described.
- Published
- 2022
- Full Text
- View/download PDF
45. A C++17 thread pool for high-performance scientific computing
- Author
-
Barak Shoshany
- Subjects
C++ ,Parallel computing ,High-performance computing ,Computer software ,QA76.75-76.765 - Abstract
We present a modern C++17-compatible thread pool implementation, built from scratch with high-performance scientific computing in mind. The thread pool is implemented as a single lightweight and self-contained class, and does not have any dependencies other than the C++17 standard library, thus allowing a great degree of portability. In particular, our implementation does not utilize any high-level multithreading APIs, and thus gives the programmer precise low-level control over the details of the parallelization, which permits more robust optimizations. The thread pool was extensively tested on both AMD and Intel CPUs with up to 40 cores and 80 threads.
- Published
- 2024
- Full Text
- View/download PDF
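The single-class, dependency-free task-queue pattern this abstract describes can be sketched as follows. This is a minimal Python illustration of the same design (worker threads draining a shared queue); the paper's actual implementation is a C++17 class, and every name and signature here is illustrative rather than the library's API:

```python
import threading
import queue

class ThreadPool:
    """Minimal thread pool: worker threads drain a shared task queue.

    A sketch of the task-queue pattern; the paper's real implementation
    is a self-contained C++17 class with a richer interface.
    """

    def __init__(self, num_threads=4):
        self.tasks = queue.Queue()
        self.workers = [
            threading.Thread(target=self._worker, daemon=True)
            for _ in range(num_threads)
        ]
        for w in self.workers:
            w.start()

    def _worker(self):
        while True:
            func, args, out = self.tasks.get()
            if func is None:            # sentinel: shut this worker down
                self.tasks.task_done()
                return
            try:
                out.append(func(*args))  # store the result in the one-slot "future"
            finally:
                self.tasks.task_done()

    def submit(self, func, *args):
        out = []                         # one-slot "future" filled by a worker
        self.tasks.put((func, args, out))
        return out

    def shutdown(self):
        self.tasks.join()                # wait for all queued work to finish
        for _ in self.workers:
            self.tasks.put((None, (), None))
        for w in self.workers:
            w.join()

pool = ThreadPool(num_threads=4)
futures = [pool.submit(lambda x: x * x, i) for i in range(8)]
pool.shutdown()
squares = sorted(f[0] for f in futures)
print(squares)  # [0, 1, 4, 9, 16, 25, 36, 49]
```

The one-slot lists stand in for futures; a production pool (including the paper's) would return proper future objects and handle exceptions explicitly.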
46. Towards Robust Deep Active Learning for Scientific Computing
- Author
-
Ren, Simiao, Deng, Yang, Padilla, Willie J., and Malof, Jordan
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Deep learning (DL) is revolutionizing the scientific computing community. To reduce the data gap, active learning has been identified as a promising solution for DL in scientific computing. However, the deep active learning (DAL) literature is dominated by image classification problems and pool-based methods. Here we investigate the robustness of pool-based DAL methods for scientific computing problems (dominated by regression), where DNNs are increasingly used. We show that modern pool-based DAL methods all share an untunable hyperparameter, termed the pool ratio and denoted $\gamma$, which is often assumed to be known a priori in the literature. We evaluate the performance of five state-of-the-art DAL methods on six benchmark problems when $\gamma$ is \textit{not} known - a more realistic assumption for scientific computing problems. Our results indicate that this reduces the performance of modern DAL methods, which can sometimes even perform worse than random sampling, creating significant uncertainty when they are used in real-world settings. To overcome this limitation we propose, to our knowledge, the first query synthesis DAL method for regression, termed NA-QBC. NA-QBC removes the sensitive $\gamma$ hyperparameter and, on average, outperforms the other DAL methods on our benchmark problems. Crucially, NA-QBC always outperforms random sampling, providing more robust performance benefits.
- Published
- 2022
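The pool-based setup the abstract critiques can be sketched in a few lines: a query-by-committee acquisition loop that, each round, subsamples the unlabeled pool according to the pool ratio $\gamma$ and labels the candidate the committee disagrees on most. This NumPy-only sketch uses a bootstrap committee of cubic polynomial fits on a toy 1-D regression target; the target function, models, and numbers are illustrative, not the paper's benchmarks, and the paper's own NA-QBC (a query synthesis method) is not reproduced here:

```python
import numpy as np

rng = np.random.default_rng(0)

def f(x):
    return np.sin(3 * x)  # hypothetical 1-D regression target

pool = rng.uniform(-1, 1, size=1000)   # unlabeled candidate pool
gamma = 0.1                            # the pool-ratio hyperparameter
X = list(rng.uniform(-1, 1, size=8))   # small initial labeled set
y = [f(x) for x in X]

def committee_disagreement(X, y, query, n_models=5, degree=3):
    """Query-by-committee: bootstrap polynomial fits, per-point std of predictions."""
    preds = []
    for _ in range(n_models):
        idx = rng.integers(0, len(X), size=len(X))  # bootstrap resample
        coeffs = np.polyfit(np.asarray(X)[idx], np.asarray(y)[idx], degree)
        preds.append(np.polyval(coeffs, query))
    return np.std(preds, axis=0)

for _ in range(20):
    # gamma controls how much of the pool each acquisition round sees
    sub = rng.choice(pool, size=int(gamma * len(pool)), replace=False)
    scores = committee_disagreement(X, y, sub)
    x_new = sub[int(np.argmax(scores))]  # label the most disputed candidate
    X.append(x_new)
    y.append(f(x_new))

grid = np.linspace(-1, 1, 200)
coeffs = np.polyfit(X, y, 3)
rmse = float(np.sqrt(np.mean((np.polyval(coeffs, grid) - f(grid)) ** 2)))
print(f"labeled points: {len(X)}, RMSE: {rmse:.3f}")
```

The point the paper makes is visible in this loop: performance depends on the arbitrary choice of `gamma`, which in practice cannot be tuned because tuning it would itself require labels.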
47. Fusion Energy Sciences Exascale Requirements Review. An Office of Science review sponsored jointly by Advanced Scientific Computing Research and Fusion Energy Sciences, January 27-29, 2016, Gaithersburg, Maryland
- Author
-
Chang, Choong-Seock, Greenwald, Martin, Riley, Katherine, Antypas, Katie, Coffey, Richard, Dart, Eli, Dosanjh, Sudip, Gerber, Richard, Hack, James, Monga, Inder, Papka, Michael E, Rotman, Lauren, Straatsma, Tjerk, Wells, Jack, Andre, R, Bernholdt, David, Bhattacharjee, Amitava, Bonoli, Paul, Boyd, Iain, Bulanov, Stepan, Cary, John R, Chen, Yang, Curreli, Davide, Ernst, Darin R, Ethier, Stephane, Green, David, Hager, Robert, Hakim, Ammar, Hassanein, A, Hatch, David, Held, ED, Howard, Nathan, Izzo, Valerie A, Jardin, Steve, Jenkins, TG, Jenko, Frank, Kemp, Andreas, King, Jacob, Kritz, Arnold, Krstic, Predrag, Kruger, Scott E, Kurtz, Rick, Lin, Zhihong, Loring, Burlen, Nandipati, Giridhar, Pankin, AY, Parker, Scott, Perez, Danny, Pigarov, Alex Y, Poli, Francesca, Pueschel, MJ, Rafiq, Tariq, Rübel, Oliver, Setyawan, Wahyu, Sizyuk, Valeryi A, Smithe, DN, Sovinec, CR, Turner, Miles, Umansky, Maxim, Vay, Jean-Luc, Verboncoeur, John, Vincenti, Henri, Voter, Arthur, Wang, Weixing, Wirth, Brian, Wright, John, and Yuan, X
- Published
- 2022
48. Biological and Environmental Research Exascale Requirements Review. An Office of Science review sponsored jointly by Advanced Scientific Computing Research and Biological and Environmental Research, March 28-31, 2016, Rockville, Maryland
- Author
-
Arkin, Adam, Bader, David C, Coffey, Richard, Antypas, Katie, Bard, Deborah, Dart, Eli, Dosanjh, Sudip, Gerber, Richard, Hack, James, Monga, Inder, Papka, Michael E, Riley, Katherine, Rotman, Lauren, Straatsma, Tjerk, Wells, Jack, Aluru, Srinivas, Andersen, Amity, Aprá, Edoardo, Azad, Ariful, Bates, Susan, Blaby, Ian, Blaby-Haas, Crysten, Bonneau, Rich, Bowen, Ben, Bradford, Mark A, Brodie, Eoin, Brown, James Ben, Buluc, Aydin, Bernholdt, David, Bylaska, Eric, Calvin, Kate, Cannon, Bill, Chen, Xingyuan, Cheng, Xiaolin, Cheung, Margaret, Chowdhary, Kenny, Colella, Phillip, Collins, Bill, Compo, Gil, Crowley, Mike, Debusschere, Bert, D’Imperio, Nicholas, Dror, Ron, Egan, Rob, Evans, Katherine, Friedberg, Iddo, Fyke, Jeremy, Gao, Zheng, Georganas, Evangelos, Giraldo, Frank, Gnanakaran, Gnana, Govind, Niri, Grandy, Stuart, Gustafson, Bill, Hammond, Glenn, Hargrove, William, Heroux, Michael, Hoffman, Forrest, Hofmeyr, Steven, Hunke, Elizabeth, Jackson, Charles, Jacob, Rob, Jacobson, Dan, Jacobson, Matt, Jain, Chirag, Johansen, Hans, Johnson, Jeff, Jones, Andy, Jones, Phil, Kalyanaraman, Ananth, Kang, Senghwa, King, Eric, Koanantakool, Penporn, Kollias, Pavlos, Kopera, Michal, Kotamarthi, Rao, Kowalski, Karol, Kumar, Jitendra, Kyrpides, Nikos, Leung, Ruby, Li, Xiaolin, Lin, Wuyin, Link, Robert, Liu, Yangang, Loew, Leslie, Luke, Edward, Ma, Hsi-Yen, Mahadevan, Radhakrishnan, Maranas, Costas, Martin, Daniel, Maslowski, Wieslaw, McCue, Lee Ann, McInnes, Lois Curfman, Mills, Richard, Molins Rafa, Sergi, Morozov, Dmitriy, Mostafavi, Sara, Moulton, David J, Mourao, Zenaida, and Najm, Habib
- Published
- 2022
49. Crosscut report: Exascale Requirements Reviews, March 9–10, 2017 – Tysons Corner, Virginia. An Office of Science review sponsored by: Advanced Scientific Computing Research, Basic Energy Sciences, Biological and Environmental Research, Fusion Energy Sciences, High Energy Physics, Nuclear Physics
- Author
-
Gerber, Richard, Hack, James, Riley, Katherine, Antypas, Katie, Coffey, Richard, Dart, Eli, Straatsma, Tjerk, Wells, Jack, Bard, Deborah, Dosanjh, Sudip, Monga, Inder, Papka, Michael E, and Rotman, Lauren
- Published
- 2022
50. Predicting Slow Network Transfers in Scientific Computing
- Author
-
Shao, Robin, Kim, Jinoh, Sim, Alex, and Wu, Kesheng
- Abstract
Data access throughput is one of the key performance metrics in scientific computing, particularly for distributed data-intensive applications. While there has been a body of studies focusing on elephant connections that consume a significant fraction of network bandwidth, this study focuses on predicting slow connections that create bottlenecks in distributed workflows. In this study, we analyze network traffic logs collected between January 2019 and May 2021 at the National Energy Research Scientific Computing Center (NERSC). Based on the patterns observed in this data collection, we define a set of features to be used for identifying low-performing data transfers. Through extensive feature engineering and feature selection, we identify a number of new features that significantly enhance the prediction performance. With these new features, even a relatively simple decision tree model can predict slow connections with an F1 score as high as 0.945.
- Published
- 2022
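The evaluation metric in this entry, F1 for a binary "slow transfer" predictor, works as sketched below. The single-feature threshold rule and the synthetic throughput numbers are illustrative stand-ins for the paper's engineered features and decision-tree model:

```python
# Tiny synthetic example: flag transfers as "slow" by a throughput
# threshold and score the predictor with F1. The feature, threshold,
# and data are made up for illustration only.
transfers = [
    # (throughput_MBps, actually_slow)
    (920, False), (15, True), (480, False), (8, True),
    (260, False), (70, True), (700, False), (40, False),
    (12, True), (55, True),
]

THRESHOLD = 60  # predict "slow" for anything under 60 MB/s

tp = fp = fn = 0
for mbps, slow in transfers:
    predicted_slow = mbps < THRESHOLD
    if predicted_slow and slow:
        tp += 1           # correctly flagged bottleneck
    elif predicted_slow and not slow:
        fp += 1           # false alarm
    elif slow and not predicted_slow:
        fn += 1           # missed bottleneck

precision = tp / (tp + fp)
recall = tp / (tp + fn)
f1 = 2 * precision * recall / (precision + recall)
print(f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f}")
# precision=0.80 recall=0.80 F1=0.80
```

F1 is the harmonic mean of precision and recall, so the paper's reported 0.945 means the model simultaneously raises few false alarms and misses few genuinely slow connections.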