Author: "Guyot, Cyril" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Guyot, Cyril"' showing total 21 results

Start Over Author "Guyot, Cyril" Publication Year Range Last 10 years

21 results on '"Guyot, Cyril"'

1. AttMEMO : Accelerating Transformers with Memoization on Big Memory Systems

Author: Feng, Yuan, Jeon, Hyeran, Blagojevic, Filip, Guyot, Cyril, Li, Qing, and Li, Dong
Subjects: Computer Science - Performance, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Transformer models gain popularity because of their superior inference accuracy and inference throughput. However, the transformer is computation-intensive, causing a long inference time. The existing works on transformer inference acceleration have limitations caused by either the modification of transformer architectures or the need of specialized hardware. In this paper, we identify the opportunities of using memoization to accelerate the self-attention mechanism in transformers without the above limitations. Built upon a unique observation that there is rich similarity in attention computation across inference sequences, we build a memoization database that leverages the emerging big memory system. We introduce a novel embedding technique to find semantically similar inputs to identify computation similarity. We also introduce a series of techniques such as memory mapping and selective memoization to avoid memory copy and unnecessary overhead. We enable 22% inference-latency reduction on average (up to 68%) with negligible loss in inference accuracy.
Published: 2023

2. Optimizing Write Fidelity of MRAMs via Iterative Water-filling Algorithm

Author: Kim, Yongjune, Jeon, Yoocharn, Choi, Hyeokjin, Guyot, Cyril, and Cassuto, Yuval
Subjects: Computer Science - Emerging Technologies, Computer Science - Hardware Architecture, Computer Science - Information Theory
Abstract: Magnetic random-access memory (MRAM) is a promising memory technology due to its high density, non-volatility, and high endurance. However, achieving high memory fidelity incurs significant write-energy costs, which should be reduced for large-scale deployment of MRAMs. In this paper, we formulate a \emph{biconvex} optimization problem to optimize write fidelity given energy and latency constraints. The basic idea is to allocate non-uniform write pulses depending on the importance of each bit position. The fidelity measure we consider is mean squared error (MSE), for which we optimize write pulses via \emph{alternating convex search (ACS)}. By using Karush-Kuhn-Tucker (KKT) conditions, we derive analytic solutions and propose an \emph{iterative water-filling-type} algorithm by leveraging the analytic solutions. Hence, the proposed iterative water-filling algorithm is computationally more efficient than the original ACS while their solutions are identical. Although the original ACS and the proposed iterative water-filling algorithm do not guarantee global optimality, the MSEs obtained by the proposed algorithm are comparable to the MSEs by complicated global nonlinear programming solvers. Furthermore, we prove that the proposed algorithm can reduce the MSE exponentially with the number of bits per word. For an 8-bit accessed word, the proposed algorithm reduces the MSE by a factor of 21. We also evaluate the proposed algorithm for MNIST dataset classification supposing that the model parameters of deep neural networks are stored in MRAMs. The numerical results show that the optimized write pulses can achieve \SI{40}{\%} write energy reduction for a given classification accuracy., Comment: arXiv admin note: substantial text overlap with arXiv:2001.03803
Published: 2021

3. On the Efficient Estimation of Min-Entropy

Author: Kim, Yongjune, Guyot, Cyril, and Kim, Young-Sik
Subjects: Computer Science - Cryptography and Security, Computer Science - Information Theory
Abstract: The min-entropy is a widely used metric to quantify the randomness of generated random numbers in cryptographic applications; it measures the difficulty of guessing the most likely output. An important min-entropy estimator is the compression estimator of NIST Special Publication (SP) 800-90B, which relies on Maurer's universal test. In this paper, we propose two kinds of min-entropy estimators to improve computational complexity and estimation accuracy by leveraging two variations of Maurer's test: Coron's test (for Shannon entropy) and Kim's test (for Renyi entropy). First, we propose a min-entropy estimator based on Coron's test. It is computationally more efficient than the compression estimator while maintaining the estimation accuracy. The secondly proposed estimator relies on Kim's test that computes the Renyi entropy. This estimator improves estimation accuracy as well as computational complexity. We analytically characterize the bias-variance tradeoff, which depends on the order of Renyi entropy. By taking into account this tradeoff, we observe that the order of two is a proper assignment and focus on the min-entropy estimation based on the collision entropy (i.e., Renyi entropy of order two). The min-entropy estimation from the collision entropy can be described by a closed-form solution, whereas both the compression estimator and the proposed estimator based on Coron's test do not have closed-form solutions. By leveraging the closed-form solution, we also propose a lightweight estimator that processes data samples in an online manner. Numerical evaluations demonstrate that the first proposed estimator achieves the same accuracy as the compression estimator with much less computation. The proposed estimator based on the collision entropy can even improve the accuracy and reduce the computational complexity.
Published: 2020
Full Text: View/download PDF

4. Optimizing the Write Fidelity of MRAMs

Author: Kim, Yongjune, Jeon, Yoocharn, Guyot, Cyril, and Cassuto, Yuval
Subjects: Computer Science - Information Theory, Computer Science - Hardware Architecture, Computer Science - Emerging Technologies
Abstract: Magnetic random-access memory (MRAM) is a promising memory technology due to its high density, non-volatility, and high endurance. However, achieving high memory fidelity incurs significant write-energy costs, which should be reduced for large-scale deployment of MRAMs. In this paper, we formulate an optimization problem for maximizing the memory fidelity given energy constraints, and propose a biconvex optimization approach to solve it. The basic idea is to allocate non-uniform write pulses depending on the importance of each bit position. The fidelity measure we consider is minimum mean squared error (MSE), for which we propose an iterative water-filling algorithm. Although the iterative algorithm does not guarantee global optimality, we can choose a proper starting point that decreases the MSE exponentially and guarantees fast convergence. For an 8-bit accessed word, the proposed algorithm reduces the MSE by a factor of 21.
Published: 2020
Full Text: View/download PDF

5. On the Optimal Refresh Power Allocation for Energy-Efficient Memories

Author: Kim, Yongjune, Choi, Won Ho, Guyot, Cyril, and Cassuto, Yuval
Subjects: Computer Science - Hardware Architecture, Computer Science - Information Theory, Electrical Engineering and Systems Science - Signal Processing
Abstract: Refresh is an important operation to prevent loss of data in dynamic random-access memory (DRAM). However, frequent refresh operations incur considerable power consumption and degrade system performance. Refresh power cost is especially significant in high-capacity memory devices and battery-powered edge/mobile applications. In this paper, we propose a principled approach to optimizing the refresh power allocation. Given a model for the bit error rate dependence on power, we formulate a convex optimization problem to minimize the word mean squared error for a refresh power constraint; hence we can guarantee the optimality of the obtained refresh power allocations. In addition, we provide an integer programming problem to optimize the discrete refresh interval assignments. For an 8-bit accessed word, numerical results show that the optimized nonuniform refresh intervals reduce the refresh power by 29% at a peak signal-to-noise ratio of 50dB compared to the uniform assignment., Comment: 6 pages
Published: 2019
Full Text: View/download PDF

6. Storage-Efficient Shared Memory Emulation

Author: Zorgui, Marwen, Mateescu, Robert, Blagojevic, Filip, Guyot, Cyril, and Wang, Zhiying
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: We study the design of storage-efficient algorithms for emulating atomic shared memory over an asynchronous, distributed message-passing system. Our first algorithm is an atomic single-writer multi-reader algorithm based on a novel erasure-coding technique, termed \emph{multi-version code}. Next, we propose an extension of our single-writer algorithm to a multi-writer multi-reader environment. Our second algorithm combines replication and multi-version code, and is suitable in situations where we expect a large number of concurrent writes. Moreover, when the number of concurrent writes is bounded, we propose a simplified variant of the second algorithm that has a simple structure similar to the single-writer algorithm. Let $N$ be the number of servers, and the shared memory variable be of size 1 unit. Our algorithms have the following properties: (i) The write operation terminates if the number of server failures is bounded by a parameter $f$. The algorithms also guarantee the termination of the read as long as the number of writes concurrent with the read is smaller than a design parameter $\nu$, and the number of server failures is bounded by $f$. (ii) The overall storage size for the first algorithm, and the steady-state storage size for the second algorithm, are all $N/\lceil \frac{N-2f}{\nu} \rceil$ units. Moreover, our simplified variant of the second algorithm achieves the worst-case storage cost of $N/\lceil \frac{N-2f}{\nu} \rceil$, asymptotically matching a lower bound by Cadambe et al. for $N \gg f, \nu \le f+1$. (iii) The write and read operations only consist of a small number (2 to 3) of communication rounds. (iv) For all algorithms, the server maintains a simple data structure. A server only needs to store the information associated with the latest value it observes, similar to replication-based algorithms.
Published: 2018

7. POSIX-based Operating System in the environment of NVM/SCM memory

Author: Dubeyko, Vyacheslav, Guyot, Cyril, Cargnini, Luis, and Manzanares, Adam
Subjects: Computer Science - Operating Systems
Abstract: Modern Operating Systems are typically POSIX-compliant. The system calls are the fundamental layer of interaction between user-space applications and the OS kernel and its implementation of fundamental abstractions and primitives used in modern computing. The next generation of NVM/SCM memory raises critical questions about the efficiency of modern OS architecture. This paper investigates how the POSIX API drives performance for a system with NVM/SCM memory. We show that OS and metadata related system calls represent the most important area of optimization. However, the synchronization related system calls (poll(), futex(), wait4()) are the most time-consuming overhead that even a RAMdisk platform fails to eliminate. Attempting to preserve the POSIX-based approach will likely result in fundamental inefficiencies for any future applications of NVM/SCM memory.
Published: 2017

8. Rate-Distortion Theory by and for Energy-Based Models

Author: Li, Qing, primary and Guyot, Cyril, additional
Published: 2024
Full Text: View/download PDF

9. Rate-Distortion via Energy-Based Models

Author: Li, Qing, primary, Kim, Yongjune, additional, and Guyot, Cyril, additional
Published: 2023
Full Text: View/download PDF

10. Batch Denoising via Blahut-Arimoto

Author: Li, Qing, primary and Guyot, Cyril, additional
Published: 2023
Full Text: View/download PDF

11. Optimized Privacy-Preserving CNN Inference With Fully Homomorphic Encryption

Author: Kim, Dongwoo, primary and Guyot, Cyril, additional
Published: 2023
Full Text: View/download PDF

12. Optimizing Write Fidelity of MRAMs by Alternating Water-Filling Algorithm

Author: Kim, Yongjune, primary, Jeon, Yoocharn, additional, Choi, Hyeokjin, additional, Guyot, Cyril, additional, and Cassuto, Yuval, additional
Published: 2022
Full Text: View/download PDF

13. On the Efficient Estimation of Min-Entropy

Author: Kim, Yongjune, primary, Guyot, Cyril, additional, and Kim, Young-Sik, additional
Published: 2021
Full Text: View/download PDF

14. Optimizing the Write Fidelity of MRAMs

Author: Kim, Yongjune, primary, Jeon, Yoocharn, additional, Guyot, Cyril, additional, and Cassuto, Yuval, additional
Published: 2020
Full Text: View/download PDF

15. On the Optimal Refresh Power Allocation for Energy-Efficient Memories

Author: Kim, Yongjune, primary, Choi, Won Ho, additional, Guyot, Cyril, additional, and Cassuto, Yuval, additional
Published: 2019
Full Text: View/download PDF

16. Garbage Collection Algorithms for Meta Data Updates in NAND Flash

Author: Qin, Minghai, primary, Mateescu, Robert, additional, Wang, Qingbo, additional, Guyot, Cyril, additional, Vucinic, Dejan, additional, and Bandic, Zvonimir, additional
Published: 2019
Full Text: View/download PDF

17. Towards Robust File System Checkers

Author: Gatla, Om Rameshwar, primary, Zheng, Mai, additional, Hameed, Muhammad, additional, Dubeyko, Viacheslav, additional, Manzanares, Adam, additional, Blagojevic, Filip, additional, Guyot, Cyril, additional, and Mateescu, Robert, additional
Published: 2018
Full Text: View/download PDF

18. Latency Tails of Byte-Addressable Non-Volatile Memories in Systems

Author: Sun, Chao, primary, Le Moal, Damien, additional, Wang, Qingbo, additional, Mateescu, Robert, additional, Blagojevic, Filip, additional, Lueker-Boden, Martin, additional, Guyot, Cyril, additional, Bandic, Zvonimir, additional, and Vucinic, Dejan, additional
Published: 2017
Full Text: View/download PDF

19. Spider Codes: Practical erasure codes for distributed storage systems

Author: Pamies-Juarez, Lluis, primary, Guyot, Cyril, additional, and Mateescu, Robert, additional
Published: 2016
Full Text: View/download PDF

20. Balanced codes for data retention of multi-level flash memories with fast page read

Author: Qin, Minghai, primary, Mateescu, Robert, additional, Guyot, Cyril, additional, and Bandic, Zvonimir, additional
Published: 2015
Full Text: View/download PDF

21. A Parallel and Pipelined Architecture for Accelerating Fingerprint Computation in High Throughput Data Storages

Author: Li, Dongyang, primary, Yang, Qing, additional, Wang, Qingbo, additional, Guyot, Cyril, additional, Narasimha, Ashwin, additional, Vucinic, Dejan, additional, and Bandic, Zvonimir, additional
Published: 2015
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

21 results on '"Guyot, Cyril"'

1. AttMEMO : Accelerating Transformers with Memoization on Big Memory Systems

2. Optimizing Write Fidelity of MRAMs via Iterative Water-filling Algorithm

3. On the Efficient Estimation of Min-Entropy

4. Optimizing the Write Fidelity of MRAMs

5. On the Optimal Refresh Power Allocation for Energy-Efficient Memories

6. Storage-Efficient Shared Memory Emulation

7. POSIX-based Operating System in the environment of NVM/SCM memory

8. Rate-Distortion Theory by and for Energy-Based Models

9. Rate-Distortion via Energy-Based Models

10. Batch Denoising via Blahut-Arimoto

11. Optimized Privacy-Preserving CNN Inference With Fully Homomorphic Encryption

12. Optimizing Write Fidelity of MRAMs by Alternating Water-Filling Algorithm

13. On the Efficient Estimation of Min-Entropy

14. Optimizing the Write Fidelity of MRAMs

15. On the Optimal Refresh Power Allocation for Energy-Efficient Memories

16. Garbage Collection Algorithms for Meta Data Updates in NAND Flash

17. Towards Robust File System Checkers

18. Latency Tails of Byte-Addressable Non-Volatile Memories in Systems

19. Spider Codes: Practical erasure codes for distributed storage systems

20. Balanced codes for data retention of multi-level flash memories with fast page read

21. A Parallel and Pipelined Architecture for Accelerating Fingerprint Computation in High Throughput Data Storages

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

21 results on '"Guyot, Cyril"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources