6 results on '"Sheriff Sadiqbatcha"'
Search Results
2. EM Lifetime Constrained Optimization for Multi-Segment Power Grid Networks
- Author
-
Zeyu Sun, Sheldon X.-D. Tan, Sheriff Sadiqbatcha, and Han Zhou
- Subjects
010302 applied physics ,Mathematical optimization ,Interconnection ,Optimization problem ,Linear programming ,Computer science ,Reliability (computer networking) ,Constrained optimization ,ComputerApplications_COMPUTERSINOTHERSYSTEMS ,02 engineering and technology ,Chip ,01 natural sciences ,020202 computer hardware & architecture ,Power (physics) ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,Power network design - Abstract
This chapter provides techniques for power grid network sizing while considering electromigration reliability. Starting with power grid network and electromigration (EM) fundamentals. Specifically the concerns here are EM immortality and aging effects used as EM constraints when formulating the optimization problems. The chapter first shows that the new power/ground (P/G) optimization problem, subject to the voltage IR drop and new EM constraints, can still be formulated as an efficient sequence of linear programming (SLP) problem, where the optimization is carried out in two linear programming phases in each iteration. The new optimization will ensure that none of the wires fails if all the constraints are satisfied. However, requiring all the wires to be EM immortal can be over-constrained. To mitigate this problem, the improvement is to consider the aging effects of interconnect wires in P/G networks. The idea is to allow some short-lifetime wires to fail and optimize the rest of the wires while considering the additional resistance caused by the failed wire segments. In this way, the resulting P/G networks can be optimized such that the target lifetime of the whole P/G networks can be ensured and will become more robust and aging-aware over the expected lifetime of the chip.
- Published
- 2020
- Full Text
- View/download PDF
3. HAT-DRL: Hotspot-Aware Task Mapping for Lifetime Improvement of Multicore System using Deep Reinforcement Learning
- Author
-
Nanpeng Yu, Sheriff Sadiqbatcha, Yuanqi Gao, Michael O'Dea, Sheldon X.-D. Tan, and Jinwei Zhang
- Subjects
Multi-core processor ,business.industry ,Computer science ,Embedded system ,Hotspot (geology) ,Task mapping ,Task analysis ,Reinforcement learning ,System on a chip ,Chip ,Reinforcement learning algorithm ,business - Abstract
In this work, we propose a novel learning-based task to core mapping technique to improve lifetime and reliability based on advanced deep reinforcement learning. The new method, called HAT-DRL, is based on the observation that on-chip temperature sensors may not capture the true hotspots of the chip, which can lead to suboptimal control decisions. In the new method, we first perform data-driven learning to model the hotspot activation indicator with respect to the resource utilization of different workloads. On top of this, we propose to employ deep reinforcement learning algorithm to improve the long-term reliability and minimize the performance degradation from NBTI/HCI effects. It penalizes continuously stressing the same hotspots and encourages even stressing of cores. The proposed algorithm is validated with an Intel i7-8650U four-core CPU platform executing CPU benchmarks for various hotspot activation profiles. Results show that HAT- DRL balances the stress between all cores and hotspots, and achieves 50% and 160% longer lifetime compared to non-hotspot-aware and Linux default scheduling respectively. The proposed method can also reduce the average temperature by exploiting the true-hotspot information.
- Published
- 2020
- Full Text
- View/download PDF
4. Full-chip thermal map estimation for commercial multi-core CPUs with generative adversarial learning
- Author
-
Sheriff Sadiqbatcha, Jinwei Zhang, Sheldon X.-D. Tan, and Wentian Jin
- Subjects
Multi-core processor ,Discriminator ,Mean squared error ,Artificial neural network ,Computer science ,0211 other engineering and technologies ,02 engineering and technology ,Chip ,Finite element method ,020202 computer hardware & architecture ,0202 electrical engineering, electronic engineering, information engineering ,Transient (computer programming) ,Algorithm ,021106 design practice & management ,Generator (mathematics) - Abstract
In this paper, we propose a novel transient full-chip thermal map estimation method for multi-core commercial CPU based on the data-driven generative adversarial learning method. We treat the thermal modeling problem as an image-generation problem using the generative neural networks. In stead of using traditional functional unit powers as input, the new models are directly based on the measurable real-time high level chip utilizations and thermal sensor information of commercial chips without any assumption of additional physical sensors requirement. The resulting thermal map estimation method, called ThermGAN can provide tool-accurate full-chip transient thermal maps from the given performance monitor traces of commercial off-the-shelf multi-core processors. In our work, both generator and discriminator are composed of simple convolutional layers with Wasserstein distance as loss function. ThermGAN can provide the transient and real-time thermal map without using any historical data for training and inferences, which is contrast with a recent RNN-based thermal map estimation method in which historical data is needed. Experimental results show the trained model is very accurate in thermal estimation with an average RMSE of 0.47°C, namely, 0.63% of the full-scale error. Our data further show that the speed of the model is faster than 7.5ms per inference, which is two orders of magnitude faster than the traditional finite element based thermal analysis. Furthermore, the new method is ~4x more accurate than recently proposed LSTM-based thermal map estimation method and has faster inference speed. It also achieves ~2x accuracy with much less computational cost than a state-of-the-art pre-silicon based estimation method.
- Published
- 2020
- Full Text
- View/download PDF
5. Machine Learning Based Online Full-Chip Heatmap Estimation
- Author
-
Yue Zhao, Sheldon X.-D. Tan, Jinwei Zhang, Jorg Henkel, Hussam Amrouch, and Sheriff Sadiqbatcha
- Subjects
020203 distributed computing ,Neutral network ,Computer science ,business.industry ,Inference ,02 engineering and technology ,Machine learning ,computer.software_genre ,Chip ,Die (integrated circuit) ,020202 computer hardware & architecture ,Power (physics) ,0202 electrical engineering, electronic engineering, information engineering ,Discrete cosine transform ,Transient (computer programming) ,Artificial intelligence ,business ,computer ,Curse of dimensionality - Abstract
Runtime power and thermal control is crucial in any modern processor. However, these control schemes require accurate real-time temperature information, ideally of the entire die area, in order to be effective. On-chip temperature sensors alone cannot provide the full-chip temperature information since the number of sensors that are typically available is very limited due to their high area and power overheads. Furthermore, as we will demonstrate, the peak locations within hot-spots are not stationary and are very workload dependent, making it difficult to rely on fixed temperature sensors alone. Therefore, we propose a novel approach to real-time estimation of full-chip transient heatmaps for commercial processors based on machine learning. The model derived in this work supplements the temperature data sensed from the existing on-chip sensors, allowing for the development of more robust runtime power and thermal control schemes that can take advantage of the additional thermal information that is otherwise not available. The new approach involves offline acquisition of accurate spatial and temporal heatmaps using an infrared thermal imaging setup while nominal working conditions are maintained on the chip. To build the dynamic thermal model, we apply Long-Short-Term-Memory (LSTM) neutral networks with system-level variables such as chip frequency, instruction counts, and other performance metrics as inputs. To reduce the dimensionality of the model, 2D spatial discrete cosine transformation (DCT) is first performed on the heatmaps so that they can be expressed with just their dominant DCT frequencies. Our study shows that only 6×6 DCT coefficients are required to maintain sufficient accuracy across a variety of workloads. Experimental results show that the proposed approach can estimate the full-chip heatmaps with less than $1.4^{o}$C root-mean-square-error and take only $\sim$19ms for each inference which suits well for real-time use.
- Published
- 2020
- Full Text
- View/download PDF
6. Real-Time Full-Chip Thermal Tracking: A Post-Silicon, Machine Learning Perspective
- Author
-
Sheriff Sadiqbatcha, Jinwei Zhang, Hussam Amrouch, and Sheldon X.-D. Tan
- Subjects
Neutral network ,Computer science ,Perspective (graphical) ,Inference ,Chip ,Tracking (particle physics) ,Theoretical Computer Science ,Computational Theory and Mathematics ,Computer engineering ,Hardware and Architecture ,Thermal ,Discrete cosine transform ,State (computer science) ,Software - Abstract
This article presents a novel approach to real-time tracking of full-chip heatmaps for off-the-shelf microprocessors based on machine-learning. The proposed post-silicon approach, named RealMaps, only uses the existing temperature sensors and workload-independent utilization information. RealMaps does not require any knowledge of the proprietary design or manufacturing process-specific details of the chip. Consequently, the methods presented in this work can be implemented by either the original chip manufacturer or a third party alike. The approach involves offline acquisition of spatial heatmaps using a thermal imaging setup. To build the dynamic thermal model, a temporal-aware long-short-term-memory neutral network is trained with system-level features as inputs. 2D discrete cosine transformation (DCT) is performed on the heatmaps so that they can be expressed with just a few dominant DCT coefficients. This allows the model to be built to estimate just the dominant spatial features of the heatmaps, rather than the entire heatmap images, making it significantly more efficient. Experimental results from two commercial chips show that RealMaps can estimate the full-chip heatmaps with 0.9C and 1.2C root-mean-square-error respectively and take only 0.4ms for each inference. Compared to the state of the art pre-silicon approach, RealMaps shows similar accuracy, but with much less computational cost.
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.