Author: "Mitzenmacher, Michael" / Publication Year Range: Last 3 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mitzenmacher, Michael"' showing total 33 results

1. Efficient Inference for Augmented Large Language Models

Author: Shahout, Rana, Liang, Cong, Xin, Shiji, Lao, Qianru, Cui, Yong, Yu, Minlan, and Mitzenmacher, Michael
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Augmented Large Language Models (LLMs) enhance the capabilities of standalone LLMs by integrating external data sources through API calls. In interactive LLM applications, efficient scheduling is crucial for maintaining low request completion times, directly impacting user engagement. However, these augmentations introduce scheduling challenges due to the need to manage limited memory for cached information (KV caches). As a result, traditional size-based scheduling algorithms, such as Shortest Job First (SJF), become less effective at minimizing completion times. Existing work focuses only on handling requests during API calls by preserving, discarding, or swapping memory without considering how to schedule requests with API calls. In this paper, we propose LAMPS, a novel LLM inference framework for augmented LLMs. LAMPS minimizes request completion time through a unified scheduling approach that considers the total length of requests and their handling strategies during API calls. Recognizing that LLM inference is memory-bound, our approach ranks requests based on their consumption of memory over time, which depends on both the output sizes and how a request is managed during its API calls. To implement our scheduling, LAMPS predicts the strategy that minimizes memory waste of a request during its API calls, aligning with but improving upon existing approaches. We also propose starvation prevention techniques and optimizations to mitigate the overhead of our scheduling. We implement LAMPS on top of vLLM and evaluate its performance against baseline LLM inference systems, demonstrating improvements in end-to-end latency by 27%-85% and reductions in TTFT by 4%-96% compared to the existing augmented-LLM system, with even greater gains over vLLM.
Published: 2024

2. Don't Stop Me Now: Embedding Based Scheduling for LLMs

Author: Shahout, Rana, Malach, Eran, Liu, Chunwei, Jiang, Weifan, Yu, Minlan, and Mitzenmacher, Michael
Subjects: Computer Science - Machine Learning
Abstract: Efficient scheduling is crucial for interactive Large Language Model (LLM) applications, where low request completion time directly impacts user engagement. Size-based scheduling algorithms like Shortest Remaining Processing Time (SRPT) aim to reduce average request completion time by leveraging known or estimated request sizes and allowing preemption by incoming jobs with shorter service times. However, two main challenges arise when applying size-based scheduling to LLM systems. First, accurately predicting output lengths from prompts is challenging and often resource-intensive, making it impractical for many systems. As a result, the state-of-the-art LLM systems default to first-come, first-served scheduling, which can lead to head-of-line blocking and reduced system efficiency. Second, preemption introduces extra memory overhead to LLM systems as they must maintain intermediate states for unfinished (preempted) requests. In this paper, we propose TRAIL, a method to obtain output predictions from the target LLM itself. After generating each output token, we recycle the embedding of its internal structure as input for a lightweight classifier that predicts the remaining length for each running request. Using these predictions, we propose a prediction-based SRPT variant with limited preemption designed to account for memory overhead in LLM systems. This variant allows preemption early in request execution when memory consumption is low but restricts preemption as requests approach completion to optimize resource utilization. On the theoretical side, we derive a closed-form formula for this SRPT variant in an M/G/1 queue model, which demonstrates its potential value. In our system, we implement this preemption policy alongside our embedding-based prediction method.
Published: 2024
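
Note: the limited-preemption idea described in the abstract above can be illustrated with a minimal sketch. This is not the TRAIL implementation. The threshold rule (a request is preemptible only during the first fraction c of its predicted length), the Request class, and the names may_preempt and pick_next are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Request:
    name: str
    predicted_total: int   # predicted output length in tokens (hypothetical field)
    generated: int = 0     # tokens produced so far

    @property
    def predicted_remaining(self):
        return max(self.predicted_total - self.generated, 0)


def may_preempt(running, c=0.5):
    """Assumed rule: preemption is allowed only early in a request's execution."""
    return running.generated < c * running.predicted_total


def pick_next(running, waiting, c=0.5):
    """Return the request to serve next under SRPT with limited preemption."""
    if not waiting:
        return running
    # SRPT choice among waiting requests: smallest predicted remaining length.
    best = min(waiting, key=lambda r: r.predicted_remaining)
    if running is None:
        return best
    shorter = best.predicted_remaining < running.predicted_remaining
    if shorter and may_preempt(running, c):
        return best
    # Either the running request is still the shortest, or it is too far
    # along (and holds too much state) to be worth preempting.
    return running
```

With c = 0.5, a request predicted at 100 tokens that has produced 10 yields to a newly arrived 20-token request, while the same request at 70 tokens keeps running even though 20 is less than its 30 predicted remaining tokens.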

3. Learning-Augmented Frequency Estimation in Sliding Windows

Author: Shahout, Rana, Sabek, Ibrahim, and Mitzenmacher, Michael
Subjects: Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning
Abstract: We show how to utilize machine learning approaches to improve sliding window algorithms for approximate frequency estimation problems, under the "algorithms with predictions" framework. In this dynamic environment, previous learning-augmented algorithms are less effective, since properties in sliding window resolution can differ significantly from the properties of the entire stream. Our focus is on the benefits of predicting and filtering out items with large next arrival times -- that is, there is a large gap until their next appearance -- from the stream, which we show improves the memory-accuracy tradeoffs significantly. We provide theorems that provide insight into how and by how much our technique can improve the sliding window algorithm, as well as experimental results using real-world data sets. Our work demonstrates that predictors can be useful in the challenging sliding window setting.
Published: 2024

4. Beyond Throughput and Compression Ratios: Towards High End-to-end Utility of Gradient Compression

Author: Han, Wenchen, Vargaftik, Shay, Mitzenmacher, Michael, Karp, Brad, and Basat, Ran Ben
Subjects: Computer Science - Machine Learning, Computer Science - Networking and Internet Architecture
Abstract: Gradient aggregation has long been identified as a major bottleneck in today's large-scale distributed machine learning training systems. One promising solution to mitigate such bottlenecks is gradient compression, directly reducing communicated gradient data volume. However, in practice, many gradient compression schemes do not achieve acceleration of the training process while also preserving accuracy. In this work, we identify several common issues in previous gradient compression systems and evaluation methods. These issues include excessive computational overheads; incompatibility with all-reduce; and inappropriate evaluation metrics, such as not using an end-to-end metric or using a 32-bit baseline instead of a 16-bit baseline. We propose several general design and evaluation techniques to address these issues and provide guidelines for future work. Our preliminary evaluation shows that our techniques enhance the system's performance and provide a clearer understanding of the end-to-end utility of gradient compression methods. Comment: 9 pages, 3 figures
Published: 2024

5. Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams

Author: Shahout, Rana and Mitzenmacher, Michael
Subjects: Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning
Abstract: Identifying heavy hitters and estimating the frequencies of flows are fundamental tasks in various network domains. Existing approaches to this challenge can broadly be categorized into two groups, hashing-based and competing-counter-based. The Count-Min sketch is a standard example of a hashing-based algorithm, and the Space Saving algorithm is an example of a competing-counter algorithm. Recent works have explored the use of machine learning to enhance algorithms for frequency estimation problems, under the algorithms with prediction framework. However, these works have focused solely on the hashing-based approach, which may not be best for identifying heavy hitters. In this paper, we present the first learned competing-counter-based algorithm, called LSS, for identifying heavy hitters, top k, and flow frequency estimation that utilizes the well-known Space Saving algorithm. We provide theoretical insights into how and to what extent our approach can improve upon Space Saving, backed by experimental results on both synthetic and real-world datasets. Our evaluation demonstrates that LSS can enhance the accuracy and efficiency of Space Saving in identifying heavy hitters, top k, and estimating flow frequencies.
Published: 2024
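
Note: the abstract above builds on the Space Saving algorithm. As background, here is a minimal sketch of classic Space Saving with k counters. It is not the learned LSS variant from the paper. The function name space_saving and the sample stream are illustrative assumptions.

```python
from collections import Counter


def space_saving(stream, k):
    """Classic Space Saving with at most k counters (not the learned LSS variant)."""
    counters = {}  # item -> estimated count; never below the true count
    for item in stream:
        if item in counters:
            counters[item] += 1
        elif len(counters) < k:
            counters[item] = 1
        else:
            # Evict the item with the smallest counter; the newcomer
            # inherits that counter's value plus one.
            victim = min(counters, key=counters.get)
            counters[item] = counters.pop(victim) + 1
    return counters


# Illustrative stream: two heavy items followed by many light ones.
stream = ["a"] * 50 + ["b"] * 30 + list("cdefghij") * 2
estimates = space_saving(stream, k=4)
true_counts = Counter(stream)
```

Every monitored item's estimate is an overestimate of its true count, and any item occurring more than len(stream) / k times is guaranteed to remain monitored, which is why the two heavy items survive the churn among the light ones.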

6. SkipPredict: When to Invest in Predictions for Scheduling

Author: Shahout, Rana and Mitzenmacher, Michael
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: In light of recent work on scheduling with predicted job sizes, we consider the effect of the cost of predictions in queueing systems, removing the assumption in prior research that predictions are external to the system's resources and/or cost-free. In particular, we introduce a novel approach to utilizing predictions, SkipPredict, designed to address their inherent cost. Rather than uniformly applying predictions to all jobs, we propose a tailored approach that categorizes jobs based on their prediction requirements. To achieve this, we employ one-bit "cheap predictions" to classify jobs as either short or long. SkipPredict prioritizes predicted short jobs over long jobs, and for the latter, SkipPredict applies a second round of more detailed "expensive predictions" to approximate Shortest Remaining Processing Time for these jobs. Our analysis takes into account the cost of prediction. We examine the effect of this cost for two distinct models. In the external cost model, predictions are generated by some external method without impacting job service times but incur a cost. In the server time cost model, predictions themselves require server processing time, and are scheduled on the same server as the jobs.
Published: 2024

7. Optimal and Near-Optimal Adaptive Vector Quantization

Author: Ben-Basat, Ran, Ben-Itzhak, Yaniv, Mitzenmacher, Michael, and Vargaftik, Shay
Subjects: Computer Science - Machine Learning, Computer Science - Data Structures and Algorithms, Computer Science - Information Theory, Computer Science - Networking and Internet Architecture
Abstract: Quantization is a fundamental optimization for many machine-learning use cases, including compressing gradients, model weights and activations, and datasets. The most accurate form of quantization is adaptive, where the error is minimized with respect to a given input, rather than optimizing for the worst case. However, optimal adaptive quantization methods are considered infeasible in terms of both their runtime and memory requirements. We revisit the Adaptive Vector Quantization (AVQ) problem and present algorithms that find optimal solutions with asymptotically improved time and space complexity. We also present an even faster near-optimal algorithm for large inputs. Our experiments show our algorithms may open the door to using AVQ more extensively in a variety of machine learning applications.
Published: 2024

8. Analyzing Generalized Pólya Urn Models using Martingales, with an Application to Viral Evolution

Author: Specht, Ivan and Mitzenmacher, Michael
Subjects: Statistics - Applications
Abstract: The randomized play-the-winner (RPW) model is a generalized Pólya Urn process with broad applications ranging from clinical trials to molecular evolution. We derive an exact expression for the variance of the RPW model by transforming the Pólya Urn process into a martingale, correcting an earlier result of Matthews and Rosenberger (1997). We then use this result to approximate the full probability mass function of the RPW model for certain parameter values relevant to genetic applications. Finally, we fit our model to genomic sequencing data of SARS-CoV-2, demonstrating a novel method of estimating the viral mutation rate that delivers comparable results to existing scientific literature. Comment: 27 pages, 2 figures
Published: 2023

9. THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression

Author: Li, Minghao, Basat, Ran Ben, Vargaftik, Shay, Lao, ChonLam, Xu, Kevin, Mitzenmacher, Michael, and Yu, Minlan
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Networking and Internet Architecture
Abstract: Deep neural networks (DNNs) are the de facto standard for essential use cases, such as image classification, computer vision, and natural language processing. As DNNs and datasets get larger, they require distributed training on increasingly larger clusters. A main bottleneck is the resulting communication overhead where workers exchange model updates (i.e., gradients) on a per-round basis. To address this bottleneck and accelerate training, a widely-deployed approach is compression. However, previous deployments often apply bi-directional compression schemes by simply using a uni-directional gradient compression scheme in each direction. This results in significant computational overheads at the parameter server and increased compression error, leading to longer training and lower accuracy. We introduce Tensor Homomorphic Compression (THC), a novel bi-directional compression framework that enables the direct aggregation of compressed values, thus eliminating the aforementioned computational overheads. Moreover, THC is compatible with in-network aggregation (INA), which allows for further acceleration. Our evaluation shows that training representative vision and language models with THC reaches target accuracy 1.40x to 1.47x faster using INA and 1.28x to 1.33x faster using a software PS compared with state-of-the-art systems. Comment: 12 pages body, 21 pages total
Published: 2023

10. Proteus: A Self-Designing Range Filter

Author: Knorr, Eric R., Lemaire, Baptiste, Lim, Andrew, Luo, Siqiang, Zhang, Huanchen, Idreos, Stratos, and Mitzenmacher, Michael
Subjects: Computer Science - Databases, Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning, F.2.m, H.3.3
Abstract: We introduce Proteus, a novel self-designing approximate range filter, which configures itself based on sampled data in order to optimize its false positive rate (FPR) for a given space requirement. Proteus unifies the probabilistic and deterministic design spaces of state-of-the-art range filters to achieve robust performance across a larger variety of use cases. At the core of Proteus lies our Contextual Prefix FPR (CPFPR) model - a formal framework for the FPR of prefix-based filters across their design spaces. We empirically demonstrate the accuracy of our model and Proteus' ability to optimize over both synthetic workloads and real-world datasets. We further evaluate Proteus in RocksDB and show that it is able to improve end-to-end performance by as much as 5.3x over more brittle state-of-the-art methods such as SuRF and Rosetta. Our experiments also indicate that the cost of modeling is not significant compared to the end-to-end performance gains and that Proteus is robust to workload shifts. Comment: 14 pages, 9 figures, originally published in the Proceedings of the 2022 International Conference on Management of Data (SIGMOD'22), ISBN: 9781450392495
Published: 2022
Full Text: View/download PDF

11. QUIC-FL: Quick Unbiased Compression for Federated Learning

Author: Basat, Ran Ben, Vargaftik, Shay, Portnoy, Amit, Einziger, Gil, Ben-Itzhak, Yaniv, and Mitzenmacher, Michael
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Data Structures and Algorithms, Computer Science - Networking and Internet Architecture
Abstract: Distributed Mean Estimation (DME), in which $n$ clients communicate vectors to a parameter server that estimates their average, is a fundamental building block in communication-efficient federated learning. In this paper, we improve on previous DME techniques that achieve the optimal $O(1/n)$ Normalized Mean Squared Error (NMSE) guarantee by asymptotically improving the complexity for either encoding or decoding (or both). To achieve this, we formalize the problem in a novel way that allows us to use off-the-shelf mathematical solvers to design the quantization.
Published: 2022

12. FRANCIS: Fast Reaction Algorithms for Network Coordination In Switches

Author: Han, Wenchen, Feng, Vic, Schwartzman, Gregory, Mitzenmacher, Michael, Yu, Minlan, and Ben-Basat, Ran
Subjects: Computer Science - Networking and Internet Architecture
Abstract: Distributed protocols are widely used to support network functions such as clock synchronization and multicast. As the network gets larger and faster, it is increasingly challenging for these protocols to react quickly to network events. The theory community has made significant progress in developing distributed message passing algorithms with improved convergence times. With the emerging programmability at switches, it now becomes feasible to adopt and adapt these theoretical advances for networking functions. In this paper, we propose FRANCIS, a new framework for running message passing algorithms on programmable switches to enable fast reactions to network events in large networks. We introduce an execution engine with computing and communication primitives for supporting message passing algorithms in P4 switches. We exemplify the framework's usefulness by improving the resiliency and reaction times of clock synchronization and source-routed multicast. In particular, our approach allows lower clock drift than Sundial and PTP, quickly recovers from multiple failures, and reduces the time uncertainty bound by up to 5x. Compared with state-of-the-art multicast solutions, our approach uses packet headers up to 33% smaller and has an order of magnitude faster reaction time.
Published: 2022

13. Tabula: Efficiently Computing Nonlinear Activation Functions for Secure Neural Network Inference

Author: Lam, Maximilian, Mitzenmacher, Michael, Reddi, Vijay Janapa, Wei, Gu-Yeon, and Brooks, David
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence
Abstract: Multiparty computation approaches to secure neural network inference commonly rely on garbled circuits for securely executing nonlinear activation functions. However, garbled circuits require excessive communication between server and client, impose significant storage overheads, and incur large runtime penalties. To reduce these costs, we propose an alternative to garbled circuits: Tabula, an algorithm based on secure lookup tables. Our approach precomputes lookup tables during an offline phase that contain the results of all possible nonlinear function calls. Because these tables incur exponential storage costs in the number of operands and the precision of the input values, we use quantization to reduce these storage costs to make this approach practical. This enables an online phase where securely computing the result of a nonlinear function requires just a single round of communication, with communication cost equal to twice the number of bits of the input to the nonlinear function. In practice our approach costs 2 bytes of communication per nonlinear function call in the online phase. Compared to garbled circuits with 8-bit quantized inputs, when computing individual nonlinear functions during the online phase, experiments show Tabula with 8-bit activations uses between 280x and 560x less communication, is over 100x faster, and uses a comparable (within a factor of 2) amount of storage; compared against other state-of-the-art protocols Tabula achieves greater than 40x communication reduction. This leads to significant performance gains over garbled circuits with quantized inputs during the online phase of secure inference of neural networks: Tabula reduces end-to-end inference communication by up to 9x and achieves an end-to-end inference speedup of up to 50x, while imposing comparable storage and offline preprocessing costs.
Published: 2022
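
Note: the table-lookup idea in the abstract above can be illustrated in plaintext. The sketch below omits all cryptography and is not the Tabula protocol; it only shows why an 8-bit quantized one-operand function needs a table of 2^8 entries that is filled offline and read online. The clipping range, the choice of sigmoid, and all names are assumptions made for illustration.

```python
import math

BITS = 8                # assumed activation precision
LEVELS = 1 << BITS      # 256 entries for a one-operand function
LO, HI = -4.0, 4.0      # assumed (hypothetical) input clipping range


def quantize(x):
    """Map a real input to an integer code in [0, LEVELS - 1]."""
    x = min(max(x, LO), HI)
    return round((x - LO) / (HI - LO) * (LEVELS - 1))


def dequantize(code):
    """Map an integer code back to the real value it represents."""
    return LO + code * (HI - LO) / (LEVELS - 1)


# Offline phase: evaluate the function on every possible quantized input.
SIGMOID_TABLE = [1.0 / (1.0 + math.exp(-dequantize(c))) for c in range(LEVELS)]


def sigmoid_lookup(x):
    """Online phase: a single table read replaces evaluating the function."""
    return SIGMOID_TABLE[quantize(x)]
```

Each extra bit of precision doubles the table size, and a two-operand function would need LEVELS squared entries, which matches the abstract's remark that storage grows exponentially in the number of operands and the input precision.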

14. Incentive Compatible Queues Without Money

Author: Grosof, Isaac and Mitzenmacher, Michael
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Performance
Abstract: For job scheduling systems, where jobs require some amount of processing and then leave the system, it is natural for each user to provide an estimate of their job's time requirement in order to aid the scheduler. However, if there is no incentive mechanism for truthfulness, each user will be motivated to provide estimates that give their job precedence in the schedule, so that the job completes as early as possible. We examine how to make such scheduling systems incentive compatible, without using monetary charges, under a natural queueing theory framework. In our setup, each user has an estimate of their job's running time, but it is possible for this estimate to be incorrect. We examine scheduling policies where if a job exceeds its estimate, it is with some probability "punished" and re-scheduled after other jobs, to disincentivize underestimates of job times. However, because user estimates may be incorrect (without any malicious intent), excessive punishment may incentivize users to overestimate their job times, which leads to less efficient scheduling. We describe two natural scheduling policies, BlindTrust and MeasuredTrust. We show that, for both of these policies, given the parameters of the system, we can efficiently determine the set of punishment probabilities that are incentive compatible, in that users are incentivized to provide their actual estimate of the job time. Moreover, we prove for MeasuredTrust that in the limit as estimates converge to perfect accuracy, the range of punishment probabilities that are incentive compatible converges to $[0,1]$. Our formalism establishes a framework for studying further queue-based scheduling problems where job time estimates from users are utilized, and the system needs to incentivize truthful reporting of estimates. Comment: 20 pages
Published: 2022

15. Direct Telemetry Access

Author: Langlet, Jonatan, Basat, Ran Ben, Oliaro, Gabriele, Mitzenmacher, Michael, Yu, Minlan, and Antichi, Gianni
Subjects: Computer Science - Networking and Internet Architecture
Abstract: Fine-grained network telemetry is becoming a modern datacenter standard and is the basis of essential applications such as congestion control, load balancing, and advanced troubleshooting. As network size increases and telemetry gets more fine-grained, there is a tremendous growth in the amount of data needed to be reported from switches to collectors to enable network-wide view. As a consequence, it is progressively hard to scale data collection systems. We introduce Direct Telemetry Access (DTA), a solution optimized for aggregating and moving hundreds of millions of reports per second from switches into queryable data structures in collectors' memory. DTA is lightweight and it is able to greatly reduce overheads at collectors. DTA is built on top of RDMA, and we propose novel and expressive reporting primitives to allow easy integration with existing state-of-the-art telemetry mechanisms such as INT or Marple. We show that DTA significantly improves telemetry collection rates. For example, when used with INT, it can collect and aggregate over 400M reports per second with a single server, improving over the Atomic MultiLog by up to 16x. Comment: As appearing in the proceedings of ACM SIGCOMM'23
Published: 2022

16. Algorithmic Tools for Understanding the Motif Structure of Networks

Author: Chen, Tianyi, Matejek, Brian, Mitzenmacher, Michael, and Tsourakakis, Charalampos E.
Editor: Amini, Massih-Reza, Canu, Stéphane, Fischer, Asja, Guns, Tias, Kralj Novak, Petra, and Tsoumakas, Grigorios
Series Editor: Goos, Gerhard (Founding Editor), Hartmanis, Juris (Founding Editor), Bertino, Elisa (Editorial Board Member), Gao, Wen (Editorial Board Member), Steffen, Bernhard (Editorial Board Member), and Yung, Moti (Editorial Board Member)
Published: 2023
Full Text: View/download PDF

17. Algorithmic Tools for Understanding the Motif Structure of Networks

Author: Chen, Tianyi, primary, Matejek, Brian, additional, Mitzenmacher, Michael, additional, and Tsourakakis, Charalampos E., additional
Published: 2023
Full Text: View/download PDF

18. Leveraging Parameterized Chernoff Bounds for Simplified Algorithm Analyses

Author: Dillencourt, Michael, primary, Goodrich, Michael T., additional, and Mitzenmacher, Michael, additional
Published: 2024
Full Text: View/download PDF

19. Designing sensitive viral diagnostics with machine learning

Author: Metsky, Hayden C., Welch, Nicole L., Pillai, Priya P., Haradhvala, Nicholas J., Rumker, Laurie, Mantena, Sreekar, Zhang, Yibin B., Yang, David K., Ackerman, Cheri M., Weller, Juliane, Blainey, Paul C., Myhrvold, Cameron, Mitzenmacher, Michael, and Sabeti, Pardis C.
Published: 2022
Full Text: View/download PDF

20. Edge-colored directed subgraph enumeration on the connectome

Author: Matejek, Brian, Wei, Donglai, Chen, Tianyi, Tsourakakis, Charalampos E., Mitzenmacher, Michael, and Pfister, Hanspeter
Published: 2022
Full Text: View/download PDF

21. Inferring Viral Transmission Pathways from Within-Host Variation

Author: Specht, Ivan O. A., primary, Petros, Brittany A., additional, Moreno, Gage K., additional, Brock-Fisher, Taylor, additional, Krasilnikova, Lydia A., additional, Schifferli, Mark, additional, Yang, Katherine, additional, Cronan, Paul, additional, Glennon, Olivia, additional, Schaffner, Stephen F., additional, Park, Daniel J., additional, MacInnis, Bronwyn L., additional, Ozonoff, Al, additional, Fry, Ben, additional, Mitzenmacher, Michael D., additional, Varilly, Patrick, additional, and Sabeti, Pardis C., additional
Published: 2023
Full Text: View/download PDF

22. Direct Telemetry Access

Author: Langlet, Jonatan, primary, Ben Basat, Ran, additional, Oliaro, Gabriele, additional, Mitzenmacher, Michael, additional, Yu, Minlan, additional, and Antichi, Gianni, additional
Published: 2023
Full Text: View/download PDF

23. Designing sensitive viral diagnostics with machine learning

Author: Massachusetts Institute of Technology. Department of Biological Engineering, Metsky, Hayden C, Welch, Nicole L, Pillai, Priya P, Haradhvala, Nicholas J, Rumker, Laurie, Mantena, Sreekar, Zhang, Yibin B, Yang, David K, Ackerman, Cheri M, Weller, Juliane, Blainey, Paul C, Myhrvold, Cameron, Mitzenmacher, Michael, and Sabeti, Pardis C
Abstract: Design of nucleic acid-based viral diagnostics typically follows heuristic rules and, to contend with viral variation, focuses on a genome’s conserved regions. A design process could, instead, directly optimize diagnostic effectiveness using a learned model of sensitivity for targets and their variants. Toward that goal, we screen 19,209 diagnostic–target pairs, concentrated on CRISPR-based diagnostics, and train a deep neural network to accurately predict diagnostic readout. We join this model with combinatorial optimization to maximize sensitivity over the full spectrum of a virus’s genomic variation. We introduce Activity-informed Design with All-inclusive Patrolling of Targets (ADAPT), a system for automated design, and use it to design diagnostics for 1,933 vertebrate-infecting viral species within 2 hours for most species and within 24 hours for all but three. We experimentally show that ADAPT’s designs are sensitive and specific to the lineage level and permit lower limits of detection, across a virus’s variation, than the outputs of standard design techniques. Our strategy could facilitate a proactive resource of assays for detecting pathogens.
Published: 2023

24. Analyzing Generalized Pólya Urn Models Using Martingales, with an Application to Viral Evolution

Author: Specht, Ivan, primary and Mitzenmacher, Michael, additional
Published: 2023
Full Text: View/download PDF

25. Can Learned Models Replace Hash Functions?

Author: Sabek, Ibrahim, primary, Vaidya, Kapil, additional, Horn, Dominik, additional, Kipf, Andreas, additional, Mitzenmacher, Michael, additional, and Kraska, Tim, additional
Published: 2022
Full Text: View/download PDF

26. Viewpoint: Algorithms with Predictions.

Author: Mitzenmacher, Michael, Watson, Thomas J., and Vassilvitskii, Sergei
Subjects: *ALGORITHMS, *FORECASTING, *MACHINE learning
Abstract: The authors present their thoughts concerning the research sphere of algorithms with predictions, particularly focusing upon a research trend in machine learning (ML) predictors in order to go beyond the concept of the worst-case analysis.
Published: 2022
Full Text: View/download PDF

27. Algorithms with predictions

Author: Mitzenmacher, Michael, primary and Vassilvitskii, Sergei, additional
Published: 2022
Full Text: View/download PDF

28. Proteus: A Self-Designing Range Filter

Author: Knorr, Eric R., primary, Lemaire, Baptiste, additional, Lim, Andrew, additional, Luo, Siqiang, additional, Zhang, Huanchen, additional, Idreos, Stratos, additional, and Mitzenmacher, Michael, additional
Published: 2022
Full Text: View/download PDF

29. Uniform Bounds for Scheduling with Job Size Estimates

Author: Scully, Ziv, Grosof, Isaac, and Mitzenmacher, Michael
Abstract: We consider the problem of scheduling to minimize mean response time in M/G/1 queues where only estimated job sizes (processing times) are known to the scheduler, where a job of true size s has estimated size in the interval [β s, α s] for some α ≥ β > 0. We evaluate each scheduling policy by its approximation ratio, which we define to be the ratio between its mean response time and that of Shortest Remaining Processing Time (SRPT), the optimal policy when true sizes are known. Our question: is there a scheduling policy that (a) has approximation ratio near 1 when α and β are near 1, (b) has approximation ratio bounded by some function of α and β even when they are far from 1, and (c) can be implemented without knowledge of α and β? We first show that naively running SRPT using estimated sizes in place of true sizes is not such a policy: its approximation ratio can be arbitrarily large for any fixed β < 1. We then provide a simple variant of SRPT for estimated sizes that satisfies criteria (a), (b), and (c). In particular, we prove its approximation ratio approaches 1 uniformly as α and β approach 1. This is the first result showing this type of convergence for M/G/1 scheduling. We also study the Preemptive Shortest Job First (PSJF) policy, a cousin of SRPT. We show that, unlike SRPT, naively running PSJF using estimated sizes in place of true sizes satisfies criteria (b) and (c), as well as a weaker version of (a).
Published: 2022
Full Text: View/download PDF

30. SNARF

Author: Vaidya, Kapil, primary, Chatterjee, Subarna, additional, Knorr, Eric, additional, Mitzenmacher, Michael, additional, Idreos, Stratos, additional, and Kraska, Tim, additional
Published: 2022
Full Text: View/download PDF

31. The Supermarket Model with Known and Predicted Service Times

Author: Mitzenmacher, Michael, primary and Dell'Amico, Matteo, additional
Published: 2022
Full Text: View/download PDF

32. Zero-CPU Collection with Direct Telemetry Access

Author: Langlet, Jonatan, primary, Ben-Basat, Ran, additional, Ramanathan, Sivaramakrishnan, additional, Oliaro, Gabriele, additional, Mitzenmacher, Michael, additional, Yu, Minlan, additional, and Antichi, Gianni, additional
Published: 2021
Full Text: View/download PDF

33. Inferring Viral Transmission Pathways from Within-Host Variation.

Author: Specht IOA, Petros BA, Moreno GK, Brock-Fisher T, Krasilnikova LA, Schifferli M, Yang K, Cronan P, Glennon O, Schaffner SF, Park DJ, MacInnis BL, Ozonoff A, Fry B, Mitzenmacher MD, Varilly P, and Sabeti PC
Abstract: Genome sequencing can offer critical insight into pathogen spread in viral outbreaks, but existing transmission inference methods use simplistic evolutionary models and only incorporate a portion of available genetic data. Here, we develop a robust evolutionary model for transmission reconstruction that tracks the genetic composition of within-host viral populations over time and the lineages transmitted between hosts. We confirm that our model reliably describes within-host variant frequencies in a dataset of 134,682 SARS-CoV-2 deep-sequenced genomes from Massachusetts, USA. We then demonstrate that our reconstruction approach infers transmissions more accurately than two leading methods on synthetic data, as well as in a controlled outbreak of bovine respiratory syncytial virus and an epidemiologically-investigated SARS-CoV-2 outbreak in South Africa. Finally, we apply our transmission reconstruction tool to 5,692 outbreaks among the 134,682 Massachusetts genomes. Our methods and results demonstrate the utility of within-host variation for transmission inference of SARS-CoV-2 and other pathogens, and provide an adaptable mathematical framework for tracking within-host evolution.
Competing Interests: P.C.S. is a co-founder and shareholder of Sherlock Biosciences and Delve Bio and is a non-executive board member and shareholder of Danaher Corporation. P.C.S. is an inventor on patents related to diagnostics and Bluetooth-based contact tracing tools and technologies filed with the USPTO and other intellectual property bodies. A patent application has been filed on inventions described in this manuscript. All other authors declare no competing interests.
Published: 2023
Full Text: View/download PDF
