Author: "Lee, Tony" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Lee, Tony"' showing total 1,588 results

Start Over Author "Lee, Tony"

1,588 results on '"Lee, Tony"'

1. Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Author: Li, Manling, Zhao, Shiyu, Wang, Qineng, Wang, Kangrui, Zhou, Yu, Srivastava, Sanjana, Gokmen, Cem, Lee, Tony, Li, Li Erran, Zhang, Ruohan, Liu, Weiyu, Liang, Percy, Fei-Fei, Li, Mao, Jiayuan, and Wu, Jiajun
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Robotics
Abstract: We aim to evaluate Large Language Models (LLMs) for embodied decision making. While a significant body of work has been leveraging LLMs for decision making in embodied environments, we still lack a systematic understanding of their performance because they are usually applied in different domains, for different purposes, and built based on different inputs and outputs. Furthermore, existing evaluations tend to rely solely on a final success rate, making it difficult to pinpoint what ability is missing in LLMs and where the problem lies, which in turn blocks embodied agents from leveraging LLMs effectively and selectively. To address these limitations, we propose a generalized interface (Embodied Agent Interface) that supports the formalization of various types of tasks and input-output specifications of LLM-based modules. Specifically, it allows us to unify 1) a broad set of embodied decision-making tasks involving both state and temporally extended goals, 2) four commonly-used LLM-based modules for decision making: goal interpretation, subgoal decomposition, action sequencing, and transition modeling, and 3) a collection of fine-grained metrics which break down evaluation into various types of errors, such as hallucination errors, affordance errors, various types of planning errors, etc. Overall, our benchmark offers a comprehensive assessment of LLMs' performance for different subtasks, pinpointing the strengths and weaknesses in LLM-powered embodied AI systems, and providing insights for effective and selective use of LLMs in embodied decision making., Comment: Accepted for oral presentation at NeurIPS 2024 in the Datasets and Benchmarks track
Published: 2024

2. VHELM: A Holistic Evaluation of Vision Language Models

Author: Lee, Tony, Tu, Haoqin, Wong, Chi Heem, Zheng, Wenhao, Zhou, Yiyang, Mai, Yifan, Roberts, Josselin Somerville, Yasunaga, Michihiro, Yao, Huaxiu, Xie, Cihang, and Liang, Percy
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Current benchmarks for assessing vision-language models (VLMs) often focus on their perception or problem-solving capabilities and neglect other critical aspects such as fairness, multilinguality, or toxicity. Furthermore, they differ in their evaluation procedures and the scope of the evaluation, making it difficult to compare models. To address these issues, we extend the HELM framework to VLMs to present the Holistic Evaluation of Vision Language Models (VHELM). VHELM aggregates various datasets to cover one or more of the 9 aspects: visual perception, knowledge, reasoning, bias, fairness, multilinguality, robustness, toxicity, and safety. In doing so, we produce a comprehensive, multi-dimensional view of the capabilities of the VLMs across these important factors. In addition, we standardize the standard inference parameters, methods of prompting, and evaluation metrics to enable fair comparisons across models. Our framework is designed to be lightweight and automatic so that evaluation runs are cheap and fast. Our initial run evaluates 22 VLMs on 21 existing datasets to provide a holistic snapshot of the models. We uncover new key findings, such as the fact that efficiency-focused models (e.g., Claude 3 Haiku or Gemini 1.5 Flash) perform significantly worse than their full models (e.g., Claude 3 Opus or Gemini 1.5 Pro) on the bias benchmark but not when evaluated on the other aspects. For transparency, we release the raw model generations and complete results on our website (https://crfm.stanford.edu/helm/vhelm/v2.0.1). VHELM is intended to be a living benchmark, and we hope to continue adding new datasets and models over time., Comment: NeurIPS 2024. First three authors contributed equally
Published: 2024

3. Enhancing Relevance of Embedding-based Retrieval at Walmart

Author: Lin, Juexin, Yadav, Sachin, Liu, Feng, Rossi, Nicholas, Suram, Praveen R., Chembolu, Satya, Chandran, Prijith, Mohapatra, Hrushikesh, Lee, Tony, Magnani, Alessandro, and Liao, Ciya
Subjects: Computer Science - Information Retrieval, H.3.3
Abstract: Embedding-based neural retrieval (EBR) is an effective search retrieval method in product search for tackling the vocabulary gap between customer search queries and products. The initial launch of our EBR system at Walmart yielded significant gains in relevance and add-to-cart rates [1]. However, despite EBR generally retrieving more relevant products for reranking, we have observed numerous instances of relevance degradation. Enhancing retrieval performance is crucial, as it directly influences product reranking and affects the customer shopping experience. Factors contributing to these degradations include false positives/negatives in the training data and the inability to handle query misspellings. To address these issues, we present several approaches to further strengthen the capabilities of our EBR model in terms of retrieval relevance. We introduce a Relevance Reward Model (RRM) based on human relevance feedback. We utilize RRM to remove noise from the training data and distill it into our EBR model through a multi-objective loss. In addition, we present the techniques to increase the performance of our EBR model, such as typo-aware training, and semi-positive generation. The effectiveness of our EBR is demonstrated through offline relevance evaluation, online AB tests, and successful deployments to live production. [1] Alessandro Magnani, Feng Liu, Suthee Chaidaroon, Sachin Yadav, Praveen Reddy Suram, Ajit Puthenputhussery, Sijie Chen, Min Xie, Anirudh Kashi, Tony Lee, et al. 2022. Semantic retrieval at walmart. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3495-3503., Comment: 8 pages, 3 figures, CIKM 2024
Published: 2024
Full Text: View/download PDF

4. Relevance Filtering for Embedding-based Retrieval

Author: Rossi, Nicholas, Lin, Juexin, Liu, Feng, Yang, Zhen, Lee, Tony, Magnani, Alessandro, and Liao, Ciya
Subjects: Computer Science - Information Retrieval, H.3.3
Abstract: In embedding-based retrieval, Approximate Nearest Neighbor (ANN) search enables efficient retrieval of similar items from large-scale datasets. While maximizing recall of relevant items is usually the goal of retrieval systems, a low precision may lead to a poor search experience. Unlike lexical retrieval, which inherently limits the size of the retrieved set through keyword matching, dense retrieval via ANN search has no natural cutoff. Moreover, the cosine similarity scores of embedding vectors are often optimized via contrastive or ranking losses, which make them difficult to interpret. Consequently, relying on top-K or cosine-similarity cutoff is often insufficient to filter out irrelevant results effectively. This issue is prominent in product search, where the number of relevant products is often small. This paper introduces a novel relevance filtering component (called "Cosine Adapter") for embedding-based retrieval to address this challenge. Our approach maps raw cosine similarity scores to interpretable scores using a query-dependent mapping function. We then apply a global threshold on the mapped scores to filter out irrelevant results. We are able to significantly increase the precision of the retrieved set, at the expense of a small loss of recall. The effectiveness of our approach is demonstrated through experiments on both public MS MARCO dataset and internal Walmart product search data. Furthermore, online A/B testing on the Walmart site validates the practical value of our approach in real-world e-commerce settings., Comment: 8 pages, 3 figures, CIKM 2024
Published: 2024
Full Text: View/download PDF

5. Doc2Token: Bridging Vocabulary Gap by Predicting Missing Tokens for E-commerce Search

Author: Li, Kaihao, Lin, Juexin, and Lee, Tony
Subjects: Computer Science - Information Retrieval, H.3.3
Abstract: Addressing the "vocabulary mismatch" issue in information retrieval is a central challenge for e-commerce search engines, because product pages often miss important keywords that customers search for. Doc2Query[1] is a popular document-expansion technique that predicts search queries for a document and includes the predicted queries with the document for retrieval. However, this approach can be inefficient for e-commerce search, because the predicted query tokens are often already present in the document. In this paper, we propose Doc2Token, a technique that predicts relevant tokens (instead of queries) that are missing from the document and includes these tokens in the document for retrieval. For the task of predicting missing tokens, we introduce a new metric, "novel ROUGE score". Doc2Token is demonstrated to be superior to Doc2Query in terms of novel ROUGE score and diversity of predictions. Doc2Token also exhibits efficiency gains by reducing both training and inference times. We deployed the feature to production and observed significant revenue gain in an online A/B test, and launched the feature to full traffic on Walmart.com. [1] R. Nogueira, W. Yang, J. Lin, K. Cho, Document expansion by query prediction, arXiv preprint arXiv:1904.08375 (2019), Comment: 9 pages, 1 figure, SIGIR 2024 Workshop on eCommerce
Published: 2024

6. Large Language Models for Relevance Judgment in Product Search

Author: Mehrdad, Navid, Mohapatra, Hrushikesh, Bagdouri, Mossaab, Chandran, Prijith, Magnani, Alessandro, Cai, Xunfan, Puthenputhussery, Ajit, Yadav, Sachin, Lee, Tony, Zhai, ChengXiang, and Liao, Ciya
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, H.3.3, I.2.7
Abstract: High relevance of retrieved and re-ranked items to the search query is the cornerstone of successful product search, yet measuring relevance of items to queries is one of the most challenging tasks in product information retrieval, and quality of product search is highly influenced by the precision and scale of available relevance-labelled data. In this paper, we present an array of techniques for leveraging Large Language Models (LLMs) for automating the relevance judgment of query-item pairs (QIPs) at scale. Using a unique dataset of multi-million QIPs, annotated by human evaluators, we test and optimize hyper parameters for finetuning billion-parameter LLMs with and without Low Rank Adaption (LoRA), as well as various modes of item attribute concatenation and prompting in LLM finetuning, and consider trade offs in item attribute inclusion for quality of relevance predictions. We demonstrate considerable improvement over baselines of prior generations of LLMs, as well as off-the-shelf models, towards relevance annotations on par with the human relevance evaluators. Our findings have immediate implications for the growing field of relevance judgment automation in product search., Comment: 10 pages, 1 figure, 11 tables - SIGIR 2024, LLM4Eval
Published: 2024

7. BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Author: Bolton, Elliot, Venigalla, Abhinav, Yasunaga, Michihiro, Hall, David, Xiong, Betty, Lee, Tony, Daneshjou, Roxana, Frankle, Jonathan, Liang, Percy, Carbin, Michael, and Manning, Christopher D.
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Models such as GPT-4 and Med-PaLM 2 have demonstrated impressive performance on a wide variety of biomedical NLP tasks. However, these models have hundreds of billions of parameters, are computationally expensive to run, require users to send their input data over the internet, and are trained on unknown data sources. Can smaller, more targeted models compete? To address this question, we build and release BioMedLM, a 2.7 billion parameter GPT-style autoregressive model trained exclusively on PubMed abstracts and full articles. When fine-tuned, BioMedLM can produce strong multiple-choice biomedical question-answering results competitive with much larger models, such as achieving a score of 57.3% on MedMCQA (dev) and 69.0% on the MMLU Medical Genetics exam. BioMedLM can also be fine-tuned to produce useful answers to patient questions on medical topics. This demonstrates that smaller models can potentially serve as transparent, privacy-preserving, economical and environmentally friendly foundations for particular NLP applications, such as in biomedicine. The model is available on the Hugging Face Hub: https://huggingface.co/stanford-crfm/BioMedLM., Comment: 23 pages
Published: 2024

8. Holistic Evaluation of Text-To-Image Models

Author: Lee, Tony, Yasunaga, Michihiro, Meng, Chenlin, Mai, Yifan, Park, Joon Sung, Gupta, Agrim, Zhang, Yunzhi, Narayanan, Deepak, Teufel, Hannah Benita, Bellagente, Marco, Kang, Minguk, Park, Taesung, Leskovec, Jure, Zhu, Jun-Yan, Fei-Fei, Li, Wu, Jiajun, Ermon, Stefano, and Liang, Percy
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The stunning qualitative improvement of recent text-to-image models has led to their widespread attention and adoption. However, we lack a comprehensive quantitative understanding of their capabilities and risks. To fill this gap, we introduce a new benchmark, Holistic Evaluation of Text-to-Image Models (HEIM). Whereas previous evaluations focus mostly on text-image alignment and image quality, we identify 12 aspects, including text-image alignment, image quality, aesthetics, originality, reasoning, knowledge, bias, toxicity, fairness, robustness, multilinguality, and efficiency. We curate 62 scenarios encompassing these aspects and evaluate 26 state-of-the-art text-to-image models on this benchmark. Our results reveal that no single model excels in all aspects, with different models demonstrating different strengths. We release the generated images and human evaluation results for full transparency at https://crfm.stanford.edu/heim/v1.1.0 and the code at https://github.com/stanford-crfm/helm, which is integrated with the HELM codebase., Comment: NeurIPS 2023. First three authors contributed equally
Published: 2023

9. StarCoder: may the source be with you!

Author: Li, Raymond, Allal, Loubna Ben, Zi, Yangtian, Muennighoff, Niklas, Kocetkov, Denis, Mou, Chenghao, Marone, Marc, Akiki, Christopher, Li, Jia, Chim, Jenny, Liu, Qian, Zheltonozhskii, Evgenii, Zhuo, Terry Yue, Wang, Thomas, Dehaene, Olivier, Davaadorj, Mishig, Lamy-Poirier, Joel, Monteiro, João, Shliazhko, Oleh, Gontier, Nicolas, Meade, Nicholas, Zebaze, Armel, Yee, Ming-Ho, Umapathi, Logesh Kumar, Zhu, Jian, Lipkin, Benjamin, Oblokulov, Muhtasham, Wang, Zhiruo, Murthy, Rudra, Stillerman, Jason, Patel, Siva Sankalp, Abulkhanov, Dmitry, Zocca, Marco, Dey, Manan, Zhang, Zhihan, Fahmy, Nour, Bhattacharyya, Urvashi, Yu, Wenhao, Singh, Swayam, Luccioni, Sasha, Villegas, Paulo, Kunakov, Maxim, Zhdanov, Fedor, Romero, Manuel, Lee, Tony, Timor, Nadav, Ding, Jennifer, Schlesinger, Claire, Schoelkopf, Hailey, Ebert, Jan, Dao, Tri, Mishra, Mayank, Gu, Alex, Robinson, Jennifer, Anderson, Carolyn Jane, Dolan-Gavitt, Brendan, Contractor, Danish, Reddy, Siva, Fried, Daniel, Bahdanau, Dzmitry, Jernite, Yacine, Ferrandis, Carlos Muñoz, Hughes, Sean, Wolf, Thomas, Guha, Arjun, von Werra, Leandro, and de Vries, Harm
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Programming Languages, Computer Science - Software Engineering
Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large collection of permissively licensed GitHub repositories with inspection tools and an opt-out process. We fine-tuned StarCoderBase on 35B Python tokens, resulting in the creation of StarCoder. We perform the most comprehensive evaluation of Code LLMs to date and show that StarCoderBase outperforms every open Code LLM that supports multiple programming languages and matches or outperforms the OpenAI code-cushman-001 model. Furthermore, StarCoder outperforms every model that is fine-tuned on Python, can be prompted to achieve 40\% pass@1 on HumanEval, and still retains its performance on other programming languages. We take several important steps towards a safe open-access model release, including an improved PII redaction pipeline and a novel attribution tracing tool, and make the StarCoder models publicly available under a more commercially viable version of the Open Responsible AI Model license.
Published: 2023

10. Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs

Author: Narayanan, Deepak, Santhanam, Keshav, Henderson, Peter, Bommasani, Rishi, Lee, Tony, and Liang, Percy
Subjects: Computer Science - Machine Learning
Abstract: Large language models (LLMs) power many state-of-the-art systems in natural language processing. However, these models are extremely computationally expensive, even at inference time, raising the natural question: when is the extra cost of deploying a larger model worth the anticipated boost in capabilities? Better understanding this tradeoff fundamentally could benefit from an inference efficiency metric that is both (i) easily comparable across models from different providers, and (ii) representative of the true cost of running queries in an isolated performance environment. Unfortunately, access to LLMs today is largely restricted to black-box text generation APIs and raw runtimes measured through this interface do not satisfy these desiderata: model providers can apply various software and hardware optimizations orthogonal to the model, and models served on shared infrastructure are susceptible to performance contention. To circumvent these problems, we propose a new metric for comparing inference efficiency across models. This metric puts models on equal footing as though they were served (i) on uniform hardware and software, and (ii) without performance contention. We call this metric the \emph{idealized runtime}, and we propose a methodology to efficiently estimate this metric for autoregressive Transformer models. We also propose cost-aware variants that incorporate the number of accelerators needed to serve the model. Using these metrics, we compare ten state-of-the-art LLMs to provide the first analysis of inference efficiency-capability tradeoffs; we make several observations from this analysis, including the fact that the superior inference runtime performance of certain APIs is often a byproduct of optimizations within the API rather than the underlying model. Our methodology also facilitates the efficient comparison of different software and hardware stacks.
Published: 2023

11. The Role of Expert Judgement in Language Test Validation

Author: Coniam, David, Lee, Tony, Milanovic, Michael, Pike, Nigel, and Zhao, Wen
Abstract: The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the LanguageCert suite of tests, Common European Framework of Reference (CEFR), the China Standards of English (CSE) and China's College English Test (CET). In the larger study, Year 1 students at a prestigious university in China were administered two tests--one with items based on China's College English Test (CET), and the other a CEFR-aligned test developed by LanguageCert--the LanguageCert Test of English (LTE). Comparable sections of the CET and the LTE involved sets of discrete items targeting lexico-grammatical competence. In order to ascertain whether expert judges were equally comfortable placing test items on either scale (CET or CEFR), a group of professors from the university in China who set the CET-based test, were asked to expert judge the CET items against the nine CSE levels with which they were very familiar. They were then asked to judge the LTE items against the six CEFR levels, with which they were less familiar. Both sets of expert ratings and the test taker responses on both tests were then calibrated within a single frame of reference and located on the LanguageCert scale. In the analysis of the expert ratings, the CSE-familiar raters exhibited higher levels of agreement with the empirically-derived score levels for the CET items than they did with the equivalent LTE items. This supports the proposition that expert judgement may be used in the calibration process where the experts in question have a strong knowledge of both the test material and the standards against which the test material is to be judged.
Published: 2022

12. International Recruitment: China Recruiters' Experience during COVID-19 Pandemic

Author: Lee, Tony and Cheng, Yanjie
Abstract: The purpose of this qualitative study was to explore the experiences of China recruiters during the pandemic, particularly with their job stress and sense of job security. The study also explored the new norms of Chinese students' recruitment following the post-pandemic crisis. Using qualitative analysis, we found that China recruiters experienced different stressors during the pandemic regardless of their working location. The participants recognized the importance of communication and seeking institutional support to help overcome their stress during the pandemic. They suggested that higher education administrators should be sensitive to the needs of their international recruiters. The participants also suggested several new norms for future recruitment, such as using the hybrid recruitment model, promoting university collaboration, initiating joint programs between US and Chinese institutions, and hiring domestic recruiters. Implications for practice are discussed. [Note: The page range (104-120) shown on the PDF is incorrect. The correct page range is 104-121.]
Published: 2022

13. Evaluating Human-Language Model Interaction

Author: Lee, Mina, Srivastava, Megha, Hardy, Amelia, Thickstun, John, Durmus, Esin, Paranjape, Ashwin, Gerard-Ursin, Ines, Li, Xiang Lisa, Ladhak, Faisal, Rong, Frieda, Wang, Rose E., Kwon, Minae, Park, Joon Sung, Cao, Hancheng, Lee, Tony, Bommasani, Rishi, Bernstein, Michael, and Liang, Percy
Subjects: Computer Science - Computation and Language
Abstract: Many real-world applications of language models (LMs), such as writing assistance and code autocomplete, involve human-LM interaction. However, most benchmarks are non-interactive in that a model produces output without human involvement. To evaluate human-LM interaction, we develop a new framework, Human-AI Language-based Interaction Evaluation (HALIE), that defines the components of interactive systems and dimensions to consider when designing evaluation metrics. Compared to standard, non-interactive evaluation, HALIE captures (i) the interactive process, not only the final output; (ii) the first-person subjective experience, not just a third-party assessment; and (iii) notions of preference beyond quality (e.g., enjoyment and ownership). We then design five tasks to cover different forms of interaction: social dialogue, question answering, crossword puzzles, summarization, and metaphor generation. With four state-of-the-art LMs (three variants of OpenAI's GPT-3 and AI21 Labs' Jurassic-1), we find that better non-interactive performance does not always translate to better human-LM interaction. In particular, we highlight three cases where the results from non-interactive and interactive metrics diverge and underscore the importance of human-LM interaction for LM evaluation., Comment: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI)
Published: 2022

14. Holistic Evaluation of Language Models

Author: Liang, Percy, Bommasani, Rishi, Lee, Tony, Tsipras, Dimitris, Soylu, Dilara, Yasunaga, Michihiro, Zhang, Yian, Narayanan, Deepak, Wu, Yuhuai, Kumar, Ananya, Newman, Benjamin, Yuan, Binhang, Yan, Bobby, Zhang, Ce, Cosgrove, Christian, Manning, Christopher D., Ré, Christopher, Acosta-Navas, Diana, Hudson, Drew A., Zelikman, Eric, Durmus, Esin, Ladhak, Faisal, Rong, Frieda, Ren, Hongyu, Yao, Huaxiu, Wang, Jue, Santhanam, Keshav, Orr, Laurel, Zheng, Lucia, Yuksekgonul, Mert, Suzgun, Mirac, Kim, Nathan, Guha, Neel, Chatterji, Niladri, Khattab, Omar, Henderson, Peter, Huang, Qian, Chi, Ryan, Xie, Sang Michael, Santurkar, Shibani, Ganguli, Surya, Hashimoto, Tatsunori, Icard, Thomas, Zhang, Tianyi, Chaudhary, Vishrav, Wang, William, Li, Xuechen, Mai, Yifan, Zhang, Yuhui, and Koreeda, Yuta
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Language models (LMs) are becoming the foundation for almost all major language technologies, but their capabilities, limitations, and risks are not well understood. We present Holistic Evaluation of Language Models (HELM) to improve the transparency of language models. First, we taxonomize the vast space of potential scenarios (i.e. use cases) and metrics (i.e. desiderata) that are of interest for LMs. Then we select a broad subset based on coverage and feasibility, noting what's missing or underrepresented (e.g. question answering for neglected English dialects, metrics for trustworthiness). Second, we adopt a multi-metric approach: We measure 7 metrics (accuracy, calibration, robustness, fairness, bias, toxicity, and efficiency) for each of 16 core scenarios when possible (87.5% of the time). This ensures metrics beyond accuracy don't fall to the wayside, and that trade-offs are clearly exposed. We also perform 7 targeted evaluations, based on 26 targeted scenarios, to analyze specific aspects (e.g. reasoning, disinformation). Third, we conduct a large-scale evaluation of 30 prominent language models (spanning open, limited-access, and closed models) on all 42 scenarios, 21 of which were not previously used in mainstream LM evaluation. Prior to HELM, models on average were evaluated on just 17.9% of the core HELM scenarios, with some prominent models not sharing a single scenario in common. We improve this to 96.0%: now all 30 models have been densely benchmarked on the same core scenarios and metrics under standardized conditions. Our evaluation surfaces 25 top-level findings. For full transparency, we release all raw model prompts and completions publicly for further analysis, as well as a general modular toolkit. We intend for HELM to be a living benchmark for the community, continuously updated with new scenarios, metrics, and models., Comment: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Project page: https://crfm.stanford.edu/helm/v1.0
Published: 2022

15. Identifying Guessing in English Language Tests via Rasch Fit Statistics: An Exploratory Study

Author: Coniam, David, Lee, Tony, and Lampropoulou, Leda
Abstract: This article explores the issue of identifying guessers -- with a specific focus on multiple-choice tests. Guessing has long been considered a problem due to the fact that it compromises validity. A test taker scoring higher than they should through guessing does not provide a picture of their actual ability. After an initial description of issues associated with guessing, the article then outlines approaches which have been taken to either discourage test takers from guessing or which attempt statistically to handle the problem. From this, the article moves to a novel way of identifying potential guessers: from the post hoc use of Rasch fit statistics. Two datasets, each consisting of approximately 200 beginner level English language test takers were split into two. In each dataset, half the test takers' answers were randomised -- to approximate guessing. Results obtained via a Rasch analysis of the data was then passed to an analyst who used the Rasch fit statistics to identify possible guessers. On each dataset, 80% of guessers were identified.
Published: 2021

16. A DFS Algorithm for Maximum Matchings in General Graphs

Author: Lee, Tony T., Lu, Bojun, and Chu, Hanli
Subjects: Computer Science - Data Structures and Algorithms, 05C30 (Primary) 68R10, 68R05 (Secondary), G.2.1, G.2.2, F.2.2
Abstract: In this paper, we propose a depth-first search (DFS) algorithm for searching maximum matchings in general graphs. Unlike blossom shrinking algorithms, which store all possible alternative alternating paths in the super-vertices shrunk from blossoms, the newly proposed algorithm does not involve blossom shrinking. The basic idea is to deflect the alternating path when facing blossoms. The algorithm maintains detour information in an auxiliary stack to minimize the redundant data structures. A benefit of our technique is to avoid spending time on shrinking and expanding blossoms. This DFS algorithm can determine a maximum matching of a general graph with $m$ edges and $n$ vertices in $O(mn)$ time with space complexity $O(n)$., Comment: 17 pages, 9 figures, 2 tables
Published: 2022

17. Life satisfaction and peace of mind among dentists: a cross-sectional study

Author: Fu, Martin M., Fu, Min-Wen, Chen, Rebecca Y., Chan, Hsun-Liang, Lee, Tony Szu-Hsien, and Fu, Earl
Published: 2023
Full Text: View/download PDF

18. Extending the WILDS Benchmark for Unsupervised Adaptation

Author: Sagawa, Shiori, Koh, Pang Wei, Lee, Tony, Gao, Irena, Xie, Sang Michael, Shen, Kendrick, Kumar, Ananya, Hu, Weihua, Yasunaga, Michihiro, Marklund, Henrik, Beery, Sara, David, Etienne, Stavness, Ian, Guo, Wei, Leskovec, Jure, Saenko, Kate, Hashimoto, Tatsunori, Levine, Sergey, Finn, Chelsea, and Liang, Percy
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Machine learning systems deployed in the wild are often trained on a source distribution but deployed on a different target distribution. Unlabeled data can be a powerful point of leverage for mitigating these distribution shifts, as it is frequently much more available than labeled data and can often be obtained from distributions beyond the source distribution as well. However, existing distribution shift benchmarks with unlabeled data do not reflect the breadth of scenarios that arise in real-world applications. In this work, we present the WILDS 2.0 update, which extends 8 of the 10 datasets in the WILDS benchmark of distribution shifts to include curated unlabeled data that would be realistically obtainable in deployment. These datasets span a wide range of applications (from histology to wildlife conservation), tasks (classification, regression, and detection), and modalities (photos, satellite images, microscope slides, text, molecular graphs). The update maintains consistency with the original WILDS benchmark by using identical labeled training, validation, and test sets, as well as the evaluation metrics. On these datasets, we systematically benchmark state-of-the-art methods that leverage unlabeled data, including domain-invariant, self-training, and self-supervised methods, and show that their success on WILDS is limited. To facilitate method development and evaluation, we provide an open-source package that automates data loading and contains all of the model architectures and methods used in this paper. Code and leaderboards are available at https://wilds.stanford.edu.
Published: 2021

19. COVID-19 stigma associates with burnout among healthcare providers: Evidence from Taiwanese physicians and nurses

Author: Shiu, Chengshi, Chen, Wei-Ti, Hung, Chia-Chun, Huang, Edward Pei-Chuan, and Lee, Tony Szu-Hsien
Subjects: Clinical Research, Prevention, 8.1 Organisation and delivery of services, Health and social care services research, Good Health and Well Being, Burnout, Professional, Burnout, Psychological, COVID-19, Cross-Sectional Studies, Health Personnel, Humans, Nurses, Physicians, SARS-CoV-2, Surveys and Questionnaires, Burnout, Healthcare providers, Pandemics, Social stigma, Medical and Health Sciences, General & Internal Medicine
Abstract: Background/purposeDuring pandemics like SARS-CoV-2, healthcare providers' well-being and morale are in particular at stake. Burnout may substantially hinder the well-being and morale of healthcare providers, challenging our efforts at disease containment. This study investigated the relationship between perceived COVID-19 stigma and burnout symptoms among physicians and nurses. We further aimed to identify potential factors that may moderate this relationship, including profession, clinical contact with COVID-19 patients, and prior experience with 2003 SARS-CoV-1.MethodsWe used a web-based, structured survey from March 12th to 29th, 2020 to collect cross-sectional, self-reported data. Participants were provided with a link to the survey which took them on average 5-8 minutes. Survey consisted of demographic characteristics, clinical experiences, perceived COVID-19 related stigma, and burnout symptoms. Linear regression with bootstrapping techniques was adopted to test the relations between stigma and burnout, as well as other potential moderators, while adjusting for demographic and clinical factors.ResultsOf the 1421 consented respondents, 357 identified as physicians while 1064 identified as nurses. Participants reported some levels of stigma, and noticeable burnout symptoms. Burnout symptoms were positively correlated with COVID-19 stigma, profession, and currently care for confirmed/suspected COVID-19 patients. The interaction between stigma and profession (Stigma × Nurses) but no other interaction terms reached the significance level, suggesting that the slope for nurses was flatter than the slope for physicians.ConclusionThe study results suggest that COVID-19 stigma may contribute to burnout among physicians and nurses, and this relation may not vary across clinical roles and experiences but profession.
Published: 2022

20. Asynchronous, decentralized command and control

Author: Ghosh, Sumit and Lee, Tony
Subjects: COMMAND AND CONTROL, COMMAND, CONTROL AND COMMUNICATIONS
Abstract: illus bibliog
Published: 2000

21. On the Opportunities and Risks of Foundation Models

Author: Bommasani, Rishi, Hudson, Drew A., Adeli, Ehsan, Altman, Russ, Arora, Simran, von Arx, Sydney, Bernstein, Michael S., Bohg, Jeannette, Bosselut, Antoine, Brunskill, Emma, Brynjolfsson, Erik, Buch, Shyamal, Card, Dallas, Castellon, Rodrigo, Chatterji, Niladri, Chen, Annie, Creel, Kathleen, Davis, Jared Quincy, Demszky, Dora, Donahue, Chris, Doumbouya, Moussa, Durmus, Esin, Ermon, Stefano, Etchemendy, John, Ethayarajh, Kawin, Fei-Fei, Li, Finn, Chelsea, Gale, Trevor, Gillespie, Lauren, Goel, Karan, Goodman, Noah, Grossman, Shelby, Guha, Neel, Hashimoto, Tatsunori, Henderson, Peter, Hewitt, John, Ho, Daniel E., Hong, Jenny, Hsu, Kyle, Huang, Jing, Icard, Thomas, Jain, Saahil, Jurafsky, Dan, Kalluri, Pratyusha, Karamcheti, Siddharth, Keeling, Geoff, Khani, Fereshte, Khattab, Omar, Koh, Pang Wei, Krass, Mark, Krishna, Ranjay, Kuditipudi, Rohith, Kumar, Ananya, Ladhak, Faisal, Lee, Mina, Lee, Tony, Leskovec, Jure, Levent, Isabelle, Li, Xiang Lisa, Li, Xuechen, Ma, Tengyu, Malik, Ali, Manning, Christopher D., Mirchandani, Suvir, Mitchell, Eric, Munyikwa, Zanele, Nair, Suraj, Narayan, Avanika, Narayanan, Deepak, Newman, Ben, Nie, Allen, Niebles, Juan Carlos, Nilforoshan, Hamed, Nyarko, Julian, Ogut, Giray, Orr, Laurel, Papadimitriou, Isabel, Park, Joon Sung, Piech, Chris, Portelance, Eva, Potts, Christopher, Raghunathan, Aditi, Reich, Rob, Ren, Hongyu, Rong, Frieda, Roohani, Yusuf, Ruiz, Camilo, Ryan, Jack, Ré, Christopher, Sadigh, Dorsa, Sagawa, Shiori, Santhanam, Keshav, Shih, Andy, Srinivasan, Krishnan, Tamkin, Alex, Taori, Rohan, Thomas, Armin W., Tramèr, Florian, Wang, Rose E., Wang, William, Wu, Bohan, Wu, Jiajun, Wu, Yuhuai, Xie, Sang Michael, Yasunaga, Michihiro, You, Jiaxuan, Zaharia, Matei, Zhang, Michael, Zhang, Tianyi, Zhang, Xikun, Zhang, Yuhui, Zheng, Lucia, Zhou, Kaitlyn, and Liang, Percy
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles(e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent capabilities,and their effectiveness across so many tasks incentivizes homogenization. Homogenization provides powerful leverage but demands caution, as the defects of the foundation model are inherited by all the adapted models downstream. Despite the impending widespread deployment of foundation models, we currently lack a clear understanding of how they work, when they fail, and what they are even capable of due to their emergent properties. To tackle these questions, we believe much of the critical research on foundation models will require deep interdisciplinary collaboration commensurate with their fundamentally sociotechnical nature., Comment: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html
Published: 2021

22. Do Question Answering Modeling Improvements Hold Across Benchmarks?

Author: Liu, Nelson F., Lee, Tony, Jia, Robin, and Liang, Percy
Subjects: Computer Science - Computation and Language
Abstract: Do question answering (QA) modeling improvements (e.g., choice of architecture and training procedure) hold consistently across the diverse landscape of QA benchmarks? To study this question, we introduce the notion of concurrence -- two benchmarks have high concurrence on a set of modeling approaches if they rank the modeling approaches similarly. We measure the concurrence between 32 QA benchmarks on a set of 20 diverse modeling approaches and find that human-constructed benchmarks have high concurrence amongst themselves, even if their passage and question distributions are very different. Surprisingly, even downsampled human-constructed benchmarks (i.e., collecting less data) and programmatically-generated benchmarks (e.g., cloze-formatted examples) have high concurrence with human-constructed benchmarks. These results indicate that, despite years of intense community focus on a small number of benchmarks, the modeling improvements studied hold broadly., Comment: 31 pages, 13 figures; to appear at ACL 2023
Published: 2021

23. WILDS: A Benchmark of in-the-Wild Distribution Shifts

Author: Koh, Pang Wei, Sagawa, Shiori, Marklund, Henrik, Xie, Sang Michael, Zhang, Marvin, Balsubramani, Akshay, Hu, Weihua, Yasunaga, Michihiro, Phillips, Richard Lanas, Gao, Irena, Lee, Tony, David, Etienne, Stavness, Ian, Guo, Wei, Earnshaw, Berton A., Haque, Imran S., Beery, Sara, Leskovec, Jure, Kundaje, Anshul, Pierson, Emma, Levine, Sergey, Finn, Chelsea, and Liang, Percy
Subjects: Computer Science - Machine Learning
Abstract: Distribution shifts -- where the training distribution differs from the test distribution -- can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. Despite their ubiquity in the real-world deployments, these distribution shifts are under-represented in the datasets widely used in the ML community today. To address this gap, we present WILDS, a curated benchmark of 10 datasets reflecting a diverse range of distribution shifts that naturally arise in real-world applications, such as shifts across hospitals for tumor identification; across camera traps for wildlife monitoring; and across time and location in satellite imaging and poverty mapping. On each dataset, we show that standard training yields substantially lower out-of-distribution than in-distribution performance. This gap remains even with models trained by existing methods for tackling distribution shifts, underscoring the need for new methods for training models that are more robust to the types of distribution shifts that arise in practice. To facilitate method development, we provide an open-source package that automates dataset loading, contains default model architectures and hyperparameters, and standardizes evaluations. Code and leaderboards are available at https://wilds.stanford.edu.
Published: 2020

24. Essential Hybrid and Remote Workers in Higher Education: Challenges and Resiliency of Student Affairs Practitioners

Author: Lee, Tony, Karasek, Brianna, Chan, Roy Y., editor, Lin, Xi, editor, and Bista, Krishna, editor
Published: 2023
Full Text: View/download PDF

25. Recidivism of Individuals Who Completed Schedule I Drugs Deferred Prosecution Treatment: A Population-Based Follow-Up Study from 2008 to 2020 in Taiwan

Author: Ku, Yi-Chien, Chung, Hung-Ping, Hsu, Chia-Yu, Cheng, Yuan-Hao, Hsu, Freya In-Chu, Tsai, Yi-Chia, Chao, En, and Lee, Tony Szu-Hsien
Published: 2023
Full Text: View/download PDF

26. Designing and Analysis of A Wi-Fi Data Offloading Strategy Catering for the Preference of Mobile Users

Author: Zhou, Xiaoyi, Ye, Tong, and Lee, Tony T.
Subjects: Computer Science - Networking and Internet Architecture
Abstract: In recent years, offloading mobile traffic through Wi-Fi has emerged as a potential solution to lower down the communication cost for mobile users. Users hope to reduce the cost while keeping the delay in an acceptable range through Wi-Fi offloading. Also, different users have different sensitivities to the cost and the delay performance. How to make a proper cost-delay tradeoff according to the user's preference is the key issue in the design of the offloading strategy. To address this issue, we propose a preference-oriented offloading strategy for current commercial terminals, which transmit traffic only via one channel simultaneously. We model the strategy as a three-state M/MMSP/1 queueing system, of which the service process is a Markov modulated service process (MMSP), and obtain the structured solutions by establishing a hybrid embedded Markov chain. Our analysis shows that, given the user's preference, there exists an optimal deadline to maximize the utility, which is defined as the linear combination of the cost and the delay. We also provide a method to select the optimal deadline. Our simulation demonstrates that this strategy with the optimal deadline can achieve a good performance.
Published: 2020

27. Differentially Private M-band Wavelet-Based Mechanisms in Machine Learning Environments

Author: Choi, Kenneth and Lee, Tony
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Statistics - Machine Learning, 68T01 (Primary) 94A60 (Secondary)
Abstract: In the post-industrial world, data science and analytics have gained paramount importance regarding digital data privacy. Improper methods of establishing privacy for accessible datasets can compromise large amounts of user data even if the adversary has a small amount of preliminary knowledge of a user. Many researchers have been developing high-level privacy-preserving mechanisms that also retain the statistical integrity of the data to apply to machine learning. Recent developments of differential privacy, such as the Laplace and Privelet mechanisms, drastically decrease the probability that an adversary can distinguish the elements in a data set and thus extract user information. In this paper, we develop three privacy-preserving mechanisms with the discrete M-band wavelet transform that embed noise into data. The first two methods (LS and LS+) add noise through a Laplace-Sigmoid distribution that multiplies Laplace-distributed values with the sigmoid function, and the third method utilizes pseudo-quantum steganography to embed noise into the data. We then show that our mechanisms successfully retain both differential privacy and learnability through statistical analysis in various machine learning environments., Comment: Part-Time Research Assistant/Helper: Tony Lee; 49 pages, 20 figures, 1 table, to be published by International Press of Boston
Published: 2019

28. Kim Il Sung/mausoleum (1994); The revenge of Fu Manchu

Author: Lee, Tony
Published: 1997

29. The nervous system; Ether

Author: Lee, Tony
Published: 1997

30. Working for Johnnie Walker

Author: Lee, Tony
Published: 1996

31. The King's abandoned residence; Trans fr Mandarin

Author: Lee, Tony
Published: 1995

32. Throughput and Delay Analysis of Slotted Aloha with Batch Service

Author: Huang, Huanhuan, Ye, Tong, and Lee, Tony T.
Subjects: Computer Science - Performance
Abstract: In this paper, we study the throughput and delay performances of the slotted Aloha with batch service, which has wide applications in random access networks. Different from the classical slotted Aloha, each node in the slotted Aloha with batch service can transmit up to M packets once it succeeds in channel competition. The throughput is substantially improved because up to M packets jointly undertake the overhead due to contention. In an innovative vacation model developed in this paper, we consider each batch of data transmission as a busy period of each node, and the process between two successive busy periods as a vacation period. We then formulate the number of arrivals during a vacation period in a renewal-type equation, which characterizes the dependency between busy periods and vacation periods. Based on this formulation, we derive the mean waiting time of a packet and the bounded delay region for the slotted Aloha with batch service. Our results indicate the throughput and delay performances are substantially improved with the increase of batch sizeM, and the bounded delay region is enlarged accordingly. As M goes to infinity, we find the saturated throughput can approach 100% of channel capacity, and the system remains stable irrespective of the population size and transmission probability.
Published: 2019

33. AWG-based Nonblocking Shuffle-Exchange Networks

Author: Ye, Tong, Ding, Jingjie, Lee, Tony Tong, and Maier, Guido
Subjects: Computer Science - Networking and Internet Architecture, Electrical Engineering and Systems Science - Signal Processing
Abstract: Optical shuffle-exchange networks (SENs) have wide application in different kinds of interconnection networks. This paper proposes an approach to construct modular optical SENs, using a set of arrayed waveguide gratings (AWGs) and tunable wavelength converters (TWCs). According to the wavelength routing property of AWGs, we demonstrate for the first time that an AWG is functionally equivalent to a classical shuffle network by nature. Based on this result, we devise a systematic method to design a large-scale wavelength-division-multiplexing (WDM) shuffle network using a set of small-size AWGs associated with the same wavelength set. Combining the AWG-based WDM shuffle networks and the TWCs with small conversion range, we finally obtain an AWG-based WDM SEN, which not only is scalable in several ways, but also can achieve 100% utilization when the input wavelength channels are all busy. We also study the routing and wavelength assignment (RWA) problem of the AWG-based WDM SEN, and prove that the self-routing property and the nonblocking routing conditions of classical SENs are preserved in such AWG-based WDM SEN., Comment: 13 pages, 8 figures
Published: 2019

34. The Effect of Mobility on Delayed Data Offloading

Author: Zhou, Xiaoyi, Ye, Tong, and Lee, Tony T.
Subjects: Computer Science - Networking and Internet Architecture
Abstract: Delayed offloading is a widely accepted solution for mobile users to offload their traffic through Wi-Fi when they are moving in urban areas. However, delayed offloading enhances offloading efficiency at the expense of delay performance. Previous works mainly focus on the improvement of offloading efficiency while keeping delay performance in an acceptable region. In this paper, we study the impact of the user mobility on delayed data offloading in respect to the tradeoff between offloading efficiency and delay performance. We model a mobile terminal with delayed data offloading as an M/MMSP/1 queuing system with three service states. To be practical, we consider the feature of currently commercial mobile terminals in our analysis. Our analytical result shows that the mobility of the users can reduce the queueing delay incurred by the delayed offloading, and suggests that delayed offloading strategies can be optimized according to the mobility of the terminals once the delay requirement is given., Comment: 6 pages,6 figures, conference
Published: 2019

35. In these lines; Bus to purgatory

Author: Lee, Tony
Published: 1994

36. Passive Cantonese

Author: Lee, Tony
Published: 1994

37. The Effect of Mentoring and Self-Esteem on Job Satisfaction: A Comparative Study between U.S.-Born and Foreign-Born Faculty

Author: Lee, Tony, Hun Lim, Doo, and Kim, JoHyun
Abstract: The purpose of this study was to compare U.S.-born and foreign-born faculty members' job satisfaction in research universities and investigate if mentoring and self-esteem affect job satisfaction of faculty members in U.S. institutions. From the data set obtained from 481 faculty members in 21 universities, this study identified there was no significant difference in job satisfaction between U.S.-born and foreign-born faculty members based on marital status, faculty ranking, and tenure status. However, the study results indicated that mentoring and self-esteem as a set was a significant predictor for faculty job satisfaction for both faculty groups. Based on this finding, implications for faculty development through mentoring programs are discussed.
Published: 2021
Full Text: View/download PDF

38. The cognition, information behaviors, and preventive behaviors of Taiwanese people facing COVID-19

Author: Yueh, Hsiu-Ping, Wang, Guan-Yun, and Lee, Tony Szu-Hsien
Published: 2022
Full Text: View/download PDF

39. Clinical and behavior characteristics of individuals who used ketamine

Author: Lee, Tony Szu-Hsien, Liu, Yi-Hsuan, Huang, Yun-Ju, Tang, Wai-Kwong, Wang, Yifan, Hu, Sien, Lin, Ching-Po, Li, Chiang-Shan Ray, and Hung, Chia-Chun
Published: 2022
Full Text: View/download PDF

40. Teaching and Learning in Turbulent Times

Author: Lee, Tony, primary and Cheng, Ruby, additional
Published: 2022
Full Text: View/download PDF

41. Life Satisfaction of US-trained Dental Specialists in Taiwan

Author: Fu, Martin M., Chen, Rebecca Y., Fu, Min-Wen, Kao, Huan-Chen, Kao, Huan-Chiao, Chan, Hsun-Liang, Fu, Earl, and Lee, Tony Szu-Hsien
Published: 2022
Full Text: View/download PDF

42. Modular AWG-based Optical Shuffle Network

Author: Ding, Jingjie, Ye, Tong, Lee, Tony T., and Hu, Weisheng
Subjects: Computer Science - Other Computer Science
Abstract: This paper proposes an arrayed-waveguide grating (AWG) based wavelength-division-multiplexing (WDM) shuffle network. Compared with previous optical shuffle networks, our proposal is compact, easy to implement, highly scalable, and cost effective.
Published: 2017

43. Optimum Transmission Window for EPONs with Gated-Limited Service

Author: Huang, Huanhuan, Ye, Tong, Lee, Tony T., and Hu, Weisheng
Subjects: Computer Science - Networking and Internet Architecture, Computer Science - Performance
Abstract: This paper studies the Ethernet Passive Optical Network (EPON) with gated-limited service. The transmission window (TW) is limited in this system to guaranteeing a bounded delay experienced by disciplined users, and to constrain malicious users from monopolizing the transmission channel. Thus, selecting an appropriate TW size is critical to the performance of EPON with gated-limited service discipline. To investigate the impact of TW size on packet delay, we derive a generalized mean waiting time formula for M/G/1 queue with vacation times and gated-limited service discipline. A distinguished feature of this model is that there are two queues in the buffer of each optical network unit (ONU): one queue is inside the gate and the other one is outside the gate. Furthermore, based on the Chernoff bound of queue length, we provide a simple rule to determine an optimum TW size for gated-limited service EPONs. Analytic results reported in this paper are all verified by simulations., Comment: 14 pages, 10 figures, IEEE transaction on networking
Published: 2017

44. Chemical codes promote selective compartmentalization of proteins

Author: Kilgore, Henry, primary, Chinn, Itamar, additional, Mikhael, Peter, additional, Mitnikov, Ilan, additional, Van Dongen, Catherine, additional, Zylberberg, Guy, additional, Afeyan, Lena, additional, Banani, Salman, additional, Wilson-Hawken, Susana, additional, Lee, Tony, additional, Barzialy, Regina, additional, and Young, Richard, additional
Published: 2024
Full Text: View/download PDF

45. Reversible Bilateral Vision Loss: An Unusual Presentation of Wernicke Korsakoff Syndrome (P3-10.005)

Author: Singh, Divya, primary, Lee, Tony, additional, Rodriguez, Wilson, additional, Pastrana, Francesca, additional, Qaqish, Hanan, additional, and Chand, Pratap, additional
Published: 2024
Full Text: View/download PDF

46. A Forward Train Detection Method Based on Convolutional Neural Network

Author: Wang, Zhangyu, Lee, Tony, Leung, Michael, Tang, Simon, Zhang, Qiang, Yang, Zining, Cheung, Virginia, Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Ahram, Tareq, editor, Karwowski, Waldemar, editor, Vergnano, Alberto, editor, Leali, Francesco, editor, and Taiar, Redha, editor
Published: 2020
Full Text: View/download PDF

47. A Comparative Study on Ethics Guidelines for Artificial Intelligence Across Nations

Author: Lee, Tony Szu-Hsien, Liu, Shiang-Yao, Wei, Yin-Ling, Chang, Li-Yun, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Huang, Tien-Chi, editor, Wu, Ting-Ting, editor, Barroso, João, editor, Sandnes, Frode Eika, editor, Martins, Paulo, editor, and Huang, Yueh-Min, editor
Published: 2020
Full Text: View/download PDF

48. Power Efficiency and Delay Tradeoff of 10GBase-T Energy Efficient Ethernet Protocol

Author: Pan, Xiaodan, Ye, Tong, Lee, Tony T., and Hu, Weisheng
Subjects: Computer Science - Networking and Internet Architecture
Abstract: In this paper, we study the power efficiency and delay performance of the IEEE 802.3az Energy Efficient Ethernet (EEE) protocol. A new approach is proposed to analyze the M/G/1 queue with the vacation time that is governed by the arrival process and the parameter {\tau} and N of the BTR strategy. Our key idea is to establish the connection between the vacation time and the arrival process to account for their dependency. We first derive the distribution of the number of arrivals during a vacation time based on an event tree of the BTR strategy, from which we obtain the mean vacation time and the power efficiency. Next, from the condition on the number of arrivals at the end of a vacation period, we derive a generalized P-K formula of the mean delay for EEE systems, and prove that the classical P-K formula of the vacation model is only a special case when the vacation time is independent of the arrival process. Our analysis demonstrates that the {\tau} policy and N policy of the BTR strategy are compensating each other. The {\tau} policy ensures the frame delay is bounded when the traffic load is light, while the N policy ensures the queue length at the end of vacation is bounded when the traffic load is heavy. These results, in turn, provide the rules to select appropriate {\tau} and N. Our analytical results are confirmed by simulations., Comment: 12 pages,8 figure
Published: 2016

49. Reply to Comment on 'Anomalous Edge State in a Non-Hermitian Lattice'

Author: Lee, Tony E.
Subjects: Quantum Physics, Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: The Comment by Xiong et al. (arXiv:1610.06275) criticizing my Letter [Phys. Rev. Lett. 116, 133903 (2016)] was rejected by Physical Review Letters. In this Reply, I show that all their claims are wrong., Comment: 2 pages, 1 figure
Published: 2016

50. Ion crystals in anharmonic traps

Author: Bastin, S. R. and Lee, Tony E.
Subjects: Quantum Physics, Condensed Matter - Quantum Gases, Condensed Matter - Statistical Mechanics
Abstract: There is currently intensive research into creating a large-scale quantum computer with trapped ions. It is well known that for a linear ion crystal in a harmonic potential, the ions near the center are more closely spaced compared to the ions near the ends. This is problematic as the number of ions increases. Here, we consider a linear ion crystal in an anharmonic potential that is purely quartic in position. We find that the ions are more evenly spaced compared to the harmonic case. We develop a variational approach to calculate the properties of the ground state. We also characterize the zigzag transition in an anharmonic potential., Comment: 5 pages, 5 figures
Published: 2016
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

1,588 results on '"Lee, Tony"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources