Author: "Mendelson, Avi" / Publication Year Range: This year - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mendelson, Avi"' showing total 3 results

Start Over Author "Mendelson, Avi" Publication Year Range This year

3 results on '"Mendelson, Avi"'

1. Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization

Author: Levi, Amit, Himelstein, Rom, Nemcovsky, Yaniv, Mendelson, Avi, and Baskin, Chaim
Subjects: Computer Science - Cryptography and Security, Computer Science - Machine Learning
Abstract: Jailbreak attacks aim to exploit large language models (LLMs) and pose a significant threat to their proper conduct; they seek to bypass models' safeguards and often provoke transgressive behaviors. However, existing automatic jailbreak attacks require extensive computational resources and are prone to converge on suboptimal solutions. In this work, we propose \textbf{C}ompliance \textbf{R}efusal \textbf{I}nitialization (CRI), a novel, attack-agnostic framework that efficiently initializes the optimization in the proximity of the compliance subspace of harmful prompts. By narrowing the initial gap to the adversarial objective, CRI substantially improves adversarial success rates (ASR) and drastically reduces computational overhead -- often requiring just a single optimization step. We evaluate CRI on the widely-used AdvBench dataset over the standard jailbreak attacks of GCG and AutoDAN. Results show that CRI boosts ASR and decreases the median steps to success by up to \textbf{\(\times 60\)}. The project page, along with the reference implementation, is publicly available at \texttt{https://amit1221levi.github.io/CRI-Jailbreak-Init-LLMs-evaluation/}.
Published: 2025

2. The Architecture

Author: Mendelson, Avi, Chattopadhyay, Anupam, Section editor, and Chattopadhyay, Anupam, editor
Published: 2025
Full Text: View/download PDF

3. The Use of Hierarchical Temporal Memory and Temporal Sequence Encoder for Online Anomaly Detection in Industrial Cyber-Physical Systems †.

Author: Malits, Roman and Mendelson, Avi
Subjects: WATER treatment plants, CYBER physical systems, MACHINE learning, INDUSTRIALISM, ANOMALY detection (Computer security)
Abstract: This study introduces a novel, practical approach for designing a hierarchical online anomaly detection system for industrial cyber-physical systems. The proposed method utilizes the Hierarchical Temporal Memory (HTM) unsupervised learning algorithm, which requires data to be encoded as sparse binary distributed representations (SDRs). A new SDR encoding method termed the temporal sequence encoder (TSSE) is presented to convert system outputs into SDRs. This approach enables HTM to retain high memory capacity and robust performance when processing data streams of slowly varying physical measurements, typical of many industrial processes. The effectiveness of the proposed system is demonstrated on the Secure Water Treatment (SWaT) dataset, which comprises data collected from a fully operational, scaled-down water treatment plant. The system achieves a recall of 0.906, a precision of 0.935, and an F1 score of 0.92 on SWaT. Compared to previous methods, our approach achieves state-of-the-art recall (~5.3% improvement), along with competitive precision and F1 score, by learning in an online manner without the need for expensive dataset collection, labeling, or retraining phases. These findings suggest that the proposed online anomaly detection method can be effectively applied to a broad range of water treatment and large-scale industrial cyber-physical systems. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Mendelson, Avi"'

1. Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization

2. The Architecture

3. The Use of Hierarchical Temporal Memory and Temporal Sequence Encoder for Online Anomaly Detection in Industrial Cyber-Physical Systems †.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

3 results on '"Mendelson, Avi"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources