Author: "Noga, Hila" / Topic: computer science - machine learning - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Noga, Hila"' showing total 2 results

Start Over Author "Noga, Hila" Topic computer science - machine learning

2 results on '"Noga, Hila"'

1. Multi-turn Reinforcement Learning from Preference Human Feedback

Author: Shani, Lior, Rosenberg, Aviv, Cassel, Asaf, Lang, Oran, Calandriello, Daniele, Zipori, Avital, Noga, Hila, Keller, Orgad, Piot, Bilal, Szpektor, Idan, Hassidim, Avinatan, Matias, Yossi, and Munos, Rémi
Subjects: Computer Science - Machine Learning
Abstract: Reinforcement Learning from Human Feedback (RLHF) has become the standard approach for aligning Large Language Models (LLMs) with human preferences, allowing LLMs to demonstrate remarkable abilities in various tasks. Existing methods work by emulating the preferences at the single decision (turn) level, limiting their capabilities in settings that require planning or multi-turn interactions to achieve a long-term goal. In this paper, we address this issue by developing novel methods for Reinforcement Learning (RL) from preference feedback between two full multi-turn conversations. In the tabular setting, we present a novel mirror-descent-based policy optimization algorithm for the general multi-turn preference-based RL problem, and prove its convergence to Nash equilibrium. To evaluate performance, we create a new environment, Education Dialogue, where a teacher agent guides a student in learning a random topic, and show that a deep RL variant of our algorithm outperforms RLHF baselines. Finally, we show that in an environment with explicit rewards, our algorithm recovers the same performance as a reward-based RL baseline, despite relying solely on a weaker preference signal.
Published: 2024

2. Flood forecasting with machine learning models in an operational framework

Author: Nevo, Sella, Morin, Efrat, Rosenthal, Adi Gerzi, Metzger, Asher, Barshai, Chen, Weitzner, Dana, Voloshin, Dafi, Kratzert, Frederik, Elidan, Gal, Dror, Gideon, Begelman, Gregory, Nearing, Grey, Shalev, Guy, Noga, Hila, Shavitt, Ira, Yuklea, Liora, Royz, Moriah, Giladi, Niv, Levi, Nofar Peled, Reich, Ofir, Gilon, Oren, Maor, Ronnie, Timnat, Shahar, Shechter, Tal, Anisimov, Vladimir, Gigi, Yotam, Levin, Yuval, Moshe, Zach, Ben-Haim, Zvika, Hassidim, Avinatan, and Matias, Yossi
Subjects: Computer Science - Machine Learning
Abstract: The operational flood forecasting system by Google was developed to provide accurate real-time flood warnings to agencies and the public, with a focus on riverine floods in large, gauged rivers. It became operational in 2018 and has since expanded geographically. This forecasting system consists of four subsystems: data validation, stage forecasting, inundation modeling, and alert distribution. Machine learning is used for two of the subsystems. Stage forecasting is modeled with the Long Short-Term Memory (LSTM) networks and the Linear models. Flood inundation is computed with the Thresholding and the Manifold models, where the former computes inundation extent and the latter computes both inundation extent and depth. The Manifold model, presented here for the first time, provides a machine-learning alternative to hydraulic modeling of flood inundation. When evaluated on historical data, all models achieve sufficiently high-performance metrics for operational use. The LSTM showed higher skills than the Linear model, while the Thresholding and Manifold models achieved similar performance metrics for modeling inundation extent. During the 2021 monsoon season, the flood warning system was operational in India and Bangladesh, covering flood-prone regions around rivers with a total area of 287,000 km2, home to more than 350M people. More than 100M flood alerts were sent to affected populations, to relevant authorities, and to emergency organizations. Current and future work on the system includes extending coverage to additional flood-prone locations, as well as improving modeling capabilities and accuracy., Comment: 36 pages, 10 figures, 3 tables, 1 supplementary table (9 pages)
Published: 2021

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Noga, Hila"'

1. Multi-turn Reinforcement Learning from Preference Human Feedback

2. Flood forecasting with machine learning models in an operational framework

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Publication Year Range

Publication Type

Database

2 results on '"Noga, Hila"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources