Are LLMs safe?
Feb 29, 2024 - 00:42:15
Radio and PodcastLive Radio & Podcasts
We discussed adversarial dataset construction and dynamic benchmarking in this episode with Douwe Kiela, a research scientist at Facebook AI Research who has been working on a dynamic benchmarking platform called Dynaben...
128 - Dynamic Benchmarking, with Douwe Kiela is an episode from NLP Highlights by NLP Highlights. We discussed adversarial dataset construction and dynamic benchmarking in this episode with Douwe Kiela, a research scientist at Facebook AI R...
This episode belongs to NLP Highlights.
Use the player on this page to stream the episode online.
Published Jun 19, 2021, 00:47:00 long, audio available.
We discussed adversarial dataset construction and dynamic benchmarking in this episode with Douwe Kiela, a research scientist at Facebook AI Research who has been working on a dynamic benchmarking platform called Dynabench. Dynamic benchmarking tries to address the issue of many recent datasets getting solved with little progress being made towards solving the corresponding tasks. The idea is to involve models in the data collection loop to encourage humans to provide data points that are hard for those models, thereby continuously collecting harder datasets. We discussed the details of this approach, and some potential caveats. We also discussed dynamic leaderboards, a recent addition to Dynabench that rank systems based on their utility given specific use cases. Papers discussed in this episode: 1. Dynabench: Rethinking Benchmarking in NLP ( 2. Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking ( 3. Adversarial NLI: A New Benchmark for Natural Language Understanding ( 4. DynaSent: A Dynamic Benchmark for Sentiment Analysis ( Douwe Kiela's webpage: The hosts for this episode are Pradeep Dasigi and Alexis Ross.
You can listen to 128 - Dynamic Benchmarking, with Douwe Kiela online on Radio and Podcast. Open the player on this page to stream the available audio.
128 - Dynamic Benchmarking, with Douwe Kiela is an episode from NLP Highlights by NLP Highlights.
This episode is 00:47:00 long.
This episode was published on Jun 19, 2021.
Yes. Use the heart button on the episode page to add it to your favorite episodes list.
Yes. This page shows related episodes from NLP Highlights when more episodes are available from the podcast feed.
You can listen to 128 - Dynamic Benchmarking, with Douwe Kiela on this page when the episode audio is available from the podcast feed.
128 - Dynamic Benchmarking, with Douwe Kiela is from NLP Highlights by NLP Highlights.
Published Jun 19, 2021 and 00:47:00 long