Radio and PodcastRadio and PodcastLive Radio & Podcasts
118. Angela Fan - Generating Wikipedia articles with AI artwork
Technology

118. Angela Fan - Generating Wikipedia articles with AI

Towards Data Science by The TDS team

Apr 6, 202200:51:44Technology

Generating well-referenced and accurate Wikipedia articles has always been an important problem: Wikipedia has essentially become the Internet's encyclopedia of record, and hundreds of millions of people use it do unders...

About This Episode

118. Angela Fan - Generating Wikipedia articles with AI is an episode from Towards Data Science by The TDS team . Generating well-referenced and accurate Wikipedia articles has always been an important problem: Wikipedia has essentially bec...

Podcast

This episode belongs to Towards Data Science.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Apr 6, 2022, 00:51:44 long, audio available.

Questions About This Episode

What is 118. Angela Fan - Generating Wikipedia articles with AI about?

Generating well-referenced and accurate Wikipedia articles has always been an important problem: Wikipedia has essentially become the Internet's encyclopedia of record, and hundreds of millions of people use it do understand the world. But over the last decade Wikipedia has also become a critical source of training data for data-hungry text generation models. As a result, any shortcomings in Wikipedia’s content are at risk of being amplified by the text generation tools of the future. If one type of topic or person is chronically under-represented in Wikipedia’s corpus, we can expect generative text models to mirror — or even amplify — that under-representation in their outputs. Through that lens, the project of Wikipedia article generation is about much more than it seems — it’s quite literally about setting the scene for the language generation systems of the future, and empowering humans to guide those systems in more robust ways. That’s why I wanted to talk to Meta AI researcher Angela Fan, whose latest project is focused on generating reliable, accurate, and structured Wikipedia articles. She joined me to talk about her work, the implications of high-quality long-form text generation, and the future of human/AI collaboration on this episode of the TDS podcast. --- Intro music: - Artist: Ron Gelinas - Track Title: Daybreak Chill Blend (original mix) - Link to Track: --- Chapters: 1:45 Journey into Meta AI 5:45 Transition to Wikipedia 11:30 How articles are generated 18:00 Quality of text 21:30 Accuracy metrics 25:30 Risk of hallucinated facts 30:45 Keeping up with changes 36:15 UI/UX problems 45:00 Technical cause of gender imbalance 51:00 Wrap-up

Where can I listen to 118. Angela Fan - Generating Wikipedia articles with AI?

You can listen to 118. Angela Fan - Generating Wikipedia articles with AI online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is 118. Angela Fan - Generating Wikipedia articles with AI from?

118. Angela Fan - Generating Wikipedia articles with AI is an episode from Towards Data Science by The TDS team .

How long is this episode?

This episode is 00:51:44 long.

When was this episode published?

This episode was published on Apr 6, 2022.

Can I save 118. Angela Fan - Generating Wikipedia articles with AI for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Towards Data Science?

Yes. This page shows related episodes from Towards Data Science when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to 118. Angela Fan - Generating Wikipedia articles with AI?

You can listen to 118. Angela Fan - Generating Wikipedia articles with AI on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

118. Angela Fan - Generating Wikipedia articles with AI is from Towards Data Science by The TDS team .

What are the episode details?

Published Apr 6, 2022 and 00:51:44 long