Illustration: Alex Castro / The Verge
Reddit announced new API changes today that will eventually pinch its content pipeline from being used to train artificial intelligence tools, including the models that power OpenAI’s ChatGPT, Google’s Bard, and Microsoft’s Bing AI. AI chatbots’ abilities to provide powerful answers have data resources like Reddit to thank — and now Reddit is planning to put that robot food behind a paywall.
Social media resources, including Reddit, are some of the sources used to train large language models (LLM) that can provide cogent responses to human prompts. Some of this data can be scraped in an unstructured manner, but Reddit’s API has helped companies make it easy to directly find and package useful data.
Reddit’s API, which has been available…