OpenAI agrees to deal with Reddit to scrape its content for AI training - SiliconANGLE

In the latest development in the war for data among artificial intelligence model builders, ChatGPT is getting a new source of up-to-date content thanks to a deal between OpenAI and Reddit Inc. that was announced today.

The partnership will enable OpenAI’s large language models, including GPT-3.5 and GPT-4, to “better understand and showcase Reddit content, especially on recent topics,” the companies said in a joint statement. In addition, the deal will see OpenAI become an advertising partner to Reddit, running ads on its popular website and app.

The deal is said to be valued at about $60 million, though a Reddit spokesperson declined to disclose its terms when asked by Reuters.

OpenAI announced the partnership on the same day as it rolled out a number of updates to ChatGPT aimed at enhancing its data analysis capabilities, giving users the ability to interact with tables and charts and upload files from Google Drive and Microsoft OneDrive.

The AI phenom has struck a number of deals with publishers to bring more training data to its artificial intelligence models. In recent weeks, it has announced similar partnerships with the likes of the Financial Times and Dotdash Media Inc. Those initiatives followed a deal that was struck with the German publisher Axel Springer SE last year to enable ChatGPT to be trained on content from publications such as Business Insider and Politico in the U.S., and Bild and Die Welt in Germany.

By partnering with Reddit, OpenAI will be able to access that company’s Data API and obtain “real-time, structured and unique content” directly from Reddit. In addition, Reddit will add some new “AI-powered features” to its platform, but it hasn’t said what they might be.

Reddit caused some controversy last year when it announced that it would start charging developers to access its application programming interface, which provides access to its rich repository of human-generated content, including high-quality information. The move resulted in a number of popular third-party Reddit clients shutting down, leading to protests in many popular subreddits.

The company said at the time it had made the decision because a number of large AI companies were scraping its data without paying anything for it. It consequently began a policy of making money from its trove of content, notably striking a deal with Google LLC first, and then OpenAI today.

For OpenAI, the main advantage of the deal is access to a wealth of rich, up-to-date content that can aid in the training of its LLMs. Like other AI firms, OpenAI wants to diversify how it obtains training data beyond simple internet scraping, which has become a contentious practice that potentially violates a lot of copyrights. By partnering with Reddit, it knows it won’t have any legal issues if its chatbots lean too much on Reddit’s content.

For Reddit, the deal brings the company a nice new revenue stream at a time when it’s facing heavy competition for advertising dollars from social media rivals such as Facebook, Instagram and TikTok.

ChatGPT gets better at analyzing data

ChatGPT is also getting a number of enhancements that aim to improve its data analysis skills, giving users the ability to interact with tables and charts via a new, expandable view, OpenAI said in a blog post. In addition, users will be able to feed files to ChatGPT directly from Google Drive and Microsoft OneDrive, and customize and download charts to embed into their presentations and documents.

OpenAI said the improvements build on ChatGPT’s existing ability to understand datasets and complete tasks associated with them. To get started, users simply upload one or more datasets, enabling ChatGPT to analyze them by writing and running Python code on their behalf.

The company says ChatGPT can work with data in a number of ways. For instance, it can merge and clean large datasets, create charts based on the information in Excel files, uncover insights, create summaries and so on. The idea is that novices can perform more in-depth analyses, while advanced users can save time on tasks such as cleaning up their data.
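
To give a sense of what "writing and running Python code" for a merge-and-clean task might look like, here is a minimal sketch using only the Python standard library. It is not OpenAI's actual generated code. The sample data, column names and helper functions are all hypothetical, chosen purely for illustration.

```python
import csv
import io

# Hypothetical sample data standing in for two uploaded files.
SALES_CSV = """order_id,customer_id,amount
1,c1,19.99
2,c2,
3,c1,5.00
3,c1,5.00
"""

CUSTOMERS_CSV = """customer_id,region
c1,West
c2,East
"""


def load_rows(text):
    """Parse CSV text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_sales(rows):
    """Drop rows with a missing amount and exact duplicate rows."""
    seen = set()
    cleaned = []
    for row in rows:
        key = tuple(sorted(row.items()))
        if not row["amount"] or key in seen:
            continue
        seen.add(key)
        # Convert the amount from text to a number for later arithmetic.
        cleaned.append({**row, "amount": float(row["amount"])})
    return cleaned


def merge_on_customer(sales, customers):
    """Attach each customer's region to their sales rows (a left join)."""
    region_by_id = {c["customer_id"]: c["region"] for c in customers}
    return [
        {**s, "region": region_by_id.get(s["customer_id"], "unknown")}
        for s in sales
    ]


def total_by_region(rows):
    """Summarize the merged data as total sales per region."""
    totals = {}
    for row in rows:
        totals[row["region"]] = totals.get(row["region"], 0.0) + row["amount"]
    return totals


merged = merge_on_customer(clean_sales(load_rows(SALES_CSV)), load_rows(CUSTOMERS_CSV))
summary = total_by_region(merged)
print(summary)
```

In this sketch, the row with a blank amount and the repeated order are discarded during cleaning, and the remaining rows are joined to the customer table before being totaled by region.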

The improved data analysis capabilities will be made available within OpenAI’s newest flagship model, GPT-4o, for ChatGPT Plus, Team and Enterprise subscribers only.
