pwshub.com

Hotshot debuts new AI model for generating video clips

Hotshot has debuted an artificial intelligence model that can generate 10-second clips with a resolution of 1280 pixels by 720 pixels.

The startup launched the model, which is also called Hotshot, into public preview on Monday. It joins the growing list of AI video generators on the market. OpenAI, Runway ML Inc. and a number of other startups likewise offer neural networks capable of generating short clips based on user prompts.

Hotshot launched last year with an AI-powered image generation app for consumers. According to VentureBeat, the company appears to have shuttered the service to focus on its efforts in the nascent video generation segment. Hotshot is reportedly backed by SV Angel, angel investor Lachy Groom and Reddit Inc. co-founder Alexis Ohanian.

The company developed its latest video generation model over the course of several months. It created three different neural networks as part of the project: the core Hotshot model and two other neural networks that helped prepare the AI video generator’s training dataset.

In the first phase of the initiative, Hotshot put together a repository of 600 million clips with captions describing their contents. It combined those clips with one billion images to create its video generation model’s training dataset. “We knew we were going to want to train the model on images and videos jointly in order to take advantage of how much more abundant publicly accessible image data is than video,” Hotshot team members John Mullan, Duncan Crawbuck, Chaitu Aluru and Aakash Sastry explained in a blog post.

During the next phase of the project, the company developed an AI model to generate captions for the videos in its training dataset. Captions can help a neural network better understand the clips on which it’s being trained. The additional learnings that the AI consequently gleans increase the quality of its output.

Hotshot found that the existing caption generation models on the market didn’t meet its requirements. In response, the company took one of those existing models and customized it on a training dataset comprising 300,000 clips with manually-created captions. “In a couple weeks, we had a video captioner we were quite happy to use to annotate our hundreds of millions of video samples,” the Hotshot team detailed.

The second auxiliary AI model that the company built to support the development of its video generator was an autoencoder. This is a type of algorithm that can take a piece of data, in this case a video, and remove details that aren’t necessary for AI training. Deleting superfluous information lowers storage requirements and thereby decreases costs.

After preparing the autoencoder and the captioning model, Hotshot spent four months training its AI video generator. The company used several thousand H100 graphics processing units from Nvidia Corp. that racked up millions of processing hours during the project. 

Hotshot’s engineers applied several optimizations to reduce the training run’s infrastructure requirements. The company stored many of the files it used for the project in the bfloat16 format, which can compress 32 bits of data into 16 bits to save storage space. Additionally, Hotshot carried out some of the computations that would have normally been performed during the training process ahead of time to make better use of the Nvidia chips’ processing capacity.

Hotshot’s new AI is accessible as part of a free video generation service on its website. The company also plans to make the model available to developers through an application programming interface.

Source: siliconangle.com

Related stories
3 weeks ago - A flurry of new artificial intelligence models this week illustrated what’s coming next in AI: smaller language models targeted at vertical industries and functions. Both Nvidia and Microsoft debuted smaller large language models too....
3 weeks ago - Viggle AI, a startup with a popular video generation service of the same name, today announced that it has closed a $19 million investment. The Series A round was led by Andreessen Horowitz with participation from Two Small Fish....
3 weeks ago - Unfortunately, trading tips and investment advice come with a lot of the same jargon that make it easy to confuse the two if you’re not paying careful attention.
1 month ago - (Bloomberg) -- Signs that equity investors are getting cold feet over the rapid advances in artificial intelligence leaders have put a spotlight on some less obvious beneficiaries of the technology boom.Most Read from BloombergTrump Risks...
Other stories
34 minutes ago - YouTubers will soon be able to play with a host of new generative artificial intelligence-powered tools for creating content, including the ability to generate six-second YouTube Shorts clips, and backgrounds for their videos, using...
35 minutes ago - Salesforce Inc. is making a major push to deploy AI agents on its CRM platform, an initiative the company views as the next step in enterprise artificial intelligence adoption. Building on its predictive Einstein platform for sales,...
35 minutes ago - In a positive step forward and a possible sign of things to come, artificial intelligence video generation startup Runway AI Inc. has signed a deal with entertainment company Lions Gate Entertainment Corp. to explore the use of AI in...
56 minutes ago - (Bloomberg) -- Asian equities braced for a tailwind from the Federal Reserve’s half-point rate cut and signs of further policy easing in the months ahead.Most Read from BloombergCalifornia’s Anti-Speeding Bill Can Be a Traffic Safety...
56 minutes ago - (Bloomberg) -- US equities will climb through the rest of the year with the Federal Reserve’s aggressive interest-rate cut bolstering the chances of a soft landing for the economy, according to a survey of Bloomberg Terminal...