News from the AI & ML world

DeeperML - #videogeneration

Kara Sherrer@eWEEK //
Runway AI Inc. has launched Gen-4, its latest AI video generation model, addressing the significant challenge of maintaining consistent characters and objects across different scenes. This new model represents a considerable advancement in AI video technology and improves the realism and usability of AI-generated videos. Gen-4 allows users to upload a reference image of an object to be included in a video, along with design instructions, and ensures that the object maintains a consistent look throughout the entire clip.

The Gen-4 model empowers users to place any object or subject in different locations while maintaining consistency, and even allows for modifications such as changing camera angles or lighting conditions. The model combines visual references with text instructions to preserve styles throughout videos. Gen-4 is currently available to paying subscribers and Enterprise customers, with additional features planned for future updates.

Recommended read:
References :
  • Analytics India Magazine: Runway Introduces its Next-Gen Image-to-Video Generation AI Model
  • SiliconANGLE: Runway launches new Gen-4 AI video generator
  • THE DECODER: Runway releases Gen-4 video model with focus on consistency
  • venturebeat.com: Runway's new Gen-4 AI creates consistent characters across entire videos from a single reference image, challenging OpenAI's viral Ghibli trend and potentially transforming how Hollywood makes films.
  • www.producthunt.com: Product Hunt page for Runway Gen-4.
  • eWEEK: The Gen-4 model aims to solve several problems with AI video generation including inconsistent characters and objects.
  • iThinkDifferent: Runway has released Gen-4, its latest AI model for video generation. The company says the system addresses one of the biggest challenges in AI video generation: maintaining consistent characters and objects throughout scenes.
  • Charlie Fink: Runway Gen-4 Upstages ChatGPT Image Upgrades As Higgsfield, Udio, Prodia, And Pika Launch New Tools

Emily Forlini@PCMag Middle East ai //
Google is enhancing its Workspace platform, with Google Drive now offering searchable video transcripts. This new feature, rolling out to all Google Workspace users by March 26th, allows users to access and search transcripts for videos stored in Drive. The transcripts appear in a sidebar next to the video player, highlighting the currently spoken text, making it easier to find specific moments. Users can enable transcripts by clicking the settings icon and selecting "Transcript," provided the video already has captions, indicated by the "CC" button.

Google DeepMind has announced pricing for its Veo 2 video generation model, setting the cost at $0.50 per second of generated video, translating to $30 per minute or $1,800 per hour. Veo 2 creates videos with realistic motion and high-quality output, up to 4K, from a simple text prompt. This pricing positions Veo 2 as a premium service targeted towards professionals and enterprises, offering a competitive alternative compared to conventional filmmaking methods, despite additional expenses like human labor.

Recommended read:
References :
  • Dataconomy: Google Veo 2 pricing: 50 cents per second of AI-generated video
  • TechCrunch: Google Drive users can now access and search transcripts for videos
  • The Verge: Google Drive gets searchable video transcripts
  • PCMag Middle East ai: Google's Veo 2 costs $1,800 per hour for AI-generated videos.
  • Shelly Palmer: Google’s Veo 2, the AI video generator unveiled in December, has a price tag that’s turning heads: 50 cents per second.
  • TechCrunch: Google has quietly revealed the pricing of Veo 2, the video-generating AI model that it unveiled in December.

Emily Forlini@PCMag Middle East ai //
Google DeepMind has announced the pricing for its Veo 2 AI video generation model, making it available through its cloud API platform. The cost is set at $0.50 per second, which translates to $30 per minute or $1,800 per hour. While this may seem expensive, Google DeepMind researcher Jon Barron compared it to the cost of traditional filmmaking, noting that the blockbuster "Avengers: Endgame" cost around $32,000 per second to produce.

Veo 2 aims to create videos with realistic motion and high-quality output, up to 4K resolution, based on simple text prompts. While it's not the cheapest option compared to alternatives like OpenAI's Sora, which costs $200 per month, Google is targeting filmmakers and studios with larger budgets. The primary customers for Veo are filmmakers and studios, who typically have bigger budgets than film hobbyists. They would run Veo throughVertexAI, Google's platform for training and deploying advanced AI models."Veo 2 understands the unique language of cinematography: ask it for a genre, specify a lens, suggest cinematic effects and Veo 2 will deliver," Google says.

Recommended read:
References :
  • Shelly Palmer: Shelly Palmer discusses Google’s Veo 2, an AI video generator priced at 50 cents a second.
  • www.livescience.com: LiveScience reports Google's AI is now 'better than human gold medalists' at solving geometry problems.
  • PCMag Middle East ai: Google's Veo 2 Costs $1,800 Per Hour for AI-Generated Videos
  • THE DECODER: Google Deepmind sets pricing for Veo 2 AI video generation
  • Dataconomy: Google Veo 2 pricing: 50 cents per second of AI-generated video
  • TechCrunch: Reports Google’s new AI video model Veo 2 will cost 50 cents per second.

Ashutosh Singh@The Tech Portal //
References: SiliconANGLE , THE DECODER , Maginative ...
Elon Musk's xAI has acquired Hotshot, a startup specializing in AI-powered video generation. Hotshot, founded by Aakash Sastry and John Mullan, has developed three video foundation models: Hotshot-XL, Hotshot Act One, and Hotshot. The move signals xAI's intention to enter the AI video generation market, potentially competing with OpenAI's Sora and Google's Veo 2.

The acquisition will see Hotshot's models scaled on xAI's supercomputer, Colossus, which utilizes a vast number of Nvidia chips. Hotshot trained its models on 600 million video clips, employing techniques like neural networks for automatic captioning and the bfloat16 data format to accelerate AI training. The company discontinued new video creation on March 14, 2025, and allowed existing users to download their content until March 30.

Recommended read:
References :
  • SiliconANGLE: XAI acquires AI video generation startup Hotshot
  • THE DECODER: Elon Musk's AI company xAI buys AI video generation startup Hotshot
  • The Tech Portal: Musk’s xAI acquires gen-AI video startup ‘Hotshot’ to compete with OpenAI’s Sora and Google’s Veo 2
  • Maginative: xAI Buys Hotshot, a Startup Working on AI-Generated Video