OpenAI Dev Day, Issues surface, Pika goes Hollywood and more

OpenAI boosts accessibility with cost-cutting features, Anthropic gains OpenAI co-founder as Chief Scientist, and Pika Labs revolutionizes AI video creation with surreal effects.

Spring 2024 brought a whirlwind of change to the AI landscape, playing out like a three-act drama of innovation.

OpenAI took center stage first, unveiling four powerful updates at DevDay 2024 that promised to make AI more accessible to developers everywhere. Their Realtime API, Vision Fine-Tuning, and cost-saving features opened doors for creators of all sizes, like giving everyone a key to a grand orchestra.

Major shifts rocked the AI landscape as OpenAI saw the departure of key leaders including CTO Mira Murati, Research Chief Bob McGrew, and VP of Research Barret Zoph. CEO Sam Altman responded by appointing Mark Chen and Jakub Pachocki to top research roles. Shortly after, OpenAI co-founder Durk Kingma's move to Anthropic as Chief Scientist stunned Silicon Valley.

Meanwhile, Pika Labs added the final flourish with their Pika 1.5 release, bringing Hollywood-worthy video effects to everyday creators through their enhanced AI tools.

Let’s dive deeper into these developments and explore what they mean for the future of AI and its impact on creators and the industry as a whole.

Four Updates from OpenAI to Enhance AI Accessibility and Affordability

OpenAI's DevDay 2024 unveiled four key updates aimed at enhancing AI accessibility and affordability for developers. The event emphasized a strategy focused on empowering developers rather than launching major new products, catering to smaller organizations in a competitive landscape.

Realtime API allows developers to create low-latency, multimodal applications with voice input and output, facilitating natural interactions and enhancing user experience without extensive backend infrastructure.

Vision Fine-Tuning enables customization of GPT-4o’s image understanding capabilities with small datasets (as few as 100 images), making advanced vision applications more accessible in fields like autonomous vehicles and medical imaging.

Prompt Caching reduces costs and latency by reusing input tokens across API calls, potentially cutting costs by up to 50%. Additionally, Model Distillation offers a streamlined workflow for converting larger models into smaller, efficient versions, making high-quality AI tools more attainable for smaller organizations.

Anthropic Welcomes OpenAI Co-Founder Durk Kingma as Chief Scientist

Anthropic has announced the hiring of Durk Kingma, a co-founder of OpenAI, as its new Chief Scientist. This strategic move is aimed at bolstering Anthropic's research capabilities and enhancing its efforts in developing advanced AI systems. Kingma is well-known for his contributions to the field of artificial intelligence, particularly in the areas of machine learning and natural language processing. In his new role, Kingma will focus on advancing Anthropic's mission to create safe and beneficial AI technologies. His expertise is expected to play a crucial part in shaping the company's research direction and product development. This hiring reflects Anthropic's ongoing commitment to attracting top talent in the AI sector as it competes with other leading firms in the industry. Kingma's background includes significant achievements at OpenAI, where he was instrumental in various projects that pushed the boundaries of AI capabilities. His transition to Anthropic marks a notable shift in the competitive landscape of AI research and development.

Pika 1.5 Launches with Surreal Effects and Enhanced Cinematic Controls

Pika Labs has launched Pika 1.5, an upgraded AI video generator featuring Pikaffects, which allows users to create surreal transformations in videos, such as inflating or crushing objects. The update includes enhanced cinematic controls, enabling complex camera movements like Bullet Time, making it easier to produce professional-quality videos. Users can still access the older Pika 1.0 for specific features like Lip Sync and AI Sound Effects. While subscription prices remain unchanged, the cost to generate clips has increased to 15 credits per five seconds due to the new features. Overall, Pika 1.5 offers unique creative tools that appeal to both amateur and professional creators, enhancing imaginative storytelling in digital media.

Hand Picked Video

In this video, we'll look at OpenAI's Sora and its impact on creativity.

Top AI Products from this week 

  • Semblian 2.0 - Be the first to experience the next groundbreaking evolution in AI-driven work automation, enabling turn-key 𝐀𝐮𝐠𝐦𝐞𝐧𝐭𝐞𝐝 𝐖𝐨𝐫𝐤𝐞𝐫 𝐈𝐧𝐭𝐞𝐥𝐥𝐢𝐠𝐞𝐧𝐜𝐞. Semblian 2.0 - AI with a deep understanding of you and your work

  • Hedy AI - Meetings just got hacked. Hedy is your covert AI assistant, slipping you genius-level insights in real-time

  • guidde - guidde is the generative AI platform for business that helps your team create video documentation 11x faster.

  • buzzabout - Win more customers by understanding their pains, gains, and thoughts. AI-driven tool that extracts real-time insights from billions online conversations.

  • Lookie AI - Lookie is designed for consuming knowledge on YouTube, where it often takes too much time and it's difficult to organize key information. With just a simple share, the process becomes 100x smarter and faster, turning YouTube into a personal knowledge hub.

  • NVLM 1.0 - A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models (e.g., Llama 3-V 405B and InternVL 2).

This week in AI

  • MM1.5: Advancements in Multimodal LLMs - MM1.5 introduces a new family of multimodal large language models, enhancing image understanding and reasoning through optimized training with diverse data mixtures.

  • Causal Inference with Neural Networks - The paper presents a novel method for causal inference using neural networks, focusing on improving the estimation of causal effects in complex data environments.

  • ByteDance to Develop AI Model Using Huawei Chips - ByteDance plans to train a new AI model using Huawei's Ascend 910B chips amid U.S. export restrictions on advanced technology. The company has ordered over 100,000 chips but received fewer than 30,000 so far.

  • Google Tests Gemini Video Search in India - Google is trialing a Gemini-powered video search feature in India, enabling users to capture video snippets and ask questions via Google Lens.