AI Report by Explainx
Posts
OpenAI killed a few startups, again..

OpenAI killed a few startups, again..

Last week was all about Open source and AI Agents, this week is all about AI Giants like OpenAI, Nvidia, Google making their next move. We look at SORA, Gemini Pro v1.5, and Nvidia's RTX Chat.

February 16, 2024

Last week was all about Open source and AI Agents, this week is all about AI Giants like OpenAI, Nvidia, Google making their next move.

The most shocking thing to happen this week though is the SORA Model by OpenAI, a novel text to video generation model that will change everything. Gemini announced their v1.5, increasing their context size to 1M Tokens! If you don’t know why this is a big deal, keep reading. Nvidia announced their own chatbot. and a lot more!

I’m so excited to bring all of this to you.

AI Stuff you should know:
- SORA Model, OpenAI
- Why Gemini v1.5 is a Huge deal
Top AI Products launched this week
Video Guide for running all open source models, locally
This week in AI (Quick skim through top AI News)

One more thing. I mentioned how Perplexity AI is a big deal. Well:

Text within this block will maintain its original spacing when published👋 I just launched a new course full course on Perplexity AI, the Potential Google Killer. If you’d like to enrol at a 90% discount, here’s your chance. We cover features, use cases and pretty much everything you need to know.

Let’s jump in.

AI Stuff you should know

OpenAI SORAes above them all

OpenAI just introduced Sora, an innovative AI model capable of creating videos from text prompts. This model can generate realistic and imaginative scenes, complete with complex motions, accurate details, and expressive characters. Sora is currently in the testing phase, with access limited to select red teamers and creative professionals for feedback and safety assessments.

Gorgeous, no? We decided to cover how SORA fares vs Runway’s Gen-2 and Pika Video Generator, not a great news for them, here’s what you should look at.

Stats and Bullets

Sora can generate videos up to a minute long with high visual quality.
It uses a diffusion model and transformer architecture for video creation.
The model can animate still images and extend existing videos with precision.
Sora is being tested for potential risks, including misinformation and bias.
Access is currently limited to red teamers and invited visual artists and designers.

Business Use Cases

Sora has the potential to revolutionize various industries with its text-to-video capabilities. Here are some potential business applications:

Marketing and Advertising: Create engaging video content for campaigns directly from text descriptions.
Film and Animation: Assist filmmakers and animators in generating video mockups or storyboards quickly.
Education and Training: Produce educational videos and simulations for various learning scenarios.
Gaming: Generate cutscenes or in-game animations based on narrative text.
Event Planning: Visualize events like weddings or corporate functions before they happen.

Why it Matters

Sora represents a significant leap in AI's ability to understand and simulate the physical world in motion. This technology could democratize video production, making it accessible to those without traditional video editing skills. However, it also raises important questions about the future of creative jobs and the potential for misuse in generating misleading content. OpenAI's cautious approach to Sora's release reflects the balance between harnessing AI's creative potential and ensuring its responsible use.

Google’s Gemini gets a v1.5

Google has unveiled Gemini 1.5, a next-generation AI model that boasts a massive context window and improved performance across various benchmarks. This model is designed to handle a wide range of data inputs, including text, images, and audio, and can process information with a context window of up to 1 million tokens. Currently, Gemini 1.5 is available to developers and enterprise users, with plans for a broader consumer rollout in the future.

Stats and Bullets

Gemini 1.5 has a context window of up to 1 million tokens, significantly larger than its predecessor and competitors.
The model outperforms Gemini 1.0 Pro in 87% of benchmarks and is comparable to Gemini 1.0 Ultra.
It utilizes a Mixture of Experts (MoE) architecture for improved efficiency in training and serving.
Gemini 1.5 can process complex data such as an hour of video, 11 hours of audio, or codebases with over 30,000 lines.
Google is actively working on safety and ethical considerations for the model, including testing for content safety and representational harms.

Business Use Cases

Gemini 1.5's advanced capabilities open up new possibilities for businesses:

Software Development: Analyze and suggest improvements to large codebases.
Content Creation: Summarize and reason about extensive documents or multimedia content.
Research and Analysis: Process and interpret vast amounts of data for insights.
Customer Support: Provide in-depth, context-aware responses to customer inquiries.
Education: Enhance learning experiences with detailed analysis of educational materials.

Why it Matters

Gemini 1.5 represents a significant advancement in AI, with its ability to understand and process large amounts of information in a single context. This leap forward in AI technology could transform how businesses and developers work with data, making complex tasks more manageable and efficient. However, the introduction of such powerful tools also necessitates careful consideration of ethical implications and the potential impact on jobs and society as a whole.

Top AI Tools this week

Fforward.ai - Upload interviews to fforward to extract user needs, score opportunities, identify themes, and evidence-base your roadmap, sparking new solutions and stakeholder discussions.

Visme AI Designer - Turn ideas into stunning visuals with Visme AI Designer. Just input a prompt, pick a style, and let AI craft editable, professional designs for presentations, social media, and more.

MagiScan AI 3D Scanner app - MagiScan: AI 3D scanning app turns objects into models with multiple export formats, including for NVIDIA Omniverse and Minecraft block integration.

NVIDIA Chat with RTX - Chat With RTX: Customize a GPT LLM to interact with your docs, notes, videos, or data for personalized conversations.

Propeller - Propeller transforms education with AI-driven interactive roleplaying, providing deeply immersive experiences tailored to limitless customization options.
Anxiety Simulator - This app simulates messaging conversations to help users comprehend the experience of someone with anxiety, providing insights into both the friend's thoughts and the user's messages.
Studio Neiro AI - Generate video avatars with human-like features and micro-expressions that accurately represent your brand script or audio speech. Customize the voice of the AI avatar to match the speaker's persona.

Handpicked video for you

You can run any open source model locally with this approach.

This week in AI

Bumble dating app - Bumble fights catfishing with Deception Detector™. A.I. blocks 95% of scams, reducing fake accounts by 45% in two months. CEO Lidiane Jones leads the charge.

NVIDIA Chat with RTX - NVIDIA unveils GeForce RTX™ SUPER GPUs for enhanced generative AI, AI laptops, and RTX-accelerated AI tools. Transformative breakthroughs in PC experiences and gaming.

Meet DeepGO-SE - an AI tool revolutionizing protein function prediction. Using advanced language models, it excels in accuracy and efficiency. A game-changer for drug discovery and biotech exploration.

NVIDIA's AI Triumph - Nvidia surpasses Amazon, becoming the third most valuable U.S. company at $1.82 trillion. Their AI breakthroughs redefine industries, earning accolades and reshaping the tech landscape. Kudos to Nvidia for leading the AI revolution!
Karpathy's Exit from OpenAI - Andrej Karpathy departs OpenAI for the second time, emphasizing no specific incident led to his exit. Despite enjoying his time at OpenAI, he's set to explore personal projects. The AI community eagerly awaits his next move.

A lot goes into this newsletter, since we’re constantly on the look out. I’d love to hear how your experience is, please write to us as [email protected] or reply to this email so we know how you feel and if it does really add value to you as a user.