Welcome to the AI Connection Club, a welcoming and interactive community centered around AI, where members can learn, share knowledge, stay updated, and support each other.

AI Tips and News of the Week

AND NEWS 2

Invideo 3.0 update

invideo 3 b

Big news from InVideo - their latest update, V3, is a game-changer for video creation. You can now create entire videos—script, footage, voiceovers, music, subtitles, animations, the whole package—just by typing a single text prompt. No editing skills or juggling multiple tools needed!

This means anyone, whether you’re a creator, marketer, or entrepreneur, can easily produce professional-quality videos to tell your story, promote a product, or create engaging content for social media. Imagine whipping up a polished video ad or even translating existing videos with just a few clicks!

If you’ve ever felt intimidated by video editing, this might be the perfect time to give it a try here.

ChatGPT update giving it "eyes"

ChatGPT

OpenAI has unveiled a groundbreaking feature for ChatGPT—real-time video and voice interaction through its new Advanced Voice mode. This update enables the chatbot to visually interpret its surroundings and respond contextually to what users show through their device's camera.

For example, users can display items or situations, and ask for guidance. The chatbot can provide detailed, step-by-step instructions, answer clarifying questions, and adapt its responses to what’s in the frame. Additionally, users can share their device screen, allowing ChatGPT to view and assist with tasks, such as drafting replies to messages within a messenger app.

This feature is part of ChatGPT Plus and Pro subscription plans and will roll out next week. Business and educational users can expect access to this functionality by early 2025. With these advancements,

MagicQuill (revolutionary image editing)

MagicQuill

MagicQuill is here to make image editing simpler, smarter, and more fun for everyone. With its AI-powered tools and an intuitive interface, you can easily do things like insert new elements, erase objects, or tweak colors—no complex skills required.

What’s really cool? MagicQuill understands what you’re trying to do in real time, so there’s no need to type out prompts or navigate tricky menus. Just a few quick strokes, and you’re in control, getting exactly the edits you want with precision and ease.

Whether you’re working on casual photo tweaks or intricate design projects, MagicQuill combines powerful AI with simplicity to bring your creative vision to life. If you’ve been looking for a tool that makes advanced editing feel effortless, this one’s definitely worth checking out.

MultiFoley

MultiFoley

MultiFoley is an impressive new AI tool for creating soundtracks that perfectly match silent videos, whether you’re going for realistic sound effects or something more imaginative. With MultiFoley, you can generate high-quality, synchronized sounds using text, audio, or video as inputs.

One of its coolest tricks is that you can guide it with reference sounds—like pulling audio from a sound effects library or a partial video soundtrack—and it will build a complete, seamless audio experience. Need a skateboard’s wheels spinning without the wind noise, or maybe a lion’s roar that sounds like a cat’s meow? MultiFoley can help you with this.

It’s super versatile, too. You can use it to create sounds based on text prompts, extend incomplete soundtracks, or tweak audio using existing references. By combining AI smarts with professional sound effects, it produces clear, full-bandwidth audio that’s perfect for everything from film production to creative projects.

If sound design is part of your workflow, MultiFoley could save you a lot of time while opening up endless creative possibilities.

NotebookLM update

nlm

Google has just rolled out some exciting updates to NotebookLM, their AI-powered productivity tool, and they’re pretty game-changing.

The standout feature? You can now jump into the podcast conversations with their new Audio Overview update. This means you can interact with the audio using your voice—ask questions, get extra details, or even request different explanations, all in real-time. It’s like being part of the discussion!

They’ve also redesigned the interface to make things easier and more intuitive. You’ve got three key panels now: one for keeping track of your sources, one for AI-powered chats (with citations!), and another for creating things like study guides and custom audio overviews.

And for those who need even more power, there’s a new premium tier called NotebookLM Plus coming early next year. It’s built for teams, schools, and businesses, offering more storage, shared notebooks, and collaboration features.

You can check it out here.

Pika 2 update

Pika 2 update

Pika Labs has introduced Pika 2.0, a fun and user-friendly AI video generator designed for everyday creators, not just big studios. One of its standout features is the Scene Ingredients tool, which lets you upload your own characters, props, and settings to mix with AI-generated content. Whether it’s a dragon flying over a castle or a cat surfing through space, you get more control to bring your ideas to life.

Unlike traditional text-prompt-based video tools, Pika 2.0 has improved text alignment for better results and upgraded motion rendering for smoother, more natural movements. It’s made with small creators and social media users in mind, making it perfect for TikToks, marketing clips, or just having fun with creative video projects.

Available for both free and paid users, Pika 2.0 is all about making video creation accessible and enjoyable for “actual people,” as they put it.

Nvidia's Fugato

unnamed (1)

Fugatto by Nvidia is a revolutionary AI model that generates and transforms audio using text and audio prompts. It allows users to compose music, modify voices, add or remove instruments, and even create entirely new sounds.

It allows fine-grained control over attributes like accent, emotion, and sound evolution. For example, it can morph sounds over time, such as a train transitioning into a string orchestra, or a choir.

Its debut showcased impressive creativity, from music with barking dogs to instruments mimicking animal sounds, marking Fugatto as a groundbreaking leap in generative audio technology.

The video below shows what amazing capabilities it will give to filmmakers.

Hunyuan video generator

unnamed (2)

Hunyuan AI Video is a new, state of the art, AI Video Generator that creates high-quality videos from text descriptions. With massive horsepower and state-of-the-art performance, it claims to be the most powerful open-source video generation model available.

It generates high-quality AI videos with superior motion stability, scene transitions, and realistic visuals.

Try it at https://fal.ai/models/fal-ai/hunyuan-video

CapCut's AI Avatar Generator

You can now make a lip-syncing talking Custom AI Avatar for free in Capcut!

unnamed (3)

CapCut's AI avatar generator is completely free to use. You can create and personalize your avatar without any subscriptions or hidden fees, allowing you to explore your creativity without breaking the bank.

From Capcut's promo:

"Key Features of CapCut’s AI Avatar Generator

  • Diverse Avatar Styles: Explore a library of unique styles, from bold and graphic to soft and whimsical, tailored to your vision.
  • User-Friendly Interface: Create avatars effortlessly with intuitive tools, perfect for beginners and pros alike.
  • Extensive Customization: Personalize avatars with detailed features, sound effects, and backgrounds to reflect your individuality.

Benefits of Using CapCut’s AI Avatar Generator

  • Free Creativity: Design avatars at no cost, eliminating the need for expensive software or subscriptions.
  • Pre-Designed Templates: Start with diverse character templates that inspire and simplify the creative process.
  • Seamless Video Integration: Easily incorporate custom avatars into videos with CapCut’s editing tools.

Creative Applications of AI Avatars

  • Gaming & Entertainment: Enhance gameplay commentary or skits with unique character avatars.
  • Marketing & Advertising: Create memorable campaigns featuring custom avatars to elevate your brand.
  • Reaction & Review Videos: Add personality and engagement to your content with visually captivating avatars."

Learn more on how to use CapCut AI avatar generator here: https://www.capcut.com/tools/free-avatar-creator

Google Raises the Bar in Video Generation

unnamed

Google just announced Veo 2 which produces another advance in video quality, out-performing even OpenAI's Sora. They also announced Imagen 3, an upgraded image model also offering state-of-the-art quality.

While video models frequently “hallucinate” unwanted details—such as extra fingers or unexpected objects—Veo 2 minimizes these occurrences, resulting in more realistic outputs.

Additionally, Veo 2 embeds an invisible SynthID watermark in its videos, allowing them to be identified as AI-generated. This helps mitigate risks of misinformation and misattribution.

Visit Google Labs to sign up for the waitlist. They also plan to expand Veo 2 to YouTube Shorts and other products next year.

Read more about it at https://blog.google/technology/google-labs/video-image-generation-update-december-2024

Imagen 3 outperformed all models, including Midjourney, Flux, and Ideogram, in human evaluations for preference, visual quality, and prompt adherence. The model is now available through Google Labs’ ImageFX.

PUT YOUR FRIENDS IN ANY ENVIRONMENT IN ANY POSITION

OpenArt combines numerous great image generation and editing tools into one online program, but what sets it apart is its ability to train a "model" composed of different images that you upload of a friend, a family member, a pet etc. that you can then place into any environment, in any pose, and any style.

You can see it in action in this great video from Bob Doyle at 5:10 to about 20:30: https://www.youtube.com/watch?v=gEjm0Mc1jkc 

 

 

 

 

Ai-Da, a humanoid robot artist, just made history by selling her portrait of Alan Turing for over $1 million at Sotherby's. The painting is below.

2.-Ai-God-Polyptych-by-Ai-Da-Robot

OMNIGEN - REMARKABLE NEW IMAGE EDITOR

omnigen

Imagine being able to say "take the person on the left in image 1 and the middle person in image 2 and have them [whatever you went them to do, wherever you want them to do it]. Or telling it to deblur an image, or add or remove things when combining multiple images or parts of images.

Omnigen can do this and much more. You just tell it what you want it to do to the image and it does it. You can see it in action at https://www.youtube.com/watch?v=PCL9SAlHqzw

And try it out at https://huggingface.co/spaces/Shitao/OmniGen

Warning: Being designed by geeks, it's not the most intuitive, and it can cost you in credits after a while. If you have a powerful enough PC and graphics card, you can install it locally and use it for free with no limits.

RECRAFT AKA RED PANDA

fb68852f-4c99-4ff6-aa79-a50ba8a8aa1e

Another new image generator, Recraft.ai, has appeared and is claiming to be the best, but in all the tests I've seen, while it is actually on a par with the best - Ideogram, MidJourney, Flux etc. - it is not better than them.

It is very good for photorealism and long text, and has similar extra features to some of the others (upscaling, background removal, erasing portions), and it adds vector images, collages, and mockups. There is a free version, so it is definitely worth a try.

RUNWAYML ADDS ADVANCED CAMERA CONTROLS

runway-advanced-camera-control-1456x1202

RunWayML has added advanced camera controls the give ultraprecise, and much easier to use camera controls when you are generating your videos.

You can check these out at https://www.youtube.com/watch?v=0buDtZKLDJ8

WONDERANIMATION

wonderanim

Wonder Dynamics, the folks who enabled us to drop animated CGI characters into our videos, and who I featured in in many of my seminars, have now introduced WonderAnimation, which turns any footage that you shoot into fully rendered 3D animated scenes that you have full post-production control over!

You can literally shoot a scene with any camera, (or phone) in any location, and turn the sequence into an animated scene with CG characters in a 3D environment - even with shots from multiple angles!

You can read about it at https://adsknews.autodesk.com/en/news/autodesk-launches-wonder-animation-video-to-3d-scene-technology/

You can see it in action at https://www.youtube.com/watch?v=xad1ajxln28

CHATGPT SEARCH

Per ChatGPT, it "can now search the web in a much better way than before. You can get fast, timely answers with links to relevant web sources, which you would have previously needed to go to a search engine for. This blends the benefits of a natural language interface with the value of up-to-date sports scores, news, stock quotes, and more.

ChatGPT will choose to search the web based on what you ask, or you can manually choose to search by clicking the web search icon.

Search will be available at chatgpt.com (opens in a new window), as well as on our desktop and mobile apps. All ChatGPT Plus and Team users, as well as SearchGPT waitlist users, will have access today. Enterprise and Edu users will get access in the next few weeks. We’ll roll out to all Free users over the coming months."

ChatGPT also added a much-needed conversation search function at the top left enabling you to search through all your previous conversations.

IDEOGRAM AND MIDJOURNEY ADD IMAGE EDITING

Both ideogram and MidJourney have introduced excellent editing tools for the images you create with them, or that you upload.

id dogs 2

With Ideogram's Canvas, you can upload your own images or generate new ones, then seamlessly edit, extend, or combine them using Magic Fill (inpainting - adding things to the image, like the girl added above) and Image Extending (outpainting) tools. You can also seamlessly combine multiple images into one unified image. Magic Fill allows you to edit specific regions of your images to replace objects, add text, fix imperfections, change backgrounds, and more.

mj cars

With Midjourney, users can upload any image of their choosing and edit sections of it with AI, or change the style and texture of it from the source to something totally different, such as turning a vintage photograph into anime — while preserving most of the image’s subjects and objects and spatial relationships. It also works on doodles and hand drawings that the user submits, turning scribbles into full art pieces in seconds.

Shorts:

RunwayML introduced Act-One, an extraordinary way to add fully controllable facial expressiveness to any face - real or animated - in a video. Instead of trying to explain all that it does, check it out here: https://runwayml.com/research/introducing-act-one

Stability AI released the open source Stable Diffusion 3.5 - with improved photorealism of people and much better rendering of hands.

Alibaba’s MIMO - Alibaba's got a new AI tool called MIMO  that can swap out people in videos using just a single photo reference, and change them into whatever characters you like, doing whatever you wish. It eliminates the need for complicated stuff like multi-camera setups or motion capture.


Leonardo in Canvas
-
Canva has launched Dream Lab which incorporates Leonardo in its text to image creations. The new Dream Lab tool can generate up to 19 different types of graphics, including 3D renders and illustrations, and can also reference other images to fine-tune outputs, making its outputs more reliable. It’s also capable of generating multi-subject images and photorealistic portraits.

HEYGEN INTERACTIVE AVATARS FOR ZOOM

heygen new

HeyGen has introduced an innovative feature that allows users to integrate AI-powered avatars into Zoom meetings, enhancing virtual interactions. These Interactive Avatars can join multiple Zoom sessions simultaneously, operating 24/7, and are designed to look, sound, and behave like the user, making real-time decisions based on provided knowledge bases. https://www.heygen.com/


Key Features:

  • Real-Time Interaction: The avatars engage in dynamic conversations, responding promptly to participants using OpenAI's real-time voice integration. This ensures natural and efficient interactions during meetings.
  • Versatility: Suitable for various applications such as online coaching, customer support, sales calls, and interviews, these avatars can handle repetitive tasks, allowing users to focus on more critical aspects of their work.
  • Personalization: Users can create custom avatars that mirror their appearance and voice, and how they speak, providing a consistent and authentic presence in virtual meetings. Additionally, users can create up to 100 different "looks" for their avatar, enabling variations in backgrounds, outfits, and camera angles to keep the virtual presence engaging and versatile.

While it is definitely getting better all the time, the avatars still look and sound fake to me - almost there, but not quite.

krea new

Image generator Krea - https://www.krea.ai/ - has released a major update where they partnered with some of the top AI video generators to bring multiple video models into Krea. Now you can create videos with MiniMax, LumaLabs, RunwayML, Pika Labs and Kling all in the one place.

They also have real-time image generation, image to video, and can upscale images and videos, as well as animations that morph from one image to another.

adobe new

NEW ADOBE AI TOOLS

At Adobe MAX 2024, Adobe announced many new AI features which include:

Adobe Firefly Video Model (Beta): Adobe expanded its Firefly family of generative AI models to include video, enabling creators to generate videos from text and image prompts. This model is designed to be commercially safe and is integrated into Premiere Pro, offering features like Generative Extend to seamlessly add frames to video clips .

Photoshop Enhancements: Photoshop received several AI-driven updates:

  • Distraction Removal: Automatically identifies and removes elements like people, wires, and poles from images.
  • Generative Workspace (Beta): Allows designers to ideate and iterate concepts simultaneously using generative AI.
  • Substance 3D Viewer (Beta): Enables viewing and editing 3D objects within Photoshop.
  • Premiere Pro Enhancements:  Premiere Pro introduces Generative Extend, allowing editors to seamlessly add frames to video clips using AI.
  • Adobe Express:  Adobe Express introduces new AI capabilities to simplify content creation, such as campaign creation, animation, and one-click brand setup.

NOT DIAMOND

My favorite new GPT is Not Diamond at https://chat.notdiamond.ai

Like Poe, it enables you to try different GPTs (ChatGPT, Claude, Gemini, Perplexity etc.) in the one place, but it does more. Based on what you ask, it chooses the best GPT for your query.

And you can compare the output of different GPTs side by side. And it does image generation, including the new Flux. And it is free.

INSTANT PODCASTS

Google recently enhanced its NotebookLM tool with an experimental Audio Overview feature, turning any collection of sources into a captivating podcast discussion hosted by two AI personalities. The AI-generated dialogue is downloadable, engaging, and tailored for auditory learners, as advertised by Google.

However, the feature goes beyond mere audio playback. The AI hosts display remarkable pacing, tone, and delivery, mimicking the natural flow of a human conversation. It's quite remarkable.

Credit: Lifehacker

FREE YOUTUBE TRANSCRIPTS

Another way to get a free transcript of a YouTube video is to add 3 "t's"after the youtube in the address - e.g. https://www.youtubettt.com/watch?v=cw0UOQd3ZB8 of any YouTube video you're watching.