Google’s Newest Approaches to Multimodal Foundational Mannequin | by Eileen Pangu

Multimodal foundational fashions are much more thrilling than giant language fashions. Let’s overview Google analysis’s current progress to have a glimpse of the bleeding edge.

Picture supply: https://unsplash.com/photos/U3sOwViXhkY

Background

Whereas the hype on giant language mannequin (LLM) remains to be iron sizzling within the trade, the main analysis organizations have turned their eyes to multimodal foundational fashions — fashions which have the identical scale and flexibility traits as LLM however can deal with knowledge past simply textual content, similar to photographs, audio, sensor indicators, and so forth. Multimodal foundational fashions are believed by many to be the important thing to unlock the subsequent part of Synthetic Intelligence (AI) advance.

On this weblog submit, we take a more in-depth take a look at how Google approaches multimodal foundational fashions. The content material coated on this weblog submit is drawn from the important thing strategies and insights of Google’s current papers, for which we offer references on the finish of this text.

Why Ought to You Care

Multimodal foundational fashions are thrilling, however why must you care? Chances are you’ll be:

an AI/ML practitioner who desires to meet up with the most recent analysis growth of the sector, however you don’t have the persistence to undergo dozens of recent papers and a whole lot of pages of surveys.
a present or rising trade chief who’s questioning what’s subsequent after giant language fashions, and is considering the way to align what you are promoting with the brand new developments within the tech world.
a curious reader who might find yourself being the buyer of present or future multimodal AI merchandise, and desires to get a visible and intuitive understanding of how issues work behind the scenes.

For all of the above audiences, this text will present a superb overview to jump-start your understanding of multimodal foundational fashions, which is a nook stone for future extra accessible and useful AI.

Yet one more factor to notice earlier than we dive in: when folks discuss multimodal foundational fashions, they typically imply the enter is multimodal, consisting of textual content, photographs, movies, indicators, and so forth. The output, nonetheless, is all the time simply textual content. The…

Google’s Newest Approaches to Multimodal Foundational Mannequin | by Eileen Pangu | Aug, 2023

Multimodal foundational fashions are much more thrilling than giant language fashions. Let’s overview Google analysis’s current progress to have a glimpse of the bleeding edge.

New Technology Revolutionizes Insect Research

Open Source AI Has Founders—and the FTC—Buzzing

You Don't Understand AI Until You Watch THIS

Think Deepfakes Aren’t a Risk? Check Out This AI Video of Biden Flinging Slurs at His Enemies

Leak Shows That Google-Funded AI Video Generator Runway Was Trained on Stolen YouTube Content, Pirated Films

Study Finds That AI Is Adding to Employees’ Workload and Burning Them Out

New Technology Revolutionizes Insect Research

Open Source AI Has Founders—and the FTC—Buzzing

Think Deepfakes Aren’t a Risk? Check Out This AI Video of Biden Flinging Slurs at His Enemies

Leak Shows That Google-Funded AI Video Generator Runway Was Trained on Stolen YouTube Content, Pirated Films

Study Finds That AI Is Adding to Employees’ Workload and Burning Them Out

When AI Is Trained With AI-Generated Data, It Starts Spouting Gibberish

Bind AI Copilot (www.getbind.co)

Forensic Analysis Finds Overwhelming Similarities Between OpenAI’s Voice and Scarlett Johansson

WriteText.ai for WooCommerce (writetext.ai)

World’s Largest Radiology AI Marketplace CARPL Raises $6 Million to Accelerate the Adoption of AI in Clinical Workflows

Google for Startups Accelerator: AI First MENA-T

Grasp Spark: Optimize File Dimension & Partitions

Accessing Your Private Knowledge. The Intensive and Typically Stunning Knowledge… | by Jeff Braun | Aug, 2023

Multimodal foundational fashions are much more thrilling than giant language fashions. Let’s overview Google analysis’s current progress to have a glimpse of the bleeding edge.

Log In

With social network:

Or with username:

Sign In

Forgot password?

Your password reset link appears to be invalid or expired.

Log in

Privacy Policy

Add to Collection

No Collections