in

Gound gen AI outputs in Google Search and enterprise data


Today, we are announcing the preview of Ground with Google Search in Vertex AI. Businesses can now augment Gemini models with Google Search grounding, and can easily integrate the enhanced model into their AI agents.

When it comes to enterprise data, we offer multiple ways for businesses to ground model responses in enterprise data sources by leveraging retrieval augmented generation, or RAG. RAG helps improve the accuracy of model outputs by using vectors and embeddings to gather facts from relevant data sources.

Vertex AI includes not only a Google-quality, out-of-box RAG solution, but also a variety of component APIs for building bespoke retrieval, ranking, and document processing systems that enable enterprises to easily ground foundation models in their own data. 

For organizations that need embeddings-based information retrieval, Vertex AI offers powerful vector search capabilities. Today, to enhance vector search, we are excited to announce the preview of our hybrid search feature, which integrates vector-based and keyword-based search techniques to ensure relevant and accurate responses for users.

Besides these, customers can connect models to Google databases like AlloyDB and BigQuery for the contextual retrieval of operational data and analytics like purchase preferences, rewards, basket analysis, interaction history, and more. To enable actions and transactions, we provide a host of data connectors to help businesses connect their models to enterprise applications like Workday, Salesforce, ServiceNow, Hadoop, Confluence, and JIRA to access the latest data on customer interactions and internal knowledge updates like issue tracking, program management, and employee records. 

With a comprehensive approach to grounding that covers web search, enterprise data, and third-party enterprise applications, businesses can ensure that their models will deliver enterprise truth – wherever it is hosted.

How more sources of truth create more value 

Let’s walk through an example to show how grounding lets organizations integrate sources of enterprise truth and create more helpful AI agents. 

Suppose an athletic brand wants to create an AI agent to help customers find and purchase shoes.

If the company just puts an interface atop a foundation model API, they won’t accomplish much. The resulting app would be able to discuss shoes generally, based on its training knowledge, but it wouldn’t have particular expertise in the brand’s shoes or any awareness of new footwear trends that emerged after its training cutoff date. 

With grounding in Google Search, the shoe brand’s app can become much more functional, able to search the web for fresh information. However, it wouldn’t have insight into the brand’s internal data such as product information, inventory levels, and manufacturing timelines, nor would it be able to call functions for transactions — so the shoe company would still be dealing with a basic and rather limited agent. 

To cross this chasm, the company also needs to connect its gen AI models to enterprise data sources via RAG mechanisms, so the agent can ground its advice in the specificity and factuality of internal documents and databases. 

Imagine an advanced, proactive version of the shoe-recommending agent, with access to the full spectrum of aforementioned search, databases, and analytics. It would be able to observe patterns like the customer’s last several purchases all having green stripes. It would remember the earlier chat in which the customer said they dislike shoes that squeak on hardwood floors, and then go about reviewing customer reviews to purge squeakiness from its recommendations. It would also generate tables on the fly so the customer can more easily compare options, and it would know up-to-date inventory and shipping information to help execute transactions. With the right grounding in enterprise truth, the sky’s the limit — and so is the value the agent can create.

Enterprise truth: fueling gen AI innovation across businesses

Generative AI adoption isn’t just about access to capable models. It’s also about grounding foundation models in first-party data and high-quality external sources — and using these connections to steer model behavior, creating more accurate, relevant, and factual generative AI experiences for businesses to offer their customers, partners, and employees. 

With access to high-quality and relevant data, models can power experiences that move beyond traditional passive applications, giving rise to the next generation of AI agents grounded in enterprise truth. That’s the future we are rapidly moving towards. Backed by our commitment to this journey with our customers, we’re excited to help make the outputs of today’s agents factual, relevant, and actionable.

To learn more about Google Cloud’s AI news at Next, check out our Vertex AI Agent Builder announcement.


https://storage.googleapis.com/gweb-cloudblog-publish/images/Next24_Blog_blank_2-03.max-2500x2500.jpg

Gen AI and app development tools and partnerships

https://storage.googleapis.com/gweb-cloudblog-publish/images/Next24_Blog_Images_6-04.max-2500x2500.jpg

Google Cloud Gemini, Image 2, and MLOps updates