Deploying a TFLite Model on GCP Serverless | by Vishal Rajput | Jul, 2023

How to deploy a quantized model in a serverless fashion

11 min read

23 hours ago

Model deployment is hard. With the constantly changing landscape of cloud platforms and other AI-related libraries updating almost weekly, backward compatibility and finding the right deployment strategy are a big challenge. In today’s blog post, we will see how to deploy a TFLite model on the Google Cloud Platform in a serverless fashion.

This blog post is structured in the following way:

Understanding serverless and other ways of deployment
What is quantization and TFLite?
Deploying a TFLite model using the GCP Cloud Run API

Img Src: https://pixabay.com/photos/man-pier-silhouette-sunrise-fog-8091933/

Let’s first understand what we mean by serverless, because serverless doesn’t mean without a server.

An AI model, or any application for that matter, can be deployed in several different ways, with three major categorisations.

Serverless: In this case, the model is stored on the cloud container registry and only runs when a user makes a request. When a request is made, a server instance is automatically launched to fulfil the user request, and it shuts down after a while. Starting, configuring, scaling, and shutting down are all taken care of by the Cloud Run API provided by the Google Cloud Platform. AWS Lambda and Azure Functions are the alternatives in other clouds.
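To make the request-driven idea concrete, here is a minimal sketch of the kind of HTTP service that could sit inside such a container. It is not the author's code. It assumes only that Cloud Run tells the container which port to listen on through the `PORT` environment variable. The names `load_model`, `get_model`, `handle_payload`, and `serve` are hypothetical, and the doubling function is a placeholder for a real TFLite interpreter.

```python
import json
import os
from http.server import BaseHTTPRequestHandler, HTTPServer

_model = None  # cached so the model is loaded once per container instance


def load_model():
    """Hypothetical stand-in for loading a TFLite model.

    Returns a placeholder callable that doubles every input value.
    """
    return lambda values: [2 * v for v in values]


def get_model():
    """Load the model on first use and reuse it for later requests."""
    global _model
    if _model is None:
        _model = load_model()
    return _model


def handle_payload(body: bytes) -> bytes:
    """Turn a JSON request body into a JSON response body."""
    data = json.loads(body)
    outputs = get_model()(data["inputs"])
    return json.dumps({"outputs": outputs}).encode("utf-8")


class Handler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        response = handle_payload(self.rfile.read(length))
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(response)


def get_port() -> int:
    """Read the listening port from PORT, falling back to 8080 locally."""
    return int(os.environ.get("PORT", "8080"))


def serve() -> None:
    """Start the server; blocks until the process is stopped."""
    HTTPServer(("0.0.0.0", get_port()), Handler).serve_forever()
```

Calling `serve()` starts the server. Because the model is cached in a module-level variable, only the first request handled by a freshly started instance pays the loading cost.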

Serverless has its own advantages and disadvantages.

The biggest advantage is the cost saving: if you don’t have a large user base, the server sits idle most of the time, and your money is spent for no reason. Another advantage is that we don’t need to think about scaling the infrastructure. Depending on the load on the server, it can automatically replicate the number of instances and handle the traffic.
In the disadvantage column, there are three things to consider. It has a small payload limit, meaning it cannot be used to run a bigger model. Secondly, the server automatically shuts down after 15 min of idle time, so when we make a request after a long time, the first requests take a lot…
