Understanding Speech Recognition using OpenAI's Whisper Model

The recent release of OpenAI’s Whisper model marked a major shift in the field of speech recognition. The system is reported to rival human performance at transcribing and translating audio, even in very noisy conditions. In this talk we’ll explore the paper, model card, and code to learn how to get started with OpenAI’s Whisper model.

Vishal Rajput works as an AI-Vision Engineer for a drone company, where he heads up AI development. He has authored several research papers and has five years of experience in deep learning. He is also a two-time top-50 writer on Medium in the Artificial Intelligence category.

You can find a summary of this talk in the recap blog post:
https://voxel51.com/blog/computer-vision-meetup-feb-2023-recap/

Join the Computer Vision Meetup closest to your time zone by scrolling to the bottom of this page:
https://www.meetup.com/pro/computer-vision-meetups/

Recorded on Feb 9, 2023 at the virtual Computer Vision Meetup.

#computervision #machinelearning #datascience #ai #speechrecognition
