in

Understanding Speech Recognition using OpenAI's Whisper Model



With the recent release of OpenAI’s Whisper model there has been a huge shift in the field of speech recognition. This new system was able to beat humans in translating and transcribing the audio samples even in very noisy conditions. In this talk we’ll explore the paper, model card, and code to learn how to get started with OpenAI’s Whisper model.

Vishal Rajput works as an AI-Vision Engineer for a drone company where he heads up AI development. He’s authored several research papers and has 5 years of experience in Deep Learning. He is also a 2 time top-50 writer on Medium in the Artificial Intelligence category.

You can find a summary of this talk in the recap blog post:
https://voxel51.com/blog/computer-vision-meetup-feb-2023-recap/

Join the Computer Vision Meetup friendliest to your timezone by scrolling to the bottom of this page:
https://www.meetup.com/pro/computer-vision-meetups/

Recorded on Feb 9, 2023 at the virtual Computer Vision Meetup.

#computervision #machinelearning #datascience #ai #speechrecognition

The Next Leap In AI: OpenAI Launches Custom ChatGPT

Monitoring Large Language Models in Production using OpenAI & WhyLabs