Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

Sorry, you do not have permission to ask a question, You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please type your username.

Please type your E-Mail.

Please choose an appropriate title for the post.

Please choose the appropriate section so your post can be easily searched.

Please choose suitable Keywords Ex: post, video.

Browse

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

Querify Question Shop: Explore Expert Solutions and Unique Q&A Merchandise

Querify Question Shop: Explore Expert Solutions and Unique Q&A Merchandise Logo Querify Question Shop: Explore Expert Solutions and Unique Q&A Merchandise Logo

Querify Question Shop: Explore Expert Solutions and Unique Q&A Merchandise Navigation

  • Home
  • About Us
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • About Us
  • Contact Us
Home/ Questions/Q 6652

Querify Question Shop: Explore Expert Solutions and Unique Q&A Merchandise Latest Questions

Author
  • 60k
Author
Asked: November 27, 20242024-11-27T08:31:08+00:00 2024-11-27T08:31:08+00:00

How to Get Audio Transcriptions from Whisper without a File System

  • 60k

Whisper is OpenAI's intelligent speech-to-text transcription model. It allows developers to enter audio and an optional styling prompt, and get transcribed text in response.

However, the official OpenAI Node.js SDK API docs only show one way to use Whisper – reading an audio file with fs.

async function main() {   const transcription = await openai.audio.transcriptions.create({     file: fs.createReadStream("audio.mp3"),     model: "whisper-1",   });    console.log(transcription.text); } 
Enter fullscreen mode Exit fullscreen mode

That works fine if you have static files… but in any consumer application, we'll be processing data from an end-user client such as an app or web browser. To receive audio from thousands of users and save it as files is a major waste of disk space and a huge ineffeciency. Plus, serverless deployment is extremely popular today, and in a serverless environment we usually don't have persistent file storage. I wrote this article because it was surprisingly hard to figure out how to achieve audio transcription without saving the audio as a file first.

How to use Whisper without files

On the client-side, you'll need to get your audio into a Base64 encoded string. I'm using the library “@ricky0123/vad-react” for this purpose, which comes with utilities to accomplish that:

onSpeechEnd: (audio) => {       const wavBuffer = utils.encodeWAV(audio);       const base64 = utils.arrayBufferToBase64(wavBuffer);       const audioUrlAsData = `${base64}`;       // chose POST here with a payload to ensure the Base64 string doesn't violate the max length of a URL       fetch("/api/transcribe", {         method: "POST",         body: JSON.stringify({ audioData: audioUrlAsData }),        }) } 
Enter fullscreen mode Exit fullscreen mode

Then on the server-side, the trick is to create a buffer from the base64 data and use the undocumented toFile function from OpenAI's library.

import OpenAI, { toFile } from "openai";  const openai = new OpenAI({   apiKey: process.env.OPENAI_API_KEY, });  export default async function handler(   req,   res ) {   try {     // Extract Base64 encoded data from the request     const bodyData = JSON.parse(req.body);     const base64Audio = bodyData.audioData;      // Decode Base64 to binary     const audioBuffer = Buffer.from(base64Audio, "base64");      // Use OpenAI API to transcribe the audio     const transcription = await openai.audio.transcriptions.create({       file: await toFile(audioBuffer, "audio.wav", {         contentType: "audio/wav",       }),       model: "whisper-1",     });      // Send the transcription text as response     res.json({ transcription: transcription.text });   } catch (error) {     console.error("Error during transcription:", error);     res.status(500).send("Error during transcription");   } } 
Enter fullscreen mode Exit fullscreen mode

Voila! Through this process, you can use Whisper without saving audio from every user as static files, allowing it to be used in a serverless environment.

javascriptopenaiwebdev
  • 0 0 Answers
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

Sidebar

Ask A Question

Stats

  • Questions 4k
  • Answers 0
  • Best Answers 0
  • Users 2k
  • Popular
  • Answers
  • Author

    Insights into Forms in Flask

    • 0 Answers
  • Author

    Kick Start Your Next Project With Holo Theme

    • 0 Answers
  • Author

    Refactoring for Efficiency: Tackling Performance Issues in Data-Heavy Pages

    • 0 Answers

Top Members

Samantha Carter

Samantha Carter

  • 0 Questions
  • 20 Points
Begginer
Ella Lewis

Ella Lewis

  • 0 Questions
  • 20 Points
Begginer
Isaac Anderson

Isaac Anderson

  • 0 Questions
  • 20 Points
Begginer

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help

Footer

Querify Question Shop: Explore Expert Solutions and Unique Q&A Merchandise

Querify Question Shop: Explore, ask, and connect. Join our vibrant Q&A community today!

About Us

  • About Us
  • Contact Us
  • All Users

Legal Stuff

  • Terms of Use
  • Privacy Policy
  • Cookie Policy

Help

  • Knowledge Base
  • Support

Follow

© 2022 Querify Question. All Rights Reserved

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.