Nvidia launches fully open source transcription AI model Parakeet-TDT-0.6B-V2 on Hugging Face

Business Mayor, 05-05-2025

Nvidia has become one of the most valuable companies in the world in recent years, thanks to the stock market noticing how much demand there is for graphics processing units (GPUs), the powerful chips Nvidia makes that render graphics in video games and, increasingly, train large language and diffusion AI models.
But Nvidia does far more than make hardware and the software to run it. As the generative AI era wears on, the Santa Clara-based company has also been steadily releasing more of its own AI models, mostly open source and free for researchers and developers to download, modify and use commercially. The latest among them is Parakeet-TDT-0.6B-v2, an automatic speech recognition (ASR) model that can, in the words of Hugging Face's Vaibhav 'VB' Srivastav, 'transcribe 60 minutes of audio in 1 second [mind blown emoji].'
This is the new generation of the Parakeet model Nvidia first unveiled back in January 2024 and updated again in April of that year. Version two currently tops the Hugging Face Open ASR Leaderboard with an average word error rate (WER) of just 6.05%. WER is the number of word-level mistakes (substitutions, deletions and insertions) divided by the number of words actually spoken, so lower is better.
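For readers who want to see how a word error rate is computed, here is a minimal sketch. It assumes a simplified text normalization (lowercasing and splitting on whitespace); the leaderboard's own normalization is more involved, and the function name `word_error_rate` is purely illustrative.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    # Assumption: lowercase + whitespace split stands in for real text normalization.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: a spoken word is missing
                curr[j - 1] + 1,     # insertion: an extra word was transcribed
                prev[j - 1] + cost,  # substitution, or a match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


# One wrong word out of six spoken words.
print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
```

A score of 6.05% on this measure means roughly six word-level errors for every 100 words spoken, averaged across the benchmark's test sets.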
To put that in perspective, it nears proprietary transcription models such as OpenAI's GPT-4o-transcribe (with a WER of 2.46% in English) and ElevenLabs Scribe (3.3%).
And it's offering all this while remaining freely available under a commercially permissive Creative Commons CC-BY-4.0 license, making it an attractive proposition for commercial enterprises and indie developers looking to build speech recognition and transcription services into their paid applications.
The model boasts 600 million parameters and leverages a combination of the FastConformer encoder and TDT decoder architectures.
It is capable of transcribing an hour of audio in just one second, provided it's running on Nvidia's GPU-accelerated hardware.
The performance benchmark is measured at an RTFx (inverse real-time factor, the ratio of audio duration to processing time) of 3386.02 with a batch size of 128, placing it at the top of current ASR benchmarks maintained by Hugging Face.
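The "hour in a second" claim follows from that RTFx figure with simple arithmetic. The sketch below assumes RTFx means seconds of audio processed per second of compute; the helper name `processing_seconds` is hypothetical.

```python
def processing_seconds(audio_seconds: float, rtfx: float) -> float:
    """Estimate processing time for a clip, given a throughput factor (RTFx)."""
    if rtfx <= 0:
        raise ValueError("rtfx must be positive")
    # Assumption: RTFx = audio duration / processing time.
    return audio_seconds / rtfx


one_hour = 60 * 60  # seconds of audio
print(round(processing_seconds(one_hour, 3386.02), 2))  # about 1.06 seconds
```

The figure applies to the benchmark setup (batch size of 128 on Nvidia GPU hardware), so a single file on a smaller card should not be expected to match it.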
Released globally on May 1, 2025, Parakeet-TDT-0.6B-v2 is aimed at developers, researchers, and industry teams building applications such as transcription services, voice assistants, subtitle generators, and conversational AI platforms.
The model supports punctuation, capitalization, and detailed word-level timestamping, offering a full transcription package for a wide range of speech-to-text needs.
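Word-level timestamps are what make use cases like subtitle generation practical. As a rough sketch, the code below groups timestamped words into SubRip (SRT) cues. The input shape, a list of dicts with `word`, `start` and `end` keys in seconds, is an assumption made for illustration, and both function names are hypothetical.

```python
def format_srt_time(seconds: float) -> str:
    """Render seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words, max_words=7):
    """Group timestamped words into numbered SRT cues of up to max_words each."""
    cues = []
    for index in range(0, len(words), max_words):
        chunk = words[index:index + max_words]
        start = format_srt_time(chunk[0]["start"])  # first word's start time
        end = format_srt_time(chunk[-1]["end"])     # last word's end time
        text = " ".join(item["word"] for item in chunk)
        cues.append(f"{len(cues) + 1}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)


sample = [
    {"word": "Hello", "start": 0.0, "end": 0.4},
    {"word": "world.", "start": 0.5, "end": 0.9},
]
print(words_to_srt(sample))
```

Because the model emits punctuation and capitalization itself, the words can be joined as they are, without a separate text clean-up pass.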
Developers can deploy the model using Nvidia's NeMo toolkit. The setup process is compatible with Python and PyTorch, and the model can be used directly or fine-tuned for domain-specific tasks.
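In practice, loading the model through NeMo looks roughly like the sketch below. It assumes the `nemo_toolkit[asr]` package is installed and that the Hugging Face model ID is `nvidia/parakeet-tdt-0.6b-v2`; the wrapper function `transcribe_files` is hypothetical, and the exact return type can vary between NeMo versions.

```python
MODEL_NAME = "nvidia/parakeet-tdt-0.6b-v2"  # assumed Hugging Face model ID


def transcribe_files(paths, with_timestamps=False):
    """Load the pretrained model via NeMo and transcribe a list of audio files."""
    # Imported lazily so this module can be loaded without NeMo installed.
    import nemo.collections.asr as nemo_asr

    # Downloads the checkpoint on first use, then loads it from the local cache.
    model = nemo_asr.models.ASRModel.from_pretrained(model_name=MODEL_NAME)
    return model.transcribe(list(paths), timestamps=with_timestamps)
```

Calling `transcribe_files(["sample.wav"])`, where `sample.wav` is a placeholder path, should return one result per file. In recent NeMo releases the transcript is exposed as a `.text` attribute on each result, though that detail should be checked against the installed version.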
The open-source license (CC-BY-4.0) also allows for commercial use, making it appealing to startups and enterprises alike.
Parakeet-TDT-0.6B-v2 was trained on a diverse and large-scale corpus called the Granary dataset. This includes around 120,000 hours of English audio, composed of 10,000 hours of high-quality human-transcribed data and 110,000 hours of pseudo-labeled speech.
Sources range from well-known datasets like LibriSpeech and Mozilla Common Voice to YouTube-Commons and Librilight.
Nvidia plans to make the Granary dataset publicly available following its presentation at Interspeech 2025.
The model was evaluated across multiple English-language ASR benchmarks, including AMI, Earnings22, GigaSpeech, and SPGISpeech, and showed strong generalization performance. It remains robust under varied noise conditions and performs well even with telephony-style audio formats, with only modest degradation at lower signal-to-noise ratios.
Parakeet-TDT-0.6B-v2 is optimized for Nvidia GPU environments, supporting hardware such as the A100, H100, T4, and V100 boards.
While high-end GPUs maximize performance, the model can still be loaded on systems with as little as 2GB of RAM, allowing for broader deployment scenarios.
Nvidia notes that the model was developed without the use of personal data and adheres to its responsible AI framework.
Although no specific measures were taken to mitigate demographic bias, the model passed internal quality standards and includes detailed documentation on its training process, dataset provenance, and privacy compliance.
The release drew attention from the machine learning and open-source communities, especially after being publicly highlighted on social media. Commentators noted the model's ability to outperform commercial ASR alternatives while remaining fully open source and commercially usable.
Developers interested in trying the model can access it via Hugging Face or through Nvidia's NeMo toolkit. Installation instructions, demo scripts, and integration guidance are readily available to facilitate experimentation and deployment.
