
Latest news with #GPT-OSS

OpenAI GPT-OSS Models Optimized for NVIDIA RTX GPUs

Geeky Gadgets

3 days ago

  • Geeky Gadgets

OpenAI GPT-OSS Models Optimized for NVIDIA RTX GPUs

NVIDIA and OpenAI have collaborated to release the gpt-oss family of open-weight AI models, optimized for NVIDIA RTX GPUs. The two models, gpt-oss-20b and gpt-oss-120b, bring advanced AI capabilities to consumer PCs and professional workstations, with faster and more efficient on-device performance.

The models feature a mixture-of-experts architecture, extended context lengths, and support for a range of AI applications. Developers and enthusiasts can run them through tools such as Ollama and Microsoft AI Foundry Local.

Key Highlights of GPT-OSS Models

Two Models, Tailored for Performance

The easiest way to test these models on RTX AI PCs, on GPUs with at least 24GB of VRAM, is the new Ollama app. Ollama is fully optimized for RTX, making it well suited to consumers who want to try personal AI on their PC or workstation.

The gpt-oss family consists of two models, each tailored to specific hardware requirements and performance needs:

  • gpt-oss-20b: Designed for consumer-grade NVIDIA RTX GPUs with at least 16GB of VRAM, such as the RTX 5090. This model achieves processing speeds of up to 250 tokens per second, making it suitable for individual developers and small-scale projects.
  • gpt-oss-120b: Optimized for professional-grade RTX PRO GPUs, this model targets enterprise and research environments that need more computational power and scalability.

Both models support context lengths of up to 131,072 tokens, allowing them to handle complex reasoning tasks and process long documents. This is particularly useful for legal document analysis, academic research, and other tasks that require long-form comprehension and detailed analysis.

Technological Innovations Driving Efficiency

The gpt-oss models incorporate several advances that improve their performance and functionality:

  • MXFP4 Precision: The gpt-oss models are the first to support this precision format on NVIDIA RTX GPUs. MXFP4 improves computational efficiency and reduces resource consumption while maintaining output accuracy.
  • Mixture-of-Experts (MoE) Architecture: Only the components of the model needed for a given task are activated, which minimizes computational overhead while maintaining high performance. This keeps resource use efficient, particularly for complex or specialized tasks.
  • Chain-of-Thought Reasoning: The models perform step-by-step logical analysis, which improves their ability to follow instructions and solve intricate problems. This makes them more effective in real-world applications such as troubleshooting, decision-making, and problem-solving.

Together, these innovations let the models deliver fast, accurate results across a variety of use cases, making them versatile tools for developers and organizations alike.

Versatile Applications and Use Cases

The gpt-oss models are designed to support a wide range of applications and industries. Key use cases include:

  • Web Search and Information Retrieval: The models can process and summarize large amounts of information, making them a good fit for search engines and knowledge management systems.
  • Coding Assistance: Developers can use the models for code generation, debugging, and optimization, streamlining software development workflows.
  • Document Comprehension: With their extended context lengths, the models can analyze lengthy documents such as legal contracts, research papers, and technical manuals.
  • Multimodal Input Processing: The ability to handle both text and image inputs broadens their applicability, allowing tasks like image captioning, data analysis, and content generation.

The customizable context lengths allow users to tailor the models to specific requirements, whether summarizing extensive documents or generating detailed responses to complex queries. This adaptability makes the gpt-oss models suitable for both general-purpose use and specialized applications, from enterprise workflows to individual projects.
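
As a rough illustration of the mixture-of-experts idea described under Technological Innovations, the toy sketch below routes an input to only the top-scoring experts and blends their outputs. It is a generic top-k gating example, not the actual gpt-oss implementation: the expert count, the value of k, the router scores, and the function names (`route_top_k`, `moe_layer`) are all assumptions made for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_logits, k):
    """Pick the k highest-scoring experts and normalize their weights.

    Returns a list of (expert_index, weight) pairs. Ties keep the
    lower index first because Python's sort is stable.
    """
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))


def moe_layer(x, experts, router_logits, k=2):
    """Evaluate only the selected experts and combine their outputs."""
    out = 0.0
    for idx, weight in route_top_k(router_logits, k):
        out += weight * experts[idx](x)  # unselected experts never run
    return out


# Hypothetical toy experts: each one just scales its input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
# Hypothetical router scores for one input token.
router_logits = [0.1, 2.0, -1.0, 2.0]

chosen = route_top_k(router_logits, k=2)
result = moe_layer(1.0, experts, router_logits, k=2)
print(chosen, result)
```

With four experts and k=2, only half of the expert functions are called for this input, which is the source of the reduced computational overhead the article describes.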
Developer Tools for Seamless Integration

To support adoption and integration, OpenAI and NVIDIA have provided a suite of developer tools. These resources simplify deployment and testing of the gpt-oss models and keep them accessible to developers of varying expertise levels. Key tools include:

  • Ollama App: An intuitive interface for running and testing the models on NVIDIA RTX GPUs, allowing quick experimentation and deployment.
  • Framework: An open-source framework that supports collaboration and optimization, allowing developers to fine-tune the models for specific hardware configurations.
  • Microsoft AI Foundry Local: A set of command-line tools and software development kits (SDKs) designed for Windows developers, allowing integration into existing workflows.

These tools let developers experiment with advanced AI solutions without requiring extensive expertise in AI infrastructure.

NVIDIA's Role in Advancing AI

The gpt-oss models were trained on NVIDIA H100 GPUs, using NVIDIA's AI training infrastructure. Once trained, the models are optimized for inference on NVIDIA RTX GPUs. This approach delivers high-performance AI on both cloud-based and local devices, making advanced AI accessible to a broader audience.

Additionally, the models use CUDA Graphs, a feature that minimizes computational overhead and improves performance. This optimization is particularly valuable for real-time applications, where speed and efficiency are critical.
Open-Source Collaboration and Community Impact

The gpt-oss models are open-weight, allowing developers to customize and extend their capabilities. This openness encourages innovation and collaboration within the AI community and allows the development of tailored solutions for specific use cases.

NVIDIA has also contributed to open-source frameworks such as GGML, further enhancing the accessibility and performance of the gpt-oss models. These frameworks give developers the tools needed to optimize AI models for a variety of hardware configurations, from consumer-grade PCs to enterprise-level systems.

Empowering the Future of AI Development

The release of the gpt-oss models marks a pivotal moment in the evolution of AI technology. Running on NVIDIA RTX GPUs, these models deliver strong performance, flexibility, and accessibility. Their open-weight nature, combined with robust developer tools, positions them as valuable assets for driving innovation across a wide range of applications. Whether for individual developers or large organizations, the gpt-oss models offer a practical and efficient way to advance AI-driven projects.

OpenAI Brings Open-Source GPT AI Models That Are Powerful But Can Run On Your Phones

News18

3 days ago

  • Business
  • News18

OpenAI Brings Open-Source GPT AI Models That Are Powerful But Can Run On Your Phones

OpenAI has announced new open-source models that can assist with agentic AI tools and can be customised as needed.

The AI news continues to pile up this week, and OpenAI is making a big wave with its latest development. While the world waits for the GPT-5 launch, the company has released its first-ever GPT-OSS models, in 120b and 20b versions. The company is offering these models for everyone to download, customise and adopt in their own way. Companies like Amazon and Microsoft have already set up their systems to run the new models, and we expect more people to gradually put them to the test.

OpenAI Goes Open With New GPT Models: What It Offers

OpenAI claims these models have been optimised to run on hardware like phones and laptops. It is also using models like o3 to help the new ones mature and learn faster. The GPT-OSS-120b model is the more advanced of the two, with benchmark scores close to those of the o4-mini model. The GPT-OSS-20b is much lighter and can run on devices with around 16GB of GPU memory.

The flashier ChatGPT features, like generating images and videos, are not the focus of these new versions. Instead, OpenAI is building toward more agentic AI services that can be integrated with web search, and the models' customisability gives them more room to learn and reason with the systems.

OpenAI is close to releasing GPT-5, but it seems the company also has one more big weapon in its plans. New reports say ChatGPT could get a new premium version priced lower than the Plus plan. The new ChatGPT Go subscription tier could allow OpenAI to add more paid users to its network and give more people the chance to try out the popular features without spending big on the AI chatbot. With ChatGPT Go priced below the 'Plus' version, you are looking at a sub-$15 (Rs 1,200 approx) per month plan, which should offer fewer features than the Plus and Pro variants.
First Published: August 07, 2025, 08:17 IST

Elon Musk says xAI will open source Grok 2 next week, days after OpenAI launches two new open-weight AI models

Time of India

4 days ago

  • Business
  • Time of India

Elon Musk says xAI will open source Grok 2 next week, days after OpenAI launches two new open-weight AI models

Elon Musk, founder of xAI and known for his outspoken views on artificial intelligence, announced that the company will release the Grok 2 chatbot as open source next week. Musk made the announcement on X (formerly Twitter), stating: 'It's high time we open sourced Grok 2. Will make it happen next week.' The move places xAI in direct contrast with industry peers that have been shifting toward more proprietary AI model strategies.

OpenAI launches GPT-OSS models with limited access

Musk's announcement follows OpenAI's launch of two new AI reasoning models, GPT-OSS-120b and GPT-OSS-20b. Though these models are being referred to as 'open-weight,' they are not fully open source. OpenAI has made only the trained parameters (or weights) available, while keeping source code, training data, and development methodologies private. This reflects OpenAI's ongoing transition from a previously open-source approach to a more commercially focused, closed system. The company now primarily offers its technologies via paid APIs, benefiting enterprise users and developers while retaining control of core model architecture.

xAI reiterates commitment to transparency

Unlike OpenAI, Musk and xAI are continuing with a more transparent model strategy. Musk clarified that xAI will continue to open source previous versions as new ones are developed. 'As we create the next version, we open source the prior version, as we did with Grok 1 when Grok 2 was released,' he stated. This follows Musk's earlier commitment, made in October 2024, that Grok models would be shared openly. By sticking to this approach, xAI is positioning itself as a promoter of open innovation in a field that is becoming increasingly proprietary.

Industry divides on open access

The contrasting approaches of xAI and OpenAI represent a broader debate within the AI sector. OpenAI's CEO Sam Altman recently acknowledged that the company may have been 'on the wrong side of history' regarding open source. This comment signals the internal conflicts faced by companies attempting to balance innovation with commercial interests. In contrast, xAI's open-source model could encourage collaboration and lower entry barriers for developers and researchers. It may also push the industry toward more accessible tools and increase the pace of AI advancement.

Grok 2 release may set a new industry benchmark

The open-sourcing of Grok 2 could influence how other companies shape their AI strategies. By making its model publicly available, xAI invites broader community involvement, potentially leading to faster iterations and wider applications. This openness may also help build trust in AI technologies at a time when transparency is a key concern for developers. With Grok 2's open-source release, xAI challenges the current industry norm, reigniting the debate over transparency and innovation in AI.

OpenAI releases a free GPT model that can run on your laptop

The Verge

4 days ago

  • Business
  • The Verge

OpenAI releases a free GPT model that can run on your laptop

OpenAI is releasing a new open-weight model dubbed GPT-OSS that can be downloaded for free, be customized, and even run on a laptop.

The model comes in two variants: 120-billion-parameter and 20-billion-parameter versions. The bigger version can run on a single Nvidia GPU and performs similarly to OpenAI's existing o4-mini model, while the smaller version performs similarly to o3-mini and runs on just 16GB of memory. Both model versions are being released today via platforms like Hugging Face, Databricks, Azure, and AWS under the Apache 2.0 license, which allows them to be widely modified for commercial purposes.

This is OpenAI's first open-weight model in over six years; the previous one predates the debut of ChatGPT by years. Until earlier this year, CEO Sam Altman cited safety concerns as the main reason for not releasing a follow-up. Meanwhile, developers have flocked to open models due to their lower cost and customizability. In January, after the rise of DeepSeek, Altman said that OpenAI had 'been on the wrong side of history' by not releasing its own open models.

Now, OpenAI is reasserting itself with an open-weight model that it says can perform reasoning tasks, browse the web, write code, and operate agents via the company's existing APIs.

'I think a lot of people are actually surprised to know that the vast majority of our customers are already using a lot of open models,' Chris Cook, an OpenAI researcher, said during a media briefing. 'We wanted to plug that gap and allow them to use our technology across the board.'

On the safety front, OpenAI says that GPT-OSS is its most rigorously tested model to date, and that it was tested with external safety firms to ensure it doesn't pose risks in areas like cybersecurity and biological weapons. The model's chain of thought, or visible process used to arrive at an answer, is shown 'to monitor model misbehavior, deception and misuse,' according to a company press release. Its output is text-only and, like all of OpenAI's models, GPT-OSS's training data is undisclosed.

OpenAI hasn't shared benchmarks comparing GPT-OSS to other open models like Llama, DeepSeek, or Google's Gemma. Both variants of GPT-OSS perform similarly to OpenAI's closed reasoning models on coding tasks and tests like Humanity's Last Exam.

'These are incredible models,' said OpenAI co-founder Greg Brockman. 'The team really cooked with this one.'

OpenAI isn't committing to a release schedule for future versions of GPT-OSS, but it hopes that the model will be used by smaller developers and companies that want more control over how their data is used.

'We've always believed that if you lower the barrier to access, then innovation just goes up,' said Brockman. 'You let people hack, then they will do things that are incredibly surprising.'

OpenAI's release of open-source models seen as challenge to China's lead in the field

South China Morning Post

4 days ago

  • Business
  • South China Morning Post

OpenAI's release of open-source models seen as challenge to China's lead in the field

US artificial intelligence powerhouse OpenAI on Tuesday launched two open-source AI models, a move that is expected to challenge China's dominance in the field.

Sam Altman, co-founder and chief executive at the San Francisco-based firm, said in a post that the two new open models, the GPT-OSS-120b and GPT-OSS-20b, were 'a big deal', adding that the company believed them to be 'the best and most usable open models in the world'.

The launch came as more Chinese open-source AI models, including the Qwen family developed by Alibaba Cloud, the AI and cloud computing unit of Alibaba Group Holding, are gaining global popularity. Alibaba owns the South China Morning Post.

OpenAI is also expected to launch its GPT-5 model soon. Bloomberg News reported that OpenAI was in early talks about a potential stock sale for current and former employees that would value the company at about US$500 billion.

According to OpenAI, the performance of the two open-source models was comparable to o4-mini and o3-mini, two of the smaller models in the Microsoft-backed firm's o-series of reasoning models.

Photo: Screens displaying the logos of DeepSeek and OpenAI, in an illustration photograph taken on January 29, 2025. (AFP)

OpenAI's change of direction signals a return to the company's origins (its last open model was in 2019) in what is seen as a challenge to the flurry of Chinese open-source models. However, analysts said China still leads the field in terms of the sheer number of open models on the market.
