logo
#

Latest news with #HuggingFace

Your Business' AI Has A Digital Immune System—Here's How To Protect It
Your Business' AI Has A Digital Immune System—Here's How To Protect It

Forbes

time8 hours ago

  • Business
  • Forbes

Your Business' AI Has A Digital Immune System—Here's How To Protect It

Anand Oswal, SVP & GM of Network Security, Palo Alto Networks. In the last couple of years, artificial intelligence (AI) has transformed from a lab concept into the central nervous system of modern business, powering everything from developer tools to customer support. Now, with the rise of autonomous AI agents that can learn, reason and act on our behalf, we are not just using a new tool; we are introducing a new, dynamic entity into our corporate bodies. This represents a profound architectural shift. The old three-tier application stack was like a simple organism. The move to cloud computing modernized this system, but the fundamental constructs were similar. But the AI stack—with its models, new uses of data for training and inferencing, plugins and autonomous agents—is a whole new level of biological complexity. It can learn, adapt and interact in ways that are often unpredictable, which creates an entirely new landscape for potential attacks. Just like any highly evolved organism, this new digital nervous system is born vulnerable. It doesn't yet have the mature, battle-hardened immune system needed to defend against a new generation of sophisticated threats. To truly protect it, we need an equally sophisticated, built-in defense mechanism. Business leaders I talk to every day are keenly aware of this challenge. They see the immense promise and competitive advantage that AI offers, but they're also grappling with the hidden risks that come with adopting it. A common concern is the lack of visibility into which AI apps are running, what data they're accessing and what permissions they have. To navigate this new world confidently, security can no longer be a simple shield; it must evolve to be a comprehensive, adaptive immune system. This means building a defense system that is deeply integrated, intelligent and flexible enough to tell the difference between helpful and harmful actions, neutralize threats in real time and continuously learn from every interaction to become stronger and more resilient over time. Just as our own biological immune systems have evolved over millennia to protect us from countless unseen threats, our approach to cybersecurity must also evolve dramatically for the age of AI. Here's what that essential digital immune system needs to do: 1. Know Your Genetic Makeup: Proactive Scanning and Posture A healthy immune system starts with understanding its own body. For an AI ecosystem, this means having a complete, dynamic inventory of all components and actively screening every component before it enters your ecosystem. Start by integrating automated model scanning directly into your development pipeline. Before any open-source model is downloaded (from a repository like Hugging Face), it must be scanned for embedded malware, unsafe code within files or signs of model poisoning. To maintain the equivalent of a "healthy lifestyle," you will need to enforce strong posture management, monitoring your entire AI ecosystem for risks like excessive permissions or sensitive data exposure. This ensures your AI doesn't develop chronic vulnerabilities. You will also want to ensure that automated alerts notify you if, for example, a customer service chatbot attempts to access a sensitive financial database or if a developer grants an AI model overly broad permissions that violate the principle of least privilege. 2. Build Resilience Through Exposure: AI Red Teaming An immune system strengthens itself through controlled exposure, and your AI is no different. Go beyond theoretical assessments and implement an automated AI red teaming program that acts as a training ground against real-world attacks. You can begin by using established frameworks, such as the OWASP Top 10 for large language models, to simulate common attacks like prompt injection and data leakage. Take it to the next level by expanding your tests to include more than just technical risks, but also business-specific threats. For example, could an agent be tricked into issuing unauthorized refunds? Could a compromised internal tool manipulate a report for an executive? You must create a direct feedback loop where the findings from these red team exercises are used to immediately fine-tune models, update security filters and provide targeted training for your developers. 3. The Real-Time Immune Response: Runtime And Agent Security When an active threat emerges, the immune response must be swift and immediate. A key step is to implement protections at runtime, capable of analyzing the prompt intent and responses to block sophisticated injection attacks that built-in protections would miss. For autonomous agents, the stakes are even higher since agents have memory and the authority to use other tools. A truly robust digital immune system must be able to distinguish a healthy agent from one that has been hijacked, neutralizing its ability to cause harm from within. To do this, enforce the principle of least privilege by scoping out the tools and APIs each agent can access. For example, an agent designed to schedule meetings should have no ability to read emails. An additional safeguard would be ensuring you can automatically disable an agent if it is compromised or begins behaving erratically, like performing an unusual number of actions or making unusual data updates. Also, be sure to log every action taken by an agent, from the prompt it received to the tools it used and the output it generated, to ensure you can conduct effective forensic analysis if an incident occurs. Forging A Path In The AI Era To gain a competitive edge in today's fast-moving market, businesses must embrace AI's transformative power. But they simply can't do this with a compromised digital immune system. Relying on fragmented security tools that act merely as simple, isolated shields is no longer sufficient. What's needed is a holistic and integrated security approach that protects every single AI model, every dataset and every autonomous agent across the entire AI lifecycle, from development to deployment and ongoing operation. By proactively building and maintaining this comprehensive digital immune system, organizations not only protect their enterprise from evolving threats but also empower themselves to confidently navigate the future, innovate at a faster pace and lead with assurance in the exciting and complex age of AI. Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives. Do I qualify?

Positron AI Secures $51.6 Million in Oversubscribed Series A to Accelerate Inference-Optimized Hardware
Positron AI Secures $51.6 Million in Oversubscribed Series A to Accelerate Inference-Optimized Hardware

Business Wire

timea day ago

  • Business
  • Business Wire

Positron AI Secures $51.6 Million in Oversubscribed Series A to Accelerate Inference-Optimized Hardware

RENO, Nev.--(BUSINESS WIRE)--Positron AI, the premier company for American-made semiconductors and inference hardware, today announced the close of a $51.6 million oversubscribed Series A funding round, bringing its total capital raised this year to over $75 million. The round was led by Valor Equity Partners, Atreides Management and DFJ Growth. Additional investment came from Flume Ventures, which includes tech icon Scott McNealy, Resilience Reserve, 1517 Fund and Unless. This new funding will support the continued deployment of Positron's first-generation product, Atlas, and accelerate the rollout of its second-generation products in 2026. With global tech firms projected to spend over $320 billion on AI infrastructure in 2025, enterprises face intensifying cost pressures, power ceilings and chronic shortages of NVIDIA GPUs. Positron's purpose-built alternative delivers cost and efficiency advantages that come from specialization. The company is currently shipping its first-generation product, Atlas, which delivers 3.5x better performance-per-dollar and up to 66% lower power consumption than NVIDIA's H100. Unlike general-purpose GPUs, Atlas is designed solely to accelerate and serve generative AI applications. 'The early benefits of AI are coming at a very high cost – it is expensive and energy-intensive to train AI models and to deliver curated results, or inference, to end users. Improving the cost and energy efficiency of AI inference is where the greatest market opportunity lies, and this is where Positron is focused,' said Randy Glein, co-founder and managing partner at DFJ Growth. 'By generating 3x more tokens per watt than existing GPUs, Positron multiplies the revenue potential of data centers. Positron's innovative approach to AI inference chip and memory architecture removes existing bottlenecks on performance and democratizes access to the world's information and knowledge.' Positron Atlas's memory-optimized FPGA-based architecture achieves 93% bandwidth utilization, compared to the typical 10–30% in GPU-based systems, and supports up to half a trillion-parameter models in a single 2-kilowatt server. It's fully compatible with Hugging Face transformer models and serves inference requests through an OpenAI API compatible endpoint. Atlas is powered by chips fabricated in the U.S. and is already deployed in production environments, enabling LLM hosting, generative agents and enterprise copilots with significantly lower latency and reduced hardware overhead. 'Memory bandwidth and capacity are two of the key limiters for scaling AI inference workloads for next-generation models,' said Dylan Patel, founder and CEO of SemiAnalysis, and an advisor and investor in Positron. SemiAnalysis is a leading research firm specializing in semiconductors and AI infrastructure that provides detailed insights into the full compute stack. 'Positron is taking a unique approach to the memory scaling problem, and with its next-generation chip, can deliver more than an order of magnitude greater high-speed memory capacity per chip than incumbent or upstart silicon providers.' Capital-Efficient Execution and Technical Depth Positron was co-founded in 2023 by CTO Thomas Sohmers and Chief Scientist Edward Kmett, with former Lambda COO Mitesh Agrawal joining as CEO to scale the company's commercial operations. In just 18 months, the team brought Atlas to market with only $12.5 million in seed funding. They validated performance, landed early enterprise customers and hardened the product in deployment environments before raising this Series A. Now, with growing adoption and a clear product roadmap, Positron is developing custom ASICs to unlock the next level of performance, power efficiency and deployment scale for inference. 'We founded Positron to meet the demands of modern AI: aiming to run the frontier models at the lowest cost per token generation with the highest memory capacity of any chip available,' said Mitesh Agrawal, CEO of Positron AI. 'Our highly optimized silicon and memory architecture allows for superintelligence to be run in a single system with our target of running up to 16-trillion-parameter models per system on models with tens of millions of tokens of context length or memory-intensive video generation models.' Early Enterprise Traction and Strategic Deployment Positron's first publicly announced customers include Parasail (with SnapServe) and Cloudflare, alongside additional deployments within other major enterprises and leading neocloud providers. A New Standard for American AI Infrastructure With the Series A closed, Positron is now advancing its next-generation system, engineered specifically for large-scale frontier model inference. Titan, the follow-on to Atlas powered by Positron's 'Asimov' custom silicon, will feature up to two terabytes of directly attached high-speed memory per accelerator, allowing for up to 16-trillion-parameter models to be run on a single system, and massively expanding context limits for the world's largest models. Titan will support parallel hosting of multiple agents or models, removing the traditional 1:1 model-to-GPU constraint that limits efficiency. Its over-provisioned, inference-tuned networking architecture will ensure low-latency, high-throughput performance even under heavy concurrency. With a standard data center form factor and no need for exotic cooling, the system is designed for seamless integration into existing infrastructures. 'We have passed on the overwhelming majority of AI accelerator startups we have diligenced over the last 6 years as most of them were mounting frontal assaults on NVIDIA that were unlikely to succeed. Positron has carefully chosen a defensible niche in low-cost inference. More importantly, they have proven that their software works before developing an ASIC: Positron is running competitively priced production inference workloads today on 2022 era FPGAs in a server of Positron's own design. This speaks to the quality of their software stack, their system-level expertise and the judgment of their management team,' said Gavin Baker, managing partner and chief investment officer of Atreides Management. About Positron AI Positron AI is transforming generative AI inference with energy-efficient, high-performance compute systems built entirely in the United States. Headquartered in Reno, Nevada, with a remote-first team distributed across the country, Positron's unique hardware architecture offers the lowest total cost of ownership for transformer models by solving the power, memory, and scalability bottlenecks of legacy infrastructure.

The first look: Disrupt 2025 AI Stage revealed
The first look: Disrupt 2025 AI Stage revealed

TechCrunch

timea day ago

  • Business
  • TechCrunch

The first look: Disrupt 2025 AI Stage revealed

At TechCrunch Disrupt 2025, the AI Stage comes alive with the voices shaping the future of technology, creativity, and security. Top VCs reveal what it takes to win funding in a rapidly evolving landscape, as they place bets on the next wave of AI founders. Hear groundbreaking insights from leaders at visionary startups like Apptronik, ElevenLabs, Hugging Face, Runway, Wonder Dynamics, Writer, and Wayve — pioneers redefining the future of tech. These leaders, and many more below, represent the cutting edge of AI's growing impact. On the AI Stage at Disrupt, get an inside look at how breakthroughs are transforming industries, user experiences, and what startups and investors must know to stay ahead. More sessions and speakers are coming soon. In the meantime, secure your ticket for Disrupt 2025 and get ready to lean into dynamic conversations happening on the AI Stage, along with four other industry stages — Space, Builders, Going Public, and Disrupt — from October 27–29. Save up to $675 before prices jump after July. Register here to save big. Introducing the AI Stage lineup Betting on the Next Wave: What VCs Want in AI Startups Speakers to be announced. From model infrastructure to niche applications, AI is producing a new breed of founders and a new set of investor expectations. In this candid conversation, top VCs share what's catching their eye (and what's not), how they're thinking about defensibility in a world of AI monopolies, and what founders need to show to get that next term sheet. Creative Machines and Where AI Meets Imagination Nikola Todorovic, co-founder and CEO, Wonder Dynamics; and more speakers to be announced AI is no longer just optimizing workflows; it's co-creating art, media, and experiences in ways we're only beginning to understand. Wonder Dynamics CEO Nikola Todorovic joins a panel of creative technologists to explore how AI is reshaping the creative process, blurring the lines between artist and algorithm, and opening up new frontiers for storytellers, designers, and dreamers alike. Techcrunch event Tech and VC heavyweights join the Disrupt 2025 agenda Netflix, ElevenLabs, Wayve, Sequoia Capital — just a few of the heavy hitters joining the Disrupt 2025 agenda. They're here to deliver the insights that fuel startup growth and sharpen your edge. Don't miss the 20th anniversary of TechCrunch Disrupt, and a chance to learn from the top voices in tech — grab your ticket now and save up to $675 before prices rise. Tech and VC heavyweights join the Disrupt 2025 agenda Netflix, ElevenLabs, Wayve, Sequoia Capital — just a few of the heavy hitters joining the Disrupt 2025 agenda. They're here to deliver the insights that fuel startup growth and sharpen your edge. Don't miss the 20th anniversary of TechCrunch Disrupt, and a chance to learn from the top voices in tech — grab your ticket now and save up to $675 before prices rise. San Francisco | REGISTER NOW AUSTIN, TEXAS – MARCH 14: Nikola Todorovic speaks onstage at 'Featured Session: Understanding the Role of AI in Reshaping the Film & Television Industry' during the 2023 SXSW Conference and Festivals at Austin Convention Center on March 14, 2023 in Austin, Texas. (Photo byfor SXSW) Image Credits:Diego Donamaria / Getty Images Writing the Future with AI? May Habib, co-founder and CEO, Writer What happens when AI learns to write with purpose, personality, and persuasion? Writer CEO May Habib joins us to talk about the evolving relationship between language and machines and what the rise of generative content means for the future of brand, business, and beyond. Why the Next Frontier Is Search Edo Liberty, founder and CEO, Pinecone In a world overflowing with data, finding what matters is everything. Pinecone founder Edo Liberty unpacks why infrastructure, not algorithms, might be the biggest unlock in AI, and what's coming next in the race to power smarter applications at scale. Intelligence in Motion and the Future of Physical AI Jeff Cardenas, co-founder and CEO, Apptronik; and Raquel Urtasun, founder and CEO, Waabi AI in the physical world hasn't had its ChatGPT moment…yet. Waabi CEO Raquel Urtasun and Apptronik CEO Jeff Cardenas join us to explore and demonstrate what it takes to bring intelligence into motion, whether it's behind the wheel or on two legs. From simulation to sensors to scaling safely, this panel explores the breakthroughs driving the next generation of physical machines. Image Credits:Apptronik/Mercedes Building Intelligence for Modern Defense Ethan Thornton, founder and CEO, Mach Industries From stealth mode to center stage, Mach Industries is bringing AI into one of the world's most complex and controversial sectors: defense. CEO Ethan Thornton joins us to talk about what it takes to build in high-stakes environments, where speed and autonomy matter most, and why next-gen infrastructure starts with rethinking the fundamentals. Driving Intelligence Alex Kendall, co-founder and CEO, Wayve From self-driving cars to self-learning systems, Alex Kendall is rethinking how machines perceive and act in the world. The Wayve CEO joins us to explore how real-world autonomy is shaping the next chapter of AI, and why breakthroughs on the road may unlock progress far beyond it. From Ads to Films: Creating with Code Alejandro Matamala Ortiz, co-founder and chief design officer, Runway Creatives aren't being replaced, they're being rearmed. Alejandro Matamala Ortiz, co-founder of Runway, shares how creative work is being reshaped by machine learning, what AI-native tools mean for visual storytelling, and why this is just the beginning of a new creative era. Shaping the AI Stack Thomas Wolf, co-founder and CSO, Hugging Face From models and datasets to ethics and infrastructure, Hugging Face is helping define what building responsibly with AI actually looks like. Co-founder and CSO Thomas Wolf joins us to talk about the shifting power dynamics in the AI ecosystem, the rise of community-led innovation, and what it takes to stay open while moving fast. Synthetic Voices and Real Impact Mati Staniszewski, co-founder and CEO, ElevenLabs From audiobooks to avatars, synthetic speech is having a moment. ElevenLabs is helping lead the charge. CEO Mati Staniszewski joins us to explore what it takes to build AI that speaks like us and how voice technology is reshaping the creative industries, accessibility, and entertainment. LONDON, ENGLAND – JUNE 04: Mati Staniszewski speaks onstage during 'The AI Voice Revolution' panel discussion on day three of SXSW London 2025 at the Truman Brewery on June 04, 2025 in London, England. (Photo byfor SXSW London) Image Credits:Jeff Spicer / Getty Images Love, Lies & Algorithms: The Truth About AI in Matters of the Heart Mark Kantor, head of product and Innovation, Match Group; Eugenia Kuyda, CEO, Replika; and more speakers to be announced. AI is changing the way we meet, match, and fall in love, sometimes in ways we don't even notice. This panel explores how technology is reshaping modern relationships, for better or worse. From dating apps to digital soulmates, we'll look at where things are headed and what that means for the human heart. AI and National Security in the High-Stakes Race to Innovate Sri Chandrasekar, managing partner, Point72; Justin Fanelli, CTO, The Navy; and Kathleen Fisher, director, DARPA From defense labs to Wall Street and naval operations, AI is reshaping how countries protect themselves and project power. DARPA's Kathleen Fisher, Point72's Sri Chandrasekar, and Navy CTO Justin Fanelli dive into the cutting-edge AI breakthroughs driving security innovation. They'll discuss what it means for entrepreneurs, investors, and the future of global stability. Step into the future of AI at Disrupt 2025 Don't miss your chance to be part of the conversation shaping the future of AI at TechCrunch Disrupt 2025. For over two full days, the AI Stage will feature nonstop sessions with visionary founders, top VCs, and industry leaders — alongside four other cutting-edge stages: Space, Builders, Going Public, and Disrupt — from October 27 to 29 in San Francisco's Moscone West. Secure your ticket now to save up to $675 before prices rise after July, and get ready to unlock the next wave of AI innovation redefining technology, creativity, security, and much more.

Everyone's Using AI Wrong : Hugging Face CSO Explains
Everyone's Using AI Wrong : Hugging Face CSO Explains

Geeky Gadgets

time3 days ago

  • Business
  • Geeky Gadgets

Everyone's Using AI Wrong : Hugging Face CSO Explains

What if the way you're using AI is holding you back? Despite the buzz around artificial intelligence, many people are stuck in a cycle of shallow experimentation—dabbling with tools like ChatGPT or Midjourney without truly understanding how to use AI for meaningful impact. The truth is, AI isn't just a novelty; it's a fantastic force reshaping industries, redefining creativity, and altering the job market at breakneck speed. Yet, most of us are missing the mark. Platforms like Hugging Face, for instance, offer powerful tools that could transform how you work, create, and innovate, but they remain underutilized or misunderstood. If you're feeling overwhelmed by the hype or unsure where to start, you're not alone—and that's exactly why this how-to exists. In this guide, Silicon Valley Girl uncover how to move beyond surface-level AI usage and tap into its full potential. You'll learn how to harness open source platforms like Hugging Face to create, automate, and innovate—whether you're a developer, marketer, educator, or artist. We'll explore how AI can amplify your creativity, streamline your workflows, and even future-proof your career in an era of rapid technological change. Along the way, you'll discover actionable strategies and insights that bridge the gap between AI's possibilities and your goals. The key isn't just using AI—it's using it right. Ready to rethink what's possible? Let's explore how to truly win with AI. Unlocking AI's Full Potential The Fantastic Impact of AI on Jobs AI is set to disrupt millions of jobs within the next five years, fundamentally altering the employment landscape. Automation is taking over repetitive tasks, while AI tools are enhancing productivity in creative and technical fields. To remain competitive, you must adapt by mastering AI tools and focusing on areas where human creativity and strategic thinking are indispensable. For example: Marketing professionals are using AI to analyze consumer behavior, predict trends, and optimize campaigns for better engagement. are using AI to analyze consumer behavior, predict trends, and optimize campaigns for better engagement. Designers are using AI to generate innovative concepts, streamline workflows, and enhance visual storytelling. are using AI to generate innovative concepts, streamline workflows, and enhance visual storytelling. Software developers are integrating AI to automate coding, debugging, and testing processes, accelerating project timelines. By embracing these advancements, you can position yourself as a leader in your field, making sure you stay ahead in a rapidly evolving job market. Hugging Face: Providing widespread access to AI for All Hugging Face, a prominent open source AI platform, is transforming access to AI tools and resources. Initially designed for developers, it now offers pre-trained models, datasets, and tools that simplify the creation of AI applications. Its 'Spaces' feature, which enables low-code AI application development, is particularly noteworthy for non-technical users. This accessibility enables individuals and small businesses to innovate without requiring extensive technical expertise. By lowering the barriers to entry, Hugging Face fosters a more inclusive AI ecosystem, allowing you to actively participate in and benefit from the AI revolution. Whether you are a developer or a business owner, platforms like Hugging Face provide the tools to turn ideas into reality. How to Use AI Effectively Watch this video on YouTube. Advance your skills in Artificial Intelligence (AI) by reading more of our detailed content. Open source AI and Licensing: A Collaborative Future Open source AI platforms are transforming how AI is developed and deployed. These platforms allow you to download, modify, and run AI models locally, offering greater privacy, control, and customization. Licensing frameworks, such as MIT and Apache, encourage collaboration while protecting intellectual property. This open approach accelerates innovation and ensures that AI remains accessible to a broader audience. By engaging with open source communities, you can contribute to innovative advancements while benefiting from shared knowledge and resources. Open source AI not only provide widespread access tos technology but also fosters a culture of collaboration and transparency, which is essential for sustainable progress. AI's Role in Education and Creativity AI is transforming education and creativity, making it easier for you to acquire technical skills and explore new creative possibilities. Tools like AI-assisted coding platforms and generative design software enable non-technical individuals to tackle complex projects with ease. For instance: Educators are using AI to personalize learning experiences, tailoring content to individual needs and improving student outcomes. are using AI to personalize learning experiences, tailoring content to individual needs and improving student outcomes. Artists are using AI to experiment with new forms of expression, pushing creative boundaries and redefining artistic possibilities. By fostering adaptability and innovative thinking, these tools prepare you and future generations for an AI-driven future. The integration of AI into education and creative fields highlights its potential to empower individuals and unlock new opportunities. Robotics and AI: A Synergistic Integration The integration of robotics and AI is advancing rapidly, with household robots expected to become mainstream in the near future. Open source robotics platforms are making these technologies more accessible, allowing you to customize robots for specific tasks. Whether automating household chores or enhancing industrial processes, robotics powered by AI is set to transform how we interact with machines. By understanding and using these technologies, you can improve efficiency and quality of life. Robotics and AI together represent a powerful combination that has the potential to transform industries and redefine daily living. Emerging Trends in AI AI models are becoming smaller, more efficient, and increasingly embedded in everyday devices—a trend known as embedded AI. This development enables seamless integration into smartphones, appliances, and other technologies you use daily. Beyond convenience, AI is driving breakthroughs in critical fields such as: Material science: AI-powered simulations are accelerating the discovery of new materials with unique properties. AI-powered simulations are accelerating the discovery of new materials with unique properties. Energy: Predictive models are optimizing energy usage, reducing waste, and advancing renewable energy solutions. Predictive models are optimizing energy usage, reducing waste, and advancing renewable energy solutions. Healthcare: AI is improving patient outcomes through advanced diagnostics, personalized treatments, and drug discovery. These advancements highlight AI's potential to address some of the world's most pressing challenges. By staying informed and engaged, you can contribute to meaningful progress in these fantastic areas. Societal Implications and the Human Element As automation reshapes industries, society must adapt to new realities. Governments and organizations face the challenge of addressing job displacement, ethical concerns, and the need for reskilling while fostering innovation. For you, this means focusing on areas where human skills—such as creativity, empathy, and critical thinking—remain irreplaceable. The rise of automation may also lead to a greater emphasis on creativity and entertainment, opening doors to new forms of expression and connection. By cultivating these uniquely human abilities, you can ensure your relevance in an increasingly automated world. Balancing the use of AI with the cultivation of your unique skills and perspectives ensures that technology serves as a tool for empowerment rather than a replacement for human ingenuity. Thriving in an AI-Driven Future The insights shared by industry leaders, such as Thomas Wolf of Hugging Face, emphasize the importance of embracing AI responsibly while fostering innovation, creativity, and adaptability. By using open source platforms, mastering AI tools, and focusing on areas where human skills excel, you can navigate the challenges and opportunities of an AI-driven future. As AI continues to evolve, its impact will extend far beyond automation, shaping industries, education, and society in profound ways. Understanding these changes and positioning yourself to thrive in this dynamic landscape will be key to success. By combining the power of AI with the enduring value of human connection, you can harness technology to create a future that is both innovative and inclusive. Media Credit: Silicon Valley Girl Filed Under: AI, Technology News, Top News Latest Geeky Gadgets Deals Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, Geeky Gadgets may earn an affiliate commission. Learn about our Disclosure Policy.

Alibaba Unveils Cutting-Edge AI Coding Model Qwen3-Coder - Middle East Business News and Information
Alibaba Unveils Cutting-Edge AI Coding Model Qwen3-Coder - Middle East Business News and Information

Mid East Info

time6 days ago

  • Business
  • Mid East Info

Alibaba Unveils Cutting-Edge AI Coding Model Qwen3-Coder - Middle East Business News and Information

Alibaba has launched Qwen3-Coder, its most advanced agentic AI coding model to date. Designed for high-performance software development, Qwen3-Coder excels in agentic AI coding tasks, from generating new codes and managing complex coding workflows to debugging across entire codebases. Built on a Mixture-of-Experts MoE architecture, this open-sourced model Qwen3-Coder-480B-A35B-Instruct, which has a total of 480 billion parameters but activates 35 billion parameters per token, delivers efficiency without sacrificing performance. The model achieves competitive results against leading state-of-the-art (SOTA) models across key benchmarks in agentic coding, browser use, and tool use. Qwen3-Coder-480B-A35B-Instruct achieves competitive results against leading state-of-the-art (SOTA) models across key benchmarks Additionally, Alibaba is open-sourcing Qwen Code, a powerful command-line interface (CLI) tool that enables developers to delegate engineering tasks to AI using natural language. Optimized with custom prompts and interaction protocols, Qwen Code unlocks the full potential of Qwen3-Coder for real-world agentic programming. The model also supports integration with the Claude Code interface, making it even easier for developers to execute their coding tasks. Trained on an extensive dataset of codes and general text data, Qwen3-Coder is engineered for robust agentic coding. It natively supports a context window of 256K tokens, extendable up to 1 million tokens, enabling it to process vast codebases in a single session. Its superior performance stems not only from scaling across tokens, context length, and synthetic data during pre-training, but also from innovative post-training techniques such as long-horizon reinforcement learning agent RL. This advancement allows the model to solve complex, real-world problems through multi-step interactions with external tools. As a result, Qwen3-Coder achieves SOTA performance among open-source models on SWE-Bench Verified (a benchmark for evaluating AI models' ability to solve real-world software issues), even without test-time or inference scaling. Agentic AI coding is transforming software development by enabling more autonomous, efficient, and accessible programming workflows. With its open-source availability, strong agentic coding capabilities, and seamless compatibility with popular developer tools and interfaces, Qwen3-Coder is positioned as a valuable tool for global developers in software development. The Qwen3-Coder-480B-A35B-Instruct model is now available on Hugging Face and GitHub. Developers can also access the model on Qwen Chat or via cost-effective APIs through Model Studio, Alibaba's generative AI development platform. Qwen-based coding models have already surpassed 20 million downloads globally. Tongyi Lingma, Alibaba Cloud's Qwen-powered coding assistant, will soon be upgraded with Qwen3-Coder's enhanced agentic capabilities. Since its launch in June 2024, Tongyi Lingma's 'AI Programmer' feature—offering code completion, optimization, debugging support, snippet search, and batch unit test generation—has generated over 3 billion lines of code. About Alibaba Cloud: Established in 2009, Alibaba Cloud is the digital technology and intelligence backbone of Alibaba Group. It offers a complete suite of cloud services to customers worldwide, including elastic computing, database, storage, network virtualization services, large-scale computing, security, big data analytics, machine learning and artificial intelligence (AI) services. Alibaba has been named the leading IaaS provider in Asia Pacific by revenue in U.S. dollars since 2018, according to Gartner. It has also maintained its position as one of the world's leading public cloud IaaS service providers since 2018, according to IDC.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store