Latest news with #LargeLanguageModel

Yahoo
7 hours ago
- Business
- Yahoo
Tachyum Radically Cuts the Cost of DeepSeek by Quantizing it to 2-bits
LAS VEGAS, June 03, 2025--(BUSINESS WIRE)--Tachyum® today announced the release of a new white paper detailing how it efficiently scales Large Language Model (LLM) training and inference through the Mixture of Experts (MoE) approach. The company's method is further improved by a DeepSeekMoE architecture with 4-bit FP4 data types for activations quantization and 2-bit Tachyum AI (TAI2) sparse weights quantization. The white paper, "Tachyum Successfully Quantized DeepSeek LLM to its 2-bit TAI2," illustrates how Tachyum integrates MoE with low-bit data formats to unlock scalable AI with unmatched efficiency. The combination allows for the development of more powerful models while significantly lowering resource requirements. MoEs can match the performance of dense models using approximately 4 times less computing and memory bandwidth, while only memory capacity needs to be increased by approximately 4 times. It is expected that that ratio will continue to grow. This architecture benefits from Tachyum's proprietary high-performance memory, eliminating the need for costly high-bandwidth memory (HBM) solutions. Successfully quantizing DeepSeek LLM to 2-bit TAI2 further doubles benefit of DeepSeekMoE LLM compared to other architectures. Tachyum's AI researchers applied FP4 activation quantization and 2-bit TAI2 sparse weights quantization to DeepSeekMoE and Llama 3.1 models. Benchmark testing demonstrated up to 25x faster inference speeds and a 20x cost reduction per token, marking a major leap in LLM deployment efficiency. "DeepSeek approach has shown the potential to make next-generation models 10 times more efficient at today's costs, avoiding the exponential scaling challenges faced by organizations today," said Dr. Radoslav Danilak, founder and CEO of Tachyum. "With the Prodigy platform, we're enabling this kind of breakthrough efficiency for AI applications at global scale." The white paper also emphasizes the critical role of Tachyum's hardware in facilitating this transformation, showcasing the Prodigy Universal Processor's ability to support high-efficiency AI workloads with industry-leading performance. As a Universal Processor offering industry-leading performance for all workloads, Prodigy-powered data center servers can seamlessly and dynamically switch between computational domains (such as AI/ML, HPC, and cloud) with a single homogeneous architecture. By eliminating the need for expensive dedicated AI hardware and dramatically increasing server utilization, Prodigy reduces CAPEX and OPEX significantly while delivering unprecedented data center performance, power, and economics. Prodigy integrates 256 high-performance custom-designed 64-bit compute cores to deliver up to 18x the highest performing GPU for AI applications, 3x the performance of the highest-performing x86 processors for cloud workloads, and up to 8x that of the highest performing GPU for HPC. Those interested in reading the "Tachyum Successfully Quantized DeepSeek LLM to its 2-bit TAI2" white paper can visit to download. Follow Tachyum About Tachyum Tachyum is transforming the economics of AI, HPC, public and private cloud workloads with Prodigy, the world's first Universal Processor. Prodigy unifies the functionality of a CPU, a GPU, and a TPU in a single processor to deliver industry-leading performance, cost and power efficiency for both specialty and general-purpose computing. As global data center emissions continue to contribute to a changing climate, with projections of their consuming 10 percent of the world's electricity by 2030, the ultra-low power Prodigy is positioned to help balance the world's appetite for computing at a lower environmental cost. Tachyum received a major purchase order from a US company to build a large-scale system that can deliver more than 50 exaflops performance, which will exponentially exceed the computational capabilities of the fastest inference or generative AI supercomputers available anywhere in the world today. Tachyum has offices in the United States, Slovakia and the Czech Republic. For more information, visit View source version on Contacts Mark SmithJPR Communications818-398-1424marks@ Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data
Yahoo
15 hours ago
- Automotive
- Yahoo
Storm Reply and Audi Set New Standards for Cloud Management with Agentic AI
TURIN, Italy, June 03, 2025--(BUSINESS WIRE)--Storm Reply, the Reply Group company specialising in innovative cloud-based solutions and services, is supporting Audi's Cloud Foundation Services team in enhancing its cloud management activities through the introduction of Devbot, a digital assistant powered by Agentic AI technologies. Designed to optimise the use of cloud infrastructure, Devbot automates the handling of recurring support requests. Preparing to deploy a system where Audi employees receive personalized recommendations in real time, while ensuring the secure and efficient use of cloud services across the organization. Unlike traditional chatbots, Devbot is built on a multi-agent architecture, combining specialised AI agents for different tasks. The platform acts proactively by identifying security vulnerabilities in the cloud infrastructure, suggesting validated Infrastructure-as-Code (IaC) components for the rapid development of new environments, and providing actionable recommendations to optimise cloud costs. To develop this digital assistant, Storm Reply combined state-of-the-art technologies within the AWS Cloud. The Large Language Model (LLM) Claude enables users to interact through natural language, while a specialised Retrieval-Augmented Generation (RAG) process, developed by Storm Reply, ensures that all responses are based on verifiable and up-to-date information. Tailored access controls further protect sensitive data, guaranteeing that only authorised users have access. The success of the digital assistant demonstrates the significant potential of Agentic AI when integrated into existing business processes to reduce repetitive and error-prone manual tasks. With Devbot, Audi's Cloud Foundation Services team can now focus on shaping future-proof cloud strategies, while the other departments benefit from faster, more tailored support. Storm ReplyStorm Reply is specialized in the design and implementation of innovative Cloud-based solutions and services. Through consolidated expertise in the creation and management of Infrastructure as a Service (IaaS), Software as a Service (SaaS), and Platform as a Service (PaaS) Cloud solutions, Storm Reply supports important companies in Europe and all over the world in the implementation of Cloud-based systems and applications. Storm Reply is AWS Premier Consulting Partner. AudiThe Audi Group is one of the most successful manufacturers of automobiles and motorcycles in the premium and luxury segment. The brands Audi, Bentley, Lamborghini, and Ducati produce at 21 locations in 12 countries. Audi and its partners are present in more than 100 markets worldwide. In 2024, the Audi Group delivered 1.7 million Audi vehicles, 10,643 Bentley vehicles, 10,687 Lamborghini vehicles, and 54,495 Ducati motorcycles to customers. In the 2024 fiscal year, Audi Group achieved a total revenue of €64.5 billion and an operating profit of €3.9 billion. As of December 31, more than 88,000 people worked for the Audi Group, more than 55,000 of them at AUDI AG in Germany. With its attractive brands and numerous new models, the group is systematically pursuing its path toward becoming a provider of sustainable, fully networked premium mobility. ReplyReply [EXM, STAR: REY, ISIN: IT0005282865] specialises in the design and implementation of solutions based on new communication channels and digital media. As a network of highly specialised companies, Reply supports major European industrial groups in the telecom and media; industry and services; banking and insurance and public sectors in defining and developing business models enabled by the new paradigms of AI, cloud computing, digital media and the internet of things. Reply's services include: consulting, system integration and digital services. View source version on Contacts Media contact: Reply Fabio Tel. +39 0117711594 Sandra Tel. +49 170 4546229 Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data


Business Wire
15 hours ago
- Automotive
- Business Wire
Storm Reply and Audi Set New Standards for Cloud Management with Agentic AI
TURIN, Italy--(BUSINESS WIRE)-- Storm Reply, the Reply Group company specialising in innovative cloud-based solutions and services, is supporting Audi's Cloud Foundation Services team in enhancing its cloud management activities through the introduction of Devbot, a digital assistant powered by Agentic AI technologies. Designed to optimise the use of cloud infrastructure, Devbot automates the handling of recurring support requests. Preparing to deploy a system where Audi employees receive personalized recommendations in real time, while ensuring the secure and efficient use of cloud services across the organization. Unlike traditional chatbots, Devbot is built on a multi-agent architecture, combining specialised AI agents for different tasks. The platform acts proactively by identifying security vulnerabilities in the cloud infrastructure, suggesting validated Infrastructure-as-Code (IaC) components for the rapid development of new environments, and providing actionable recommendations to optimise cloud costs. To develop this digital assistant, Storm Reply combined state-of-the-art technologies within the AWS Cloud. The Large Language Model (LLM) Claude enables users to interact through natural language, while a specialised Retrieval-Augmented Generation (RAG) process, developed by Storm Reply, ensures that all responses are based on verifiable and up-to-date information. Tailored access controls further protect sensitive data, guaranteeing that only authorised users have access. The success of the digital assistant demonstrates the significant potential of Agentic AI when integrated into existing business processes to reduce repetitive and error-prone manual tasks. With Devbot, Audi's Cloud Foundation Services team can now focus on shaping future-proof cloud strategies, while the other departments benefit from faster, more tailored support. Storm Reply Storm Reply is specialized in the design and implementation of innovative Cloud-based solutions and services. Through consolidated expertise in the creation and management of Infrastructure as a Service (IaaS), Software as a Service (SaaS), and Platform as a Service (PaaS) Cloud solutions, Storm Reply supports important companies in Europe and all over the world in the implementation of Cloud-based systems and applications. Storm Reply is AWS Premier Consulting Partner. Audi The Audi Group is one of the most successful manufacturers of automobiles and motorcycles in the premium and luxury segment. The brands Audi, Bentley, Lamborghini, and Ducati produce at 21 locations in 12 countries. Audi and its partners are present in more than 100 markets worldwide. In 2024, the Audi Group delivered 1.7 million Audi vehicles, 10,643 Bentley vehicles, 10,687 Lamborghini vehicles, and 54,495 Ducati motorcycles to customers. In the 2024 fiscal year, Audi Group achieved a total revenue of €64.5 billion and an operating profit of €3.9 billion. As of December 31, more than 88,000 people worked for the Audi Group, more than 55,000 of them at AUDI AG in Germany. With its attractive brands and numerous new models, the group is systematically pursuing its path toward becoming a provider of sustainable, fully networked premium mobility.


Business Wire
6 days ago
- Business
- Business Wire
CloudWalk Earns Fraud Prevention Certification from Brazil's National Confederation of Financial Institutions
SíO PAULO--(BUSINESS WIRE)--CloudWalk, the financial technology company behind InfinitePay and has been awarded the Fraud Prevention Certification by the Brazilian National Confederation of Financial Institutions (CNF). The certification recognizes institutions that adopt best practices for detecting, preventing, and raising awareness around fraud — a proactive industry standard for ensuring operational integrity. CloudWalk's proprietary anti-fraud system merges machine learning with Large Language Model (LLM)-powered agents to flag suspicious activity, shut down organized digital crime, and block fraudulent account creation. In 2024 alone, it helped prevent an estimated R$15 billion in fraud. This 12-month CNF certification falls under the organization's 2025 initiative cycle. 'While fraud rings have started using AI to launch new attack strategies, we've built a defense system that's constantly evolving — able to analyze hundreds of millions of transactions autonomously,' said Alan Dias, Chief Risk and Compliance Officer at CloudWalk. 'This certification celebrates our team's tireless work to protect the more than 4 million customers who rely on us.' How It Works CloudWalk's system monitors transactions 24/7, detects fraud in real time, and adapts to new threats. With an accuracy rate above 99%, the technology works with minimal human input. It not only spots fraud — it learns from each case, getting better over time at protecting customer and partner assets. 'It's an incredibly reliable system. Just 0.3% of fraud decisions require a second look from a human,' Dias added. According to CNF, this certification reinforces a company's commitment to user security and proves it meets key market standards and regulations. 'Certification shows a clear and proactive stance in protecting the financial market against fraud,' reads a statement on CNF's website. Recognition for AI Innovation CloudWalk's fraud-fighting AI has already won industry acclaim. In both 2023 and 2024, the company received awards in the digital security categories of the Mastercard Excellence Program. It also ranked in Silverguard's monthly list of the 'Top 10 Anti-Fraud Companies Fighting Fake and Fraudulent Accounts.' Thanks to these security investments, CloudWalk brand InfinitePay saw zero listings in the Brazilian Central Bank's financial sector complaints ranking in Q4 2024. These results reflect the company's ongoing focus on safe operations, outstanding service, and long-term customer success. CloudWalk has also launched a dedicated security webpage, centralizing digital safety information and tools to help customers protect themselves from fraud and scams.


CNA
25-05-2025
- CNA
Multilingual India's race to build own Large Language Model easier said than done
India is gearing up for what it hopes will be its own ChatGPT moment. The country is building its own Large Language Model that may one day rival OpenAI's chatbot. But in a country of countless languages and dialects, this is easier said than done. Ishan Garg reports from New Delhi.