logo
Microsoft's New Compact 1-Bit LLM Needs Just 400MB of Memory

Microsoft's New Compact 1-Bit LLM Needs Just 400MB of Memory

Yahoo22-04-2025

Microsoft's new large language model (LLM) puts significantly less strain on hardware than other LLMs—and it's free to experiment with. The 1-bit LLM (1.58-bit, to be more precise) uses -1, 0, and 1 to indicate weights, which could be useful for running LLMs on small devices, such as smartphones. Microsoft put BitNet b1.58 2B4T on Hugging Face, a collaboration platform for the AI community.
'We introduce BitNet b1.58 2B4T, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale,' the Microsoft researchers wrote. 'Trained on a corpus of 4 trillion tokens, the model has been rigorously evaluated across benchmarks covering language understanding, mathematical reasoning, coding proficiency, and conversational ability.'
The keys to b1.58 2B4T are the performance and efficiency it provides. Where other LLMs often use 16-bit (or 32-bit) floating-point formats. The weights (parameters) are expressed using just the three values (-1, 0,1). Although this isn't the first BitNet of its kind, its size makes it unique. As TechRepublic points out, this is the first 2 billion-parameter, 1-bit LLM.
Credit: Microsoft
An important goal when developing LLMs for less-powerful hardware is to reduce the model's memory needs. In the case of b1.58 2B4T, it requires only 400MB, a dramatic drop from previous record holders, like Gemma 3 1B, which uses 1.4GB.
'The core contribution of this work is to demonstrate that a native 1-0bit LLM, when trained effectively at scale, can achieve performance comparable to leading open-weight, full-precision models of similar size across a wide range of tasks,' the researchers wrote in the report.
One thing to keep in mind is that BitNet b1.58 2B4T only works on Microsoft's own bitnet.cpp system, instead of other traditional frameworks. Training the LLM requires three steps, or phases. The first is pre-training is broken into several of its own steps and (in the case of the researchers' testing) involves 'synthetically generated mathematical data,' along with data from large web crawls, educational web pages, and other 'publicly available' text.
The next phase is supervised fine-tuning (SFT). Researchers used WildChat for conversational training. The last phase, direct preference optimization (DPO), is meant to improve the AI's conversational skills and to put it in sync with your target audience's preferences.

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Microsoft Warns 750 Million Windows Users As Upgrades Stop
Microsoft Warns 750 Million Windows Users As Upgrades Stop

Forbes

time2 hours ago

  • Forbes

Microsoft Warns 750 Million Windows Users As Upgrades Stop

New upgrade warning has just hit. Getty Images Microsoft has responded days after news that its critical Windows 11 upgrades have essentially stopped. October's deadline for 750 million users to upgrade to Windows 11 or lose security updates is fast approaching. And that latest data is alarming. As spotted by Windows Central, 'Microsoft has threatened Windows 10 users to upgrade to be on the 'right side of risk'.' And while 'it's accurate that Windows 11 will be more secure than an unsupported Windows 10, some feel the ad is too aggressive.' The comes after major PC maker Asus also warned Windows 10 PCs are 'magnets for security threats,' with the cybersecurity nightmare of hundreds of millions of PCs suddenly falling off support on October 14 now beginning to come true. Windows 10 versus Windows 11 (global share) Statcounter The comments below the YouTube video reflect the usual frustration with warnings to upgrade to Windows 11. 'Using the end of support for your old operating system as a selling point for your new one is actually insane,' says one, while another reflects the bigger issue: 'casually saying like everyone has enough money to buy a new laptop.' Windows 11 upgrade process had seemed to be getting on track, with several months of steady progress seeming likely to continue. But that is not the case. A month ago, Windows 11 was within 10% of Windows 11 globally for the first time, albeit that still left more than half of all users on Windows 10. That has changed and is getting worse. Windows 10 versus Windows 11 (U.S. share) Statcounter Statcounter's data at the end of May shows Windows 11 market share has declined globally and in the U.S., the key market where it has the largest relative share. After four months of progress the other way, this reversal sees Windows 10 holding stubbornly above 50%, with Windows 11 around 10% behind. Per those YouTube comments, the issue for at least 240 million of those 750 million Windows 10 users is that they need to upgrade their PCs as current ones are not eligible for the free upgrade. That's not cheap and is a factor. That said, most Windows 10 users can upgrade but are choosing not to — at least not yet. And time is running out.

These 3 Nuclear Stocks Should Be on Your Energy Radar
These 3 Nuclear Stocks Should Be on Your Energy Radar

Yahoo

time3 hours ago

  • Yahoo

These 3 Nuclear Stocks Should Be on Your Energy Radar

Nuclear energy stocks have been on a tear again after U.S. President Donald Trump signed executive orders that will facilitate the expansion of nuclear energy production, including expediting the regulatory approvals for new nuclear reactors. The Trump administration intends to reform the nuclear energy sector by overhauling the Nuclear Regulatory Commission (NRC), allowing the DoE to build nuclear reactors on federally-owned land, enhancing research at the U.S. Department of Energy and expanding domestic uranium mining and enrichment. And, Big Tech companies are seizing this opportunity to secure cheap, abundant power supplies for their power-hungry AI data centers. Shares of America's leading nuclear power plant operator, Constellation Energy Corp. (NYSE:CEG), have surged more than 15% after the company unveiled on Tuesday an agreement to sell more than 1,100 MW of nuclear power to Meta Platforms (NASDAQ:META) from its Illinois nuclear plant for 20 years. According to The Wall Street Journal, the deal is the first deal of its kind for an operating nuclear plant in the United States, and closely mirrors a similar deal Constellation signed with Microsoft Corp. (NASDAQ:MSFT) last year. The Microsoft deal is a 20-year power purchase agreement (PPA) that will see Constellation Energy restart its undamaged reactor in Three Mile Island, which was undergoing deal will draw power from the main grid. However, Meta appears to have secured a better deal, with Citi's Ryan Levine estimating that the 20-year PPA is priced in the $70-$95/MWh range, considerably cheaper than Jefferies' estimate of at least $110/MWh for Microsoft's PPA, because Meta's deal '…does not offer a substantial premium for low-carbon nuclear power'. Levine has projected that ~70% of Constellation's existing nuclear plants could secure comparable datacenter deals at ~$80/MWh. Constellation is unlikely to be the only nuclear power producer that will see surging power demand under a Trump administration that refuses to put a premium on low-carbon energy. Nuclear stocks have mostly taken a breather after a scorching rally triggered by Russia's war in Ukraine. However, here are 3 nuclear stocks with significant upside. Denison Mines Corp. Consensus Price Target: $4.04 Implied 12- Month Upside Potential: 148% Denison Mines Corp.(NYSE:DNN) engages in the exploration, acquisition and development of uranium properties in Canada. Denison has become a Wall Street favorite, with BMO analyst Alexander Pearce saying the stock's price-to-net present value ratio of 0.9x is one of the most attractive in its group, with clear near-term catalysts. Denison boasts one of the sector's strongest balance sheets, critical for funding modest capital requirements for its 2.2M lbs Phoenix In-Situ Uranium Recovery project. Last month, Denison reported Q1 2024 revenue of C$1.38M, good for +66.3% Y/Y growth while quarterly loss of $0.03 per share missed the Wall Street consensus by $0.01. The company achieved ~75% completion of total engineering for Phoenix, and has committed $67 million for long-lead capital purchases. NexGen Energy Consensus Price Target: $12.85 Implied 12- Month Upside Potential: 102% NexGen Energy Ltd. (NYSE:NXE), is a Canadian exploration and development stage company that develops uranium properties in Canada. The company holds a 100% interest in the Rook I project in southwestern Athabasca Basin of Saskatchewan, totaling an area of ~35,065 hectares. Back in March, NXE shares surged after the company revealed that recent drilling at its Rook I site intersected a rich uranium concentration at its Patterson Corridor East property, the largest development-stage uranium deposit in Canada. According to the company, drillhole RK-25-232 unveiled rich uranium concentration, making it one of the shallowest high-grade intersections at Patterson Corridor. "Discovering mineralization of this intensity so early in our 2025 program outpaces the success pattern experienced at the Arrow deposit," CEO Leigh Curyer said. Paladin Energy Consensus Price Target: $5.08 Implied 12-Month Upside Potential: 21.5% Paladin Energy Ltd (ASX:PDN TSX: PDN OTCQX:PALAF) is an independent uranium developer with a 75% stake in Namibia's Langer Heinrich Mine. Last year, Paladin acquired Canada's Fission Uranium Corp., with the company now operating an extensive portfolio of uranium assets across Canada. Paladin is positioning itself as a significant player in baseload energy provision in multiple countries across the globe and contributing to global decarbonization. Last month, Paladin reported Q3 revenue of $60.97M and GAAP EPS of $0.06. Uranium sales for the quarter were 872,000 pounds, at an average price of $69.90 per pound. The Langer Heinrich property produced 745,000 pounds of uranium, good for a 17% increase on the previous quarter's production to bring total production to over 2 million pounds in the financial year-to-date. By Alex Kimani for More Top Reads From this article on Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store