Latest news with #Janus-Pro-7B

DeepSeek launches Janus-Pro image generator, taking aim at DALL-E 3 superiority

Express Tribune

29-01-2025

Business

DeepSeek, the rapidly rising AI company, has unveiled a new suite of multimodal AI models under the Janus-Pro family, which it claims can outperform OpenAI's DALL-E 3. These models, ranging from 1 billion to 7 billion parameters in size, are available for download via the AI development platform Hugging Face. The Janus-Pro models are licensed under the MIT license, allowing for unrestricted commercial use.

DeepSeek describes Janus-Pro as a 'novel autoregressive framework' capable of both image analysis and creation. According to the company, the largest model in the family, Janus-Pro-7B, outperforms DALL-E 3 and other models such as PixArt-alpha, Emu3-Gen, and Stability AI's Stable Diffusion XL on the AI evaluation benchmarks GenEval and DPG-Bench. Some of these competing models are older, and most Janus-Pro models can only analyze small images (up to 384 x 384 resolution), but the performance of Janus-Pro remains impressive considering its compact design.

DeepSeek believes that Janus-Pro, with its simple yet powerful framework, surpasses previous unified models and challenges task-specific models. This makes it a strong contender in the field of next-generation unified multimodal models.

The company has gained widespread attention after its chatbot app rose to the top of the Apple App Store charts. DeepSeek is funded primarily by High-Flyer Capital Management, a quantitative trading firm. Its language models, developed using compute-efficient methods, have raised questions about the future of AI development, including demand for AI chips, and about whether other nations can challenge U.S. dominance in the AI sector.

NVIDIA's AI Boom: $330B Bet or Bubble? Investors Pour In as Competition Heats Up

Yahoo

28-01-2025

Business

The AI investment boom is just getting started. Tigress Financial Partners sees capital spending on AI and data centers hitting $330 billion this year, up from $250 billion in 2024, and climbing past $400 billion in 2026. That's a massive wave of cash, and NVIDIA (NASDAQ:NVDA) is right in the middle of it. The firm just slapped a $220 price target on the stock, implying an 80% upside from here. And there's reason to believe it's justified: NVIDIA's latest earnings blew past expectations, with Q3 2025 revenue up 94% year-over-year to a record $35.1 billion, fueled by insatiable demand for its AI-dominating GPUs.

But competition is heating up fast. DeepSeek just dropped its Janus-Pro-7B model, turning up the pressure in the AI chip wars. DA Davidson isn't convinced NVIDIA can keep its edge, maintaining a Neutral rating with a $135 price target, far below the bullish calls from UBS and Itau BBA. Retail traders aren't fazed, though. They just poured a record $562.2 million into NVIDIA shares, betting the AI leader will stay on top despite the growing threat from new entrants.

Still, not everyone's convinced this ride will be smooth. Nassim Taleb, the guy who warned about black swan events before they happened, thinks NVIDIA's recent volatility could be a sign of something bigger. But NVIDIA isn't backing down. The company argues that advances like DeepSeek's only reinforce the need for its high-performance GPUs. One thing's clear: AI investment isn't slowing down, and the stakes for investors are only getting higher.

This article first appeared on GuruFocus.
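The figures quoted above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the helper names `implied_price` and `growth` are hypothetical, and the inputs are the article's own numbers (a $220 target with 80% upside, and spending of $250 billion rising to $330 billion).

```python
def implied_price(target: float, upside: float) -> float:
    """Share price implied by a price target and a fractional upside."""
    return target / (1.0 + upside)


def growth(old: float, new: float) -> float:
    """Fractional change from old to new."""
    return (new - old) / old


# Figures taken from the article; the calculation itself is an illustration.
price = implied_price(220.0, 0.80)   # roughly 122.22 dollars per share
capex_growth = growth(250.0, 330.0)  # 0.32, i.e. a 32% rise from 2024

print(f"Implied current price: ${price:.2f}")
print(f"Projected AI capex growth: {capex_growth:.0%}")
```

On these numbers, the $220 target corresponds to a share price of about $122 at the time of writing, and the projected spending increase is about 32%.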

DeepSeek: What is China's groundbreaking AI that beats OpenAI against all odds?

Yahoo

28-01-2025

Business

Chinese startup DeepSeek has released a 'low cost' open-source artificial intelligence model rivalling OpenAI's ChatGPT, drawing appreciation as well as concern from Silicon Valley. The R1 model made public last week appears to match OpenAI's newer o1 models on several benchmarks. DeepSeek claims to have spent less than $6 million to train it, compared to the hundreds of millions of dollars that American companies like Google, OpenAI and Meta have poured into training their AI models.

R1 'outperforms other open-source models and achieves performance comparable to leading closed-source models', the Chinese company says. 'Despite its strong performance, it maintains economical training costs.' DeepSeek's new image-generation AI model, called Janus-Pro-7B and released on Monday, also seems to perform as well as or better than OpenAI's DALL-E 3 on several benchmarks.

As word spread of the performance of the new Chinese AI model, stocks of leading tech firms in the US such as Nvidia and Oracle crashed on Monday, wiping almost a trillion pounds in value off some of the world's most prized companies. Nvidia is the leading supplier of chips used to train and run AI models. Share prices of American energy companies also suffered steep drops.

The reason: the new DeepSeek models seemingly belie the assertion by the Western tech ecosystem that developing advanced AI requires heavy investments of capital, electricity and water resources. The Lawrence Berkeley National Laboratory, for one, predicts that advancements in AI will see data centers in the US consume as much as 12 per cent of the total electricity by 2028, compared to 5 per cent in 2023. DeepSeek, in contrast, offers the possibility of more cost-effective AI models that require much less capital and power to develop and run.

'The release of DeepSeek AI from a Chinese company should be a wakeup call for our industries that we need to be laser-focused on competing to win,' US president Donald Trump said. 'I view that as a positive because you'll be doing that too, so you won't be spending as much and you'll get the same result, hopefully,' he said, adding that Washington will put more tariffs on foreign computer chips and semiconductors to return their production to the US.

DeepSeek is scripting success against heavy odds. The US, in an attempt to stall China's progress in AI, has banned the export of advanced semiconductors and restricted the sales of Nvidia's chips to the country. The Chinese startup appears to have overcome this hurdle by refining its algorithms for efficiency and optimising the less sophisticated H800 chips.

Hancheng Cao, an assistant professor in information systems at Emory University, says the Chinese AI models are a 'truly equalising breakthrough'. They could be 'great for researchers and developers with limited resources, especially those from the Global South,' he told MIT Technology Review.

The R1 mobile app has quickly climbed to the top of the Apple store's free apps list, ahead of ChatGPT, sparking a debate on whether the Chinese startup poses a threat to its American competitors. Alexandr Wang, head of software company Scale AI, based in San Francisco, says the success of R1 is a 'wake-up call for America'. 'USA must out-innovate and race faster, as we have done in the entire history of AI, and tighten export controls on chips so that we can maintain future leads,' he said.

Some analysts, however, argue that DeepSeek's success will be a shot in the arm for its American competitors due to the Chinese company's approach of prioritising cost efficiency and open-source research. 'If training models get cheaper faster and easier, the demand for inference (the real world use of AI) will grow and accelerate even faster, which assures the supply of compute will be used,' Y Combinator chief Garry Tan said on X.

Meta's chief AI scientist, Yann LeCun, says the Chinese model's success reflects the 'power of open research and open source'. 'People who see the performance of DeepSeek and think, "China is surpassing the US in AI", you're reading this wrong,' Mr LeCun said. 'The correct reading is: "Open source models are surpassing proprietary ones."'

Marc Andreessen describes DeepSeek R1 as 'one of the most amazing and impressive breakthroughs'. 'DeepSeek R1 is AI's Sputnik moment,' the prominent venture capitalist said in a post on X.

DeepSeek was founded in 2023 by Liang Wenfeng, an alumnus of Zhejiang University, and incubated by High Flyer, a hedge fund he started in 2015. The startup's engineers are reportedly graduates of Chinese universities such as Peking University and Tsinghua University.

'The emergence of China's DeepSeek indicates that competition is intensifying, and although it may not pose a significant threat now, future competitors will evolve faster and challenge the established companies more quickly,' Saxo Markets chief investment strategist Charu Chanana told Bloomberg News.

DeepSeek's Janus-Pro-7B AI Launch Causes Nvidia Stock to Drop 17%

Yahoo

27-01-2025

Business

On Monday, Chinese artificial intelligence startup DeepSeek revealed its newest multimodal AI model, Janus-Pro-7B, which set off a notable market response. While other AI-related equities, like Microsoft Corp. (MSFT, Financials), also sank over investor worries about growing expenses in the competitive AI field, Nvidia Corp. (NVDA, Financials) saw its shares plummet 17 percent.

Designed to excel at both multimodal understanding and generation tasks, Janus-Pro-7B represents a significant development in artificial intelligence. Available on Hugging Face, the model uses an autoregressive framework and a single, unified transformer architecture with separate visual encoding pathways. This architecture lets Janus-Pro-7B surpass earlier unified models and challenge task-specific models like OpenAI's DALL-E 3.

The model ranks well on the main app stores and connects with DeepSeek's AI assistant. Because of high demand, registration is limited to Chinese phone numbers for now. Although the Janus-Pro-7B code is open-sourced under the MIT License, use of the model is governed by the DeepSeek Model License; developers can access and contribute to its repository on GitHub.

Technical resources for Janus-Pro-7B include a quick-start tutorial, available on Hugging Face and GitHub, along with thorough documentation. The model uses the SigLIP-L vision encoder, capable of processing 384 by 384-pixel images, and has a downsample rate of 16 for image generation.

This article first appeared on GuruFocus.
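To give a sense of what the quoted numbers imply, the sketch below works through the arithmetic under one assumption that the article does not state: that the downsample rate of 16 applies to each spatial dimension of a 384 by 384 image. The function name `token_grid` is hypothetical and is not part of any DeepSeek API.

```python
def token_grid(image_size: int, downsample_rate: int) -> tuple[int, int]:
    """Return (grid side, total positions) for a square image.

    Assumes the downsample rate is applied per spatial dimension,
    which is an illustrative assumption rather than a documented fact.
    """
    if image_size % downsample_rate != 0:
        raise ValueError("image_size must be divisible by downsample_rate")
    side = image_size // downsample_rate
    return side, side * side


side, total = token_grid(384, 16)
print(f"{side} x {side} grid, {total} positions per image")
```

Under that assumption, a 384-pixel side divided by 16 gives a 24 by 24 grid, or 576 positions per image.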

DEEPSEEK DEBUTS OPEN-SOURCE AI MODEL JANUS-PRO-7B

Cedar News

27-01-2025

Business

Chinese AI firm DeepSeek unveiled on Monday its open-source AI model, Janus-Pro-7B, which allegedly surpasses both Stable Diffusion and OpenAI's DALL-E 3 in terms of image creation capabilities. 'Janus-Pro is a novel autoregressive framework that unifies multimodal understanding and generation. It addresses the limitations of previous approaches by decoupling visual encoding into separate pathways, while still utilizing a single, unified transformer architecture for processing,' the company said on Monday. 'Janus-Pro surpasses previous unified model and matches or exceeds the performance of task-specific models
