Chinese AI firm SenseTime bets on multimodal models to stand out from rivals

SenseTime, an artificial intelligence (AI) pioneer in China, has launched new models that it claims surpass OpenAI products in reasoning capabilities, as it bets on multimodal models to secure its position in the competitive AI landscape.
The company on Thursday unveiled SenseNova V6 and V6 Reasoner, new iterations of its self-developed AI model series. V6 outperformed OpenAI's GPT-4o across several metrics, including fact-checking, numerical reasoning, data analysis and visualisation, according to SenseTime chairman and CEO Xu Li, who cited data from benchmarking platform TableBench.
With 600 billion parameters, V6 is China's leading model in multimodal reasoning and also the most cost-effective option for inference across the industry, according to the company.
Xu also said that V6 Reasoner outperformed OpenAI's o1 and Google's Gemini 2.0 Flash Thinking in multimodal reasoning abilities. The advances are designed to address an industry-wide challenge: the depletion of high-quality text data for training large language models (LLMs).
SenseTime's booth at an AI conference in Shanghai. Photo: Costfoto/NurPhoto via Getty Images
Unlike traditional LLMs that focus primarily on text, multimodal LLMs integrate various modalities – such as images, audio and video – to improve comprehension and generation capabilities.
The industry's initial strategy of expanding model parameters under the scaling law had 'hit a wall', Xu said in an interview in Shanghai on Thursday. 'We've nearly exhausted all text data that can be collected from the internet,' he said.

Related Articles

China's DeepSeek closes in on US rival OpenAI, surpasses Alibaba with upgraded model

South China Morning Post

4 hours ago

Chinese artificial intelligence (AI) start-up DeepSeek said R1-0528, the first significant upgrade to its R1 reasoning model that debuted in January, matched the performance of top global competitors, including OpenAI and Google.
In a statement released late on Thursday, DeepSeek highlighted improvements in the new model's reasoning and creative writing capabilities, making it more adept at crafting argumentative essays, fiction and prose in styles that closely mimic human authors. Coding capabilities have also been enhanced.
The company said the latest version achieved a 50 per cent reduction in 'hallucinations' – instances where AI generates misleading information with little factual basis.
These upgrades were achieved by investing additional computing resources in the post-training stage, when developers make final adjustments and enhancements to the model after the main training process, the company said. Post-training usually focuses on boosting efficiency and enhancing content safety and accuracy.
'The updated R1 model excelled among domestic AI models in a range of benchmark tests, including maths, coding and general logic, and matched up to global top models such as [OpenAI's] O3 and [Google's] Gemini2.5-Pro,' DeepSeek said.
Benchmark results cited by DeepSeek show that R1-0528 outperforms Alibaba's Qwen3 AI model. Photo: Shutterstock
The update comes after the original R1 model was dethroned in late April by Alibaba Group Holding's flagship model, Qwen3, in the LiveBench rankings for leading open-source AI systems. The shift underscores the heated competition among Chinese tech players in advancing AI capabilities.

Man who made iPhone helping Sam Altman bury it

Asia Times

23-05-2025

Sam Altman isn't just coming for your job. He's coming for your phone, and maybe your soul. OpenAI just spent US$6.5 billion to acquire a secretive hardware company founded by Jony Ive—the man who helped make the iPhone what it is. You may not know Ive's name, but you've touched his work. Literally. Every day. When we think of the iPhone, we automatically think of Steve Jobs—the black turtleneck, the enormous ego. The messianic charisma. But the real sculptor behind it was Ive. He's the architect responsible for Apple's sleek, seductive gadgets. He's the reason your phone feels like a lifestyle, not a tool. Ive turned cold metal into a fetish object. Now he's back. But not with Apple. With OpenAI. And that should make you pay attention. Because this isn't some design side-hustle or futuristic prototype for nerds in labs. This is OpenAI trying to build the first real AI-native device—a category killer designed not just to complement your phone but to replace it. A smart device that doesn't just respond to your voice, but listens when you don't speak. That doesn't wait for your command, because it already knows what you want. The goal is clear: Kill the iPhone, the interface and the screen. Become the last machine you ever carry. What OpenAI is building is not a phone. It's an ambient intelligence system—a wearable, maybe even implantable, AI that will live with you. On you. In you. It won't need an app store. It is the app. It'll whisper reminders, flag your blood pressure, read your micro-expressions, log your emotional state, track your speech, and give you answers before you ask. This isn't a more developed Siri. It's something far more intimate. It doesn't seek your input—it seeks your patterns. Your breath, your posture, your pulse. It'll understand what stresses you out. What calms you down. Who you're texting. What you're hiding. It's not a search engine. It's your new nervous system. You won't need to tap it. You'll forget it's there. 
But it'll always be listening. Always learning. Always predicting. Imagine something that makes Google seem slow and Apple seem old. That's what $6.5 billion just bought. Altman didn't hire Ive to make something cool. He hired him to make something irresistible. Because that's Ive's superpower: making invasive technology feel like art. Like you chose it. You didn't buy an iPhone. You joined the cult. Altman's about to launch a new one. If you want people to accept constant AI surveillance, you can't roll it out in a black box that looks like an NSA project. You need it to feel like magic. Smooth edges. Soft glow. Maybe white ceramic. Something elegant enough to be worn in public. Something that lets you lie to yourself and say: 'It's just a new kind of AirPod.' When in reality, it's the most intimate listening device ever created. This isn't just about hardware. It's about behavioral capture. You don't get true intimacy from cameras or mics. You get it from proximity—constant, seamless proximity. From something that nestles up to you like a digital familiar. And once it's there, you'll trust it. Because it'll work. And because it will flatter you. It'll make you smarter. More organized. More efficient. Less anxious. That's the hook. It's not surveillance if it helps you. And OpenAI isn't going to stop at design. The company's mission is to 'ensure artificial general intelligence benefits all of humanity.' But the way it's moving, it doesn't just want to build AGI. It wants to be the gateway to reality. That means controlling the interface between you and the machine. Not through a web browser. Not through a keyboard. But through something much closer. I suggest this is the final app—the interface to end all interfaces. Because the moment an AI companion lives in your ear, understands your speech patterns, and feeds you real-time answers… why would you ever Google something again? Why open your phone when your device knows what you're thinking before you do? 
That's the ambition here. Not just to make a better device. But to own the future of cognition itself. And the crazy—or maybe not so crazy—thing is that people will accept it. Happily. Gleefully. Because it will be useful (initially, anyway). It will help them write better emails, get better sleep, pick better dates, remember birthdays, spot diseases, and schedule their lives. It will become an outsourced consciousness, and it will feel natural. This is the seduction of AI intimacy: it will work. And when it does, it will become indispensable, like electricity or oxygen.

Apple aims to launch smart glasses in 2026 in push into AI devices to rival Meta, OpenAI

South China Morning Post

23-05-2025

Apple is aiming to release smart glasses at the end of next year as part of a push into artificial intelligence (AI)-enhanced gadgets, but it has shelved plans for a smartwatch that can analyse its surroundings with a built-in camera.
Company engineers are ramping up work on the glasses – a rival to Meta Platforms' popular Ray-Bans – in a bid to meet the year-end 2026 target, according to people with knowledge of the matter. Apple would start producing large quantities of prototypes at the end of this year with overseas suppliers, said the people, who asked not to be identified because the products have not been announced.
The iPhone maker is looking to join the emerging trend of AI-powered devices – an area where it faces fresh competition. OpenAI said Wednesday that it was teaming up with former Apple chief design officer Jony Ive to introduce hardware products starting next year. The AI pioneer is acquiring Ive's secretive io start-up, with the goal of releasing a family of AI devices.
OpenAI is acquiring the start-up of former Apple chief design officer Jony Ive (right), seen here with Apple CEO Tim Cook. Photo: AP
Apple's glasses would have cameras, microphones and speakers, allowing them to analyse the external world and take requests via the Siri voice assistant. They could also handle tasks such as phone calls, music playback, live translations and turn-by-turn directions. The approach would be similar to that of Meta's current glasses and coming devices running Alphabet's Android XR operating system.
