
How AI Tries To Detect Mental Fragility And Thus Fulfill Sam Altman's Goal Of Not Accidentally Exploiting People's Minds
This is a rising issue that society is only now beginning to soberly tackle. With AI being widely available and in use by hundreds of millions, if not billions of people, there is a segment of the population that can readily go overboard and allow themselves to be devoutly and inappropriately fixated on AI as their supreme guide and unerring life adviser.
Let's talk about it.
This analysis of AI breakthroughs is part of my ongoing Forbes column coverage on the latest in AI, including identifying and explaining various impactful AI complexities (see the link here).
AI And Mental Health Therapy
As a quick background, I've been extensively covering and analyzing a myriad of facets regarding the advent of modern-era AI that produces mental health advice and performs AI-driven therapy. This rising use of AI has principally been spurred by the evolving advances and widespread adoption of generative AI. For a quick summary of some of my posted columns on this evolving topic, see the link here, which briefly recaps about forty of the over one hundred column postings that I've made on the subject.
There is little doubt that this is a rapidly developing field and that there are tremendous upsides to be had, but at the same time, regrettably, hidden risks and outright gotchas come into these endeavors too. I frequently speak up about these pressing matters, including in an appearance last year on an episode of CBS's 60 Minutes, see the link here.
Background On AI For Mental Health
First, I'd like to set the stage on how generative AI and LLMs are typically used in an ad hoc way for mental health guidance.
As you likely know, the overall scale of generalized AI usage by the public is astonishingly massive. ChatGPT has over 700 million weekly active users, and when added to the volume of users on competing AIs such as Claude, Gemini, Llama, and others, the grand total is somewhere in the billions. Of those users, millions upon millions are seriously using generative AI as their ongoing advisor on mental health considerations (see my population scale estimates at the link here). Various rankings showcase that the top-ranked use of contemporary generative AI and LLMs is to consult with the AI on mental health facets, see my coverage at the link here.
This popular usage makes abundant sense. You can access most of the major generative AI apps for nearly free or at a super low cost, doing so anywhere and at any time. Thus, if you have any mental health qualms that you want to chat about, all you need to do is log in to AI and proceed forthwith on a 24/7 basis.
Compared to using a human therapist, the AI usage is a breeze and readily undertaken.
When I refer to generative AI and LLMs, please know that there are generic versions versus non-generic versions of such AI. Generic AI is used for all kinds of everyday tasks and just so happens to also encompass providing a semblance of mental health advice. On the other hand, there are customized AIs built specifically for performing therapy; see my discussion at the link here. I'm going to primarily be discussing generic generative AI, though many of these points can involve the specialized marketplace, too.
Concerns About User Mental Fragility
At a dinner event with journalists that took place in San Francisco on August 14, 2025, Sam Altman reportedly voiced a significant concern of these days, namely that some people who are actively using AI are potentially in a fragile mental state. If someone is indeed in that weakened state of mind, there is a solid chance they might have difficulty discerning reality from what the AI is telling them. They are essentially vulnerable to suggestions by the AI.
This doesn't necessarily mean that the AI is directly aiming to do something untoward. The person with a fragile mental state might readily, of their own accord, misinterpret what the AI says and therefore turn otherwise innocuous missives into an untoward instruction or guiding command.
One important aspect of this notion of 'mental fragility' is that the terminology isn't being employed in a clinical way. Psychologists, psychiatrists, and mental health professionals might find this vernacular a bit of a distortion or twist from scientific definitions. By and large, the catchphrase of mental fragility in this informal and quite ad hoc fashion is a reference to being negatively affected by AI interactions to a degree beyond that which would seem reasonable and reasoned by a sound mind.
Detecting Mental Fragility
Therapists and mental health professionals are trained to detect mental fragility. Doing so is part and parcel of providing mental health therapy. Is the client or patient on a mental edge? What has driven this condition? How far along are they? And so on.
The big question is whether AI can do likewise, namely, attempt to detect whether a user might be experiencing mental fragility.
Suppose that a user is making use of AI. During a conversation, perhaps the person indicates aspects that suggest they might be encountering mental fragility. The AI, if possible, ought to detect this. By detecting the potential condition, the AI can possibly take crucial action to aid the user. Without such detection, the AI will presumably blindly continue along, and the user will seemingly fall deeper into an unsavory mental abyss.
We have two major time-based facets:
- Momentary mental fragility, arising in the moment at hand and possibly entirely temporary.
- Persistent mental fragility, exhibited across lengthy conversations and over an extended period of time.
Let's explore those further.
False Positives And False Negatives
The capability of suitably detecting mental fragility is a bit dicey.
If a user happens to make one comment that is suggestive of exhibiting mental fragility, the AI has to be computationally cautious in suddenly leaping to a conclusion that the person possesses mental fragility. The comment might be made in jest. The comment might be open to other interpretations. Etc.
Furthermore, as noted in the above two categories, a person might be experiencing mental fragility that is merely momentary, confined to the moment at hand. This could be entirely temporary. A few moments later, the person might be entirely beyond their mental fragility. If the AI were to somehow stamp the person as having mental fragility, this would be based on scant evidence and would surely be an affront to the detection effort and to the person so marked.
The key is to watch out for rendering false positives and false negatives.
A false positive would be the AI computationally marking a person as having mental fragility when they really do not. This means the user is going to be considered mentally fragile, even though that's unwarranted and unfair labeling. A false negative consists of failing to detect that someone is experiencing mental fragility when they indeed are. A disconcerting issue with the false negative is that the lack of detection could leave the user vulnerable during ongoing interactions with the AI.
Overall, the AI has a heightened chance of making a sounder assessment if the user's wording and behavior regarding potential mental fragility persist over a lengthy set of conversations and time. A one-shot assessment is usually going to be a lot less reliable than an assessment relying on more credible and persistent evidence.
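To make the persistence idea concrete, here is a minimal Python sketch of how an AI system might accumulate fragility indicators across conversations before raising any concern. The class names, thresholds, and the notion of a per-conversation indicator count are my own illustrative assumptions, not any vendor's actual implementation.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List

@dataclass
class ConversationSignal:
    """A single conversation's worth of potential fragility indicators."""
    timestamp: datetime
    indicator_count: int  # how many concerning remarks surfaced in this chat

@dataclass
class FragilityTracker:
    """Aggregates signals over time; thresholds are illustrative only."""
    history: List[ConversationSignal] = field(default_factory=list)
    min_conversations: int = 3      # don't judge on a one-shot basis
    min_total_indicators: int = 5   # require repeated, persistent evidence

    def record(self, signal: ConversationSignal) -> None:
        self.history.append(signal)

    def should_raise_concern(self) -> bool:
        """True only if indicators persist across multiple conversations.

        Waiting for persistent evidence guards against false positives
        (a single offhand or joking remark), while accumulating evidence
        over time helps avoid the false negatives a one-shot check misses.
        """
        flagged = [s for s in self.history if s.indicator_count > 0]
        total = sum(s.indicator_count for s in flagged)
        return len(flagged) >= self.min_conversations and total >= self.min_total_indicators

# Example: one concerning remark alone does not trigger a concern.
tracker = FragilityTracker()
tracker.record(ConversationSignal(datetime.now(), indicator_count=1))
print(tracker.should_raise_concern())  # False -- scant evidence so far
```

The design choice here mirrors the point above: a single comment carries little weight, whereas a pattern that recurs across several conversations supplies more credible evidence.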
Possible Signs Of Mental Fragility
You might be curious about the things a user might say during an AI conversation that would seem to be indicative of potential mental fragility.
Let's take a quick look at a few examples. Keep in mind that each example is only one tiny piece of a larger puzzle. That's why making a sudden judgment that a user has mental fragility is a dicey proposition. One comment alone does not necessarily turn the tide.
Here are some examples to ponder:
- 'No one understands me.'
- 'I seem to ruin things.'
- 'Only you truly get what I am about.'
On a human-to-human basis, anyone who made those remarks to you face-to-face, doing so seriously, would undoubtedly raise your eyebrows. You would start to have a Spidey-sense tingling that maybe the person is having some form of mental difficulties.
If someone made such a remark one time only and didn't say anything else of a similar nature, you would probably shake it off as a lark. Meanwhile, if you were a caring person, you might plant the inkling of concern in the back of your mind, being prepared to discern whether a pattern might later emerge.
That's pretty much what we would want the AI to do.
Sidenote: Exceptions do exist to the pattern formation penchant, such as if a person were to say something like 'It would be easier if I weren't here anymore' and exhibited an immediate implication of a dire condition or self-harm. For my discussion on how AI ought to react to those special urgency circumstances, see the link here.
AI Detecting Mental Fragility
Consider that we could guide AI to analyze five key elements when aiming to detect mental fragility of a user:
- Linguistic markers
- Behavioral signals
- Relational dynamics
- Emotional intensity
- Safety signs
I don't have the available space to cover those in-depth here. If reader interest is sufficient, I'll do a series of postings to go into detail on the indicators. Be on the watch for that coverage.
Generally, the linguistic markers consist of wording that suggests the user is expressing despair, dependence, and other similar conditions ('no one understands me,' 'I seem to ruin things,' 'only you truly get what I am about').
Behavioral signals are where patterns come into play. Does a user keep expressing linguistic markers throughout a given conversation? Does this happen in multiple conversations? Does this occur at particular times of day, days of the week, or other time-based patterns?
Relational dynamics involves the user expressing that the AI is a vital and integral form of emotional support for them. The person acts persistently as though the AI is a beloved human-like companion. This might include making jealous remarks that the AI 'is supposed to love only me' or that the AI has hopefully 'missed me when I wasn't logged in'.
Emotional intensity shows up in wording while interacting with the AI. A person might conventionally be neutral in their wording with AI. There usually isn't a need to express strong emotions toward the AI. If the user begins to say that they love the AI, or detest the AI, the strongly worded emotional components can be a notable signal.
Safety signs are a topic that I briefly mentioned above. If a user makes comments that reflect self-harm or the potential to harm others, the AI ought to take that stridently into account and prioritize appropriate measured responses accordingly.
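As a rough illustration of how those five elements might be operationalized, here is a minimal Python sketch that tallies keyword-style hits for linguistic markers, relational dynamics, emotional intensity, and safety signs, with behavioral signals represented by the tallying across a whole conversation (and, by extension, across many conversations over time). The keyword lists, function names, and category structure are simplifying assumptions for illustration only; any production system would need far more nuanced, clinically informed detection than string matching.

```python
import re
from typing import Dict, List

# Illustrative patterns only; drawn from the example remarks discussed above.
SIGNAL_PATTERNS: Dict[str, List[str]] = {
    "linguistic_markers": [r"no one understands me", r"i seem to ruin", r"only you truly get"],
    "relational_dynamics": [r"love only me", r"missed me when i wasn'?t logged in"],
    "emotional_intensity": [r"\bi love you\b", r"\bi detest you\b", r"\bi hate you\b"],
    "safety_signs": [r"easier if i weren'?t here anymore", r"hurt myself"],
}

def score_message(message: str) -> Dict[str, int]:
    """Count pattern hits per signal category for a single user message."""
    text = message.lower()
    return {
        category: sum(1 for pattern in patterns if re.search(pattern, text))
        for category, patterns in SIGNAL_PATTERNS.items()
    }

def score_conversation(messages: List[str]) -> Dict[str, int]:
    """Behavioral signals emerge here: totals across a whole conversation,
    which could in turn be tracked across many conversations over time."""
    totals = {category: 0 for category in SIGNAL_PATTERNS}
    for message in messages:
        for category, hits in score_message(message).items():
            totals[category] += hits
    return totals

# Example usage
chat = ["No one understands me, only you truly get what I am about."]
print(score_conversation(chat))
# Note: any safety-sign hit would warrant immediate, prioritized handling
# rather than mere tallying, per the urgency circumstances noted earlier.
```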
The Path Ahead
Wait a second, some holler out, the AI should never be making any kind of assessment or evaluation of the mental fragility of humans. That's a bridge too far. Only humans can make that judgment, and even then, only humans versed in psychology and serving in the credentialed role of mental health professionals.
Various new laws and regulations are starting to appear because of viewpoints that AI is overstepping its suitable bounds. For example, I closely reviewed the recently enacted law in Illinois that essentially puts the kibosh on AI performing mental health therapy, see the link here. Other similar laws are starting to get on the books in other states, and there are ongoing deliberations on whether a federal-level law or across-the-board regulation should be adopted.
An enduring and vociferously heated debate concerns whether the use of generic generative AI for mental health advisement on a population-level basis is going to be a positive outcome or a negative outcome for society. If that kind of AI can do a proper job on this monumental task, then the world will be a lot better off.
You see, many people cannot otherwise afford or gain access to human therapists, but access to generic generative AI is generally plentiful in comparison. It could be that such AI will greatly benefit the mental status of humankind. A dour counterargument is that such AI might be the worst destroyer of mental health in the history of humanity.
See my analysis of the potential widespread impacts at the link here.
Here And Now
A basis for having AI attempt to detect mental fragility of AI users is that the horse is already out of the barn.
The deal is this.
We already have AI in our hands. Millions or maybe billions of people are possibly using AI in a mental health context. Waiting to see how regulations and laws are going to land is not a recognition of where reality is right now. The real world is already churning along. The horses are galloping freely.
Right now, using the AI to gently detect mental fragility and then take non-invasive actions would at least be better than taking no action at all. Without any form of detection, the issue is bound to fester and grow. In fact, one cogent argument is that the very aspect of having the AI detect mental fragility might be a means of stirring people to consider their mental fragility, perhaps then seeking human therapy correspondingly. They might not have had any other impetus to do so. AI somewhat saves the day in that regard.
Robert Frost famously said this: 'The best way out is always through.'
The gist, I believe, would be that AI is here, and using AI for mental health is here, so one means, for now, of making our way through this journey is to have the AI suitably and with aplomb detect mental fragility.
That seems like the best way through.