The data advantage: How web scraping and NLP give investors a decision-making edge
But before AI even became part of everyday vocabulary, one company was already exploring how it could be used in determining investment strategies.
In 2019, global asset management firm Robeco tapped on natural language processing (NLP), which is a form of AI, to help them analyse large volumes of text and signals to find patterns that might influence markets.
An international asset manager with one of the world's largest quant equity research teams, Robeco has been at the forefront of quant research, contributing to both academic research and client portfolios for over 30 years.
While traditional sources of data such as financial statements and market prices have long been tapped by investors for insights, this new realm of alternative data means exploring unconventional or non-traditional types of data that have not been used in the past for investment decisions.
Next-generation quant researchers who delve into such data also use 'web scraped data' to monitor alternative sources like social media and online reviews in real time for a tech product launch, giving them an edge over investors waiting for quarterly reports to gauge sentiment around the product's reception.
Using NLP allows researchers to analyse such data, separating the noise from the potential signals. Another example is monitoring the number of job vacancies at firms, viewing an increase as an expectation for future growth.
Evolving along with financial markets
The use of these next-gen techniques and new data sources allows for more complex and adaptive investment strategies that can navigate the ever-changing conditions in financial markets. These tools do not just benefit existing quant strategies, such as Robeco's benchmark-aware active quant strategies, but also enable the firm to create new next-generation strategies.
One example is Robeco's multi-thematic equities strategy that harnesses AI to detect emerging themes such as cancer treatments and satellite communications, and identifies when to enter or exit themes and particular companies.
Firstly, the strategy uses a rigorously tested NLP algorithm to detect themes within a vast amount of alternative data including company earnings calls, news articles and management interviews. It then uses a different algorithm, based on sentiment analysis – a process which classifies whether something is positive, negative, or neutral, based for example on vocabulary choice – to select the most attractive companies.
Making the right choice
Today, many asset managers are jumping on the AI bandwagon. But what should asset allocators or fund selectors look for when evaluating the credentials of AI claims? 'Sometimes 'innovation' is a very overused term,' says Mike Chen, head of Next-Gen Research at Robeco.
'When assessing the quantitative investing capabilities of an asset manager, it's important to look beyond marketing claims. Do the asset managers invest sufficiently in building their proprietary data sets? And do they have a thoughtful, measured, and transparent process, with a team who understands the proper use and potential misuse of new tools and data?' he asks.
Chen explains that Robeco is a top-tier quant house and one of the few with strong fundamental equities and fixed income teams. In fact, the Robeco quant team started out by providing stock ranks for the portfolio managers' input in their fundamental emerging market team.
Today, the quant team can get feedback from the fundamental teams on dynamics the model might not pick up, such as stock-specific events, sector-specific adjustments, or macro considerations in emerging markets. The fundamental teams can use quant tools to identify promising investment opportunities, relying on a combination of the quant group's long-proven factor research and next-gen signals.
In the future, alternative data, machine learning, and NLP will enhance collaboration by improving both quant models and fundamental research, thereby strengthening the firm's offering. Asset managers that can adapt and leverage the growing power of data and AI techniques will see differentiated advantages.
Find out more about Robeco quant investing and its active quant strategies.
Disclaimer:
Important information – capital at risk
This information refers only to general information about Robeco Holding B.V. and/or its related, affiliated and subsidiary companies, ('Robeco'), Robeco's approach, strategies and capabilities. This is a marketing communication intended solely for professional investors, defined as investors qualifying as professional clients, who have requested to be treated as professional clients or who are authorized to receive such information under any applicable laws. Unless otherwise stated, the data and information reported is sourced from Robeco, is, to the best knowledge of Robeco, accurate at the time of publication and comes without any warranties of any kind. Any opinion expressed is solely Robeco's opinion, it is not a factual statement, and is subject to change, and in no way constitutes investment advice. This document is intended only to provide an overview of Robeco's approach and strategies. It is not a substitute for a prospectus or any other legal document concerning any specific financial instrument. The data, information, and opinions contained herein do not constitute and, under no circumstances, may be construed as an offer or an invitation or a recommendation to make investments or divestments or a solicitation to buy, sell, or subscribe for financial instruments or as financial, legal, tax, or investment research advice or as an invitation or to make any other use of it. All rights relating to the information in this document are and will remain the property of Robeco. This material may not be copied or used with the public. No part of this document may be reproduced, or published in any form or by any means without Robeco's prior written permission.
Alpha refers to the excess return of an investment relative to a benchmark index and is a measure of performance.
Singapore
This information is for informational purposes only and should not be construed as an offer to sell or an invitation to buy any securities or products, nor as investment advice or recommendation. The contents of this document have not been reviewed by the Monetary Authority of Singapore ('MAS').
Robeco Singapore Private Limited holds a capital markets services licence for fund management issued by the MAS and is subject to certain clientele restrictions under such licence. An investment will involve a high degree of risk, and you should consider carefully whether an investment is suitable for you.

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles
Business Times
2 hours ago
- Business Times
Using machine learning for smarter net-zero investing
CLIMATE considerations are becoming increasingly important to investors as they seek to manage risks, capitalise on opportunities, and identify engagement targets related to climate change in their investment strategies. These strategies often hinge on the availability and quality of climate data. Yet existing datasets frequently fall short. Net-Zero Investing: Harnessing the Power of Unstructured Data, a recent report published by CFA Institute and co-authored by a team from Acadian Asset Management, discusses how limitations in climate data pose challenges to net-zero investing. It also highlights how investors can overcome such limitations by leveraging natural language processing (NLP) and other machine learning (ML) techniques, as well as techniques to account for uncertainties. Net-zero investing generally refers to investment practices that have an ultimate impact on reducing carbon emissions and mitigating climate risks. This could mean financing projects that support or lead to achieving net-zero emissions, or investing in businesses that have significantly reduced emissions or are in the process of doing so. The report notes that with no single, consistent source of net-zero investment data, investors must be prepared to combine different datasets and various statistical techniques in devising net-zero investment strategies. Even then, investors will still face substantial uncertainty around the estimates they produce. The report advocates for new approaches to reflect this reality in the investment process. The problem with current climate data Climate data currently available to investors suffers from two primary issues: a focus on historical emissions and incomplete coverage. A NEWSLETTER FOR YOU Friday, 12.30 pm ESG Insights An exclusive weekly report on the latest environmental, social and governance issues. Sign Up Sign Up While historical data offer a snapshot of past performance, they fail to account for how companies might adapt to future decarbonisation demands. For example, metrics such as carbon intensity are point-in-time measurements of emissions, but are not reliable predictors of a company's long-term alignment with net-zero goals. In addition, the lack of comprehensive data coverage creates challenges and inconsistencies for asset owners managing diverse portfolios. Data providers often prioritise large-cap issuers in developed markets, leaving significant gaps for small-cap and emerging market issuers. This discrepancy complicates efforts to fully assess climate risks and opportunities in the entire portfolio, even as many asset owners aim for systemic climate alignment across all holdings. Furthermore, existing climate alignment measures often lack transparency and credibility. Proprietary metrics developed by data providers can vary widely in methodology, leading to inconsistent outcomes. For instance, implied temperature scores provided by different providers often exhibit low correlation, making it difficult for investors to determine which source to trust. Moreover, many of these metrics give a false sense of precision, presenting results with decimal-point accuracy despite significant uncertainties. These issues undermine confidence in the data and pose challenges for investors seeking reliable indicators of net-zero alignment. Greenwashing – the practice of overstating environmental commitments – further complicates the reliability of climate data. Many companies make bold claims about decarbonisation, but with varying credibility. Corporate disclosures often lack specificity, making it difficult for investors to evaluate the feasibility and sincerity of stated goals. Without robust tools to parse and analyse these narratives, investors risk relying on overly optimistic or misleading data. ML and NLP to the rescue Machine learning offers a transformative solution to the challenges of climate investing in two ways. First, understanding a company's decarbonisation strategy and assessing its net-zero alignment rely on unstructured data – such as corporate narratives, stated commitments, external commentary and qualitative disclosures. These forms of data are difficult to analyse with traditional statistical methods. ML techniques, particularly NLP, overcome this challenge by processing such data at scale to extract meaningful patterns and provide a more nuanced view of a company's climate strategy. Second, ML provides the scalability that manual analysis lacks. While human analysts can evaluate individual issuers, this approach is not feasible for portfolios with thousands of securities. ML enables consistent, repeatable analysis across a wide range of issuers, providing investors with actionable insights. Consider range of outcomes, not single-point estimates The inherent uncertainty of predicting long-term outcomes remains a central challenge for net-zero investment strategies. The report advocates for a probabilistic approach that incorporates a range of potential scenarios. Rather than relying solely on single-point estimates, investors should consider a spectrum of possible outcomes to better manage risks and opportunities. This approach aligns with the realities of climate investing, where future developments are highly uncertain. Integrating ML with other techniques While ML is a powerful tool, it is not a standalone solution. The report emphasises the importance of combining ML with other datasets and statistical techniques. For instance, Bayesian updating (a method that revises probabilities based on new evidence) can incorporate new data into climate measures, providing a dynamic view of a company's net-zero alignment. Furthermore, this approach offers not just point estimates, but also confidence intervals for parameters. By integrating ML with these methods, investors can build more comprehensive and reliable climate measures. Putting it all together Taking into account the above recommendations, here is an example of how investors can apply ML and NLP to assess a company's decarbonisation alignment. This approach highlights the value of blending diverse data types and analytical methods to build robust climate measures. The process involves: Data collection: Gather textual data from corporate disclosures, news media and other sources Model training: Use supervised learning to fine-tune an NLP model, teaching it to interpret climate-specific language Integration of additional data: Combine textual insights with other data sources, such as financial forecasts, to create a holistic measure of net-zero alignment Dynamic updating: Apply Bayesian techniques to account for new data and uncertainties, enabling a range of possible outcomes Conclusion Climate investing, in particular net-zero strategies, presents a complex but essential challenge as the limitations of existing climate data demand innovative approaches. By leveraging ML and NLP, investors can overcome these challenges, extracting actionable insights from unstructured data to build more comprehensive climate measures. No single solution can address all the needs of climate investing. Instead, investors must adopt a multifaceted approach that combines diverse datasets, advanced techniques, and probabilistic frameworks that embrace uncertainty. While climate investing remains both an art and a science, the tools and methodologies outlined here provide a solid foundation for building effective and credible net-zero portfolios. This article was adapted by the CFAS Advocacy Committee. The original version first appeared on the CFA Institute Research & Policy Center's website: The writers are from Acadian Asset Management. Andy Moniz, CFA, is senior vice-president and director of responsible investing, in London. Devin Nial is senior vice-president and portfolio manager, ESG, in Boston. Matthew Picone, CFA, is senior vice-president and portfolio manager, ESG, in Sydney. Lukasz Pomorski is senior vice-president, portfolio manager in Boston. Jerry Yu is assistant vice-president and associate portfolio manager, ESG, in Boston.
Business Times
08-08-2025
- Business Times
Indonesia's benchmark yield falls to lowest in almost two years
[JAKARTA] Indonesia's 10-year bond yield fell to the lowest level since September 2023 due to rising interest-rate cuts expectations, overseas inflows and a soft US dollar. The benchmark yield slipped three basis points on Friday (Aug 8) to 6.42 per cent, extending its decline from a high in January to almost 90 basis points. Emerging Asian sovereign bonds, including Indonesia's, have rallied this year as global funds diversified their investments away from US dollar assets. July's weaker-than-expected US payroll data boosted bets that the US Federal Reserve will cut rates at its September meeting, improving sentiment towards bonds and weighing on the greenback. 'Monetary policy settings in Indonesia have consistently been on the tight side as Bank Indonesia grapples with the foreign exchange management component of its mandate,' said Philip McNicholas, Asia sovereign strategist at Robeco in Singapore. 'With the US dollar embarking on a secular decline, that should allow for a more domestically supportive policy stance via rate cuts and should support some downward pressure on yields overall.' Expectations that Bank Indonesia will deliver more policy easing this year has also helped fuel the rally in local bonds. Improving sentiment towards the nation's finances also bolstered demand, after Finance Minister Sri Mulyani Indrawati pledged to keep the state budget deficit at below 3 per cent of gross domestic product and to use cash reserves to plug this year's shortfall. Overseas investors have bought about US$3.5 billion of Indonesian government bonds this year, already surpassing last year's total inflows. 'Global local-currency-investor positioning is relatively light in Indonesia, especially versus other high-yielding markets,' Robeco's McNicholas said. 'Moving back towards more historically normal positions in Indonesia is likely supportive for the government bonds, pushing yields lower still.' BLOOMBERG
Business Times
25-06-2025
- Business Times
TSMC unit to issue US$10 billion of stock to counter forex swings
[TAIPEI] Taiwan Semiconductor Manufacturing Co's overseas unit is set to issue US$10 billion worth of new stock to shore up its forex hedging operations, making its biggest such move to counter a volatile local currency. TSMC Global is poised to issue the shares to help it hedge against forex swings, the company said in a statement. It's the third such deal since 2024, and by far the largest. They occurred during periods when the Taiwan dollar tended to appreciate. The moves grant TSMC Global – the vehicle responsible for managing overseas investments and hedging – more capital flexibility in managing exchange rate risks. Recent gains by the Taiwan dollar have caused worries in Taipei about the economy's heavy reliance on exports. In May, the currency notched its biggest single-day gain since the 1980s, spurring calls from the central bank to curb speculation. 'Generally speaking, the heightened forex volatility would mean that banks may be adjusting their margin requirements,' said Philip McNicholas, Asia sovereign strategist at Robeco based in Singapore. 'Issuing new shares, and bringing in an immediate cash injection, may help companies manage the margin requirements on both existing and new hedges.' TSMC, the main chipmaker to Apple and Nvidia, is by far the island's biggest company and exporter because the majority of its production is domestic. A stronger Taiwan dollar hurts exporters because the US dollars they earn from sales abroad would translate into less of the local currency, or they would need to raise their prices overseas and risk denting demand. In June, chief executive officer CC Wei told shareholders the company's operating margin has fallen several percentage points because of a stronger local currency. BLOOMBERG