Managing Director & Partner
Seattle
By Suchi Srinivasan, Ernesto Pagano, Christopher Collins, Giorgo Paizanis, and Anna Schickele
For traditional information services providers, generative AI (GenAI) represents an unprecedented opportunity to drive sustainable advantage. By scaling solutions that leverage their vast data and content assets, companies can help their customers harness the power of GenAI while generating new revenue for themselves. Leading players across financial information, credit, publishing, and other info services are already beginning to see the benefit of GenAI at scale.
But for the vast majority of companies, realizing the full potential of GenAI has proved challenging. According to a recent BCG cross-industry survey, 66% of leaders are ambivalent or dissatisfied with their progress on AI and GenAI. This has also been the case with information service providers, many of whom have created proofs of concept only to find that developing a customer-facing product is more costly and resource-intensive than anticipated. Issues pertaining to accuracy of GenAI output, organizational capability and capacity, systems integration, and data preparation are among those standing in the way of successful scaling.
Providers that address these challenges and scale GenAI successfully will have access to a rapidly growing market. According to BCG analysis, the current market opportunity for GenAI financial services information products alone is around $5 billion to $6 billion and is projected to grow to $15 billion by 2026.
To date, GenAI has gained wide use as a way for companies to enhance internal productivity, and some are already using it to drive end-to-end transformation. But for companies that provide information products—such as financial, business, and sustainability data—GenAI can be a tool to enhance the productivity of end-customers. This new revenue stream can help to justify early investments into the nascent technology.
While most traditional info service providers understand the value of this opportunity, few have managed to scale GenAI successfully. To learn why, in early 2024 we surveyed 100 professionals from providers who had experience scaling GenAI solutions. (See “About the Survey.”)
More than half of the survey respondents said that building GenAI products has been more time-intensive and costly than anticipated. Four challenges were cited most frequently: accuracy requirements, organizational capability or capacity, systems integration, and data preparation and management. (See Exhibit 1.)
In early 2024, BCG conducted a survey of 100 tech, data, and IT professionals from global information services providers across financial information; credit, risk, and compliance; business information; and other verticals. Of these respondents, 66 were considered experienced: they have started to or have successfully scaled a GenAI solution. The remaining 34 are considered inexperienced because they have conducted proofs of concept or pilots but have not yet attempted to scale a solution.
Through our work with clients, we have gleaned various ways to address the top four obstacles to successfully scaling GenAI.
Accuracy requirements for GenAI output were the most frequently cited barrier to scaling up solutions. Yet while there are many ways to make an off-the-shelf GenAI model more accurate, determining which techniques to implement is not straightforward.
A key issue is that accuracy improvements often come at the expense of latency and cost. Complicating matters, changes to any part of the solution may have ripple effects elsewhere. Sometimes these effects can be positive—for instance, simply injecting the word “please” in a prompt can significantly improve the accuracy of model output. But often a change, such as adding an extra space, can reduce accuracy. And since GenAI solutions can provide different results each time, the outcome of any changes, large or small, can be difficult to predict.
Given these constraints, setting a clear accuracy threshold is a critical first step for avoiding overinvestments of time and resources. For a case where GenAI will be used to help with report writing, 80% accuracy (as good as a typical human writer) is likely good enough. This type of solution may be achievable with a lower-cost model (such as Chat GPT-3.5 instead of GPT-4). But for use cases where near 100% accuracy is required—such as contract review or credit scoring—predictive or deterministic AI models may need to be used alongside GenAI, at least for now.
Once the threshold has been determined, the next step is to apply techniques that may help meet this goal. We recommend starting with the simplest techniques, such as prompt engineering and context injection, even though many developers are not familiar with them. (See Exhibit 2.) If these techniques do not achieve the threshold, organizations should consider approaches that are more complicated and expensive, such as fine-tuning or training a new model. A “human in the loop” process to review output can also boost accuracy.
The key here is iteration, finding the right combination of changes to magnify the power of the entire solution and hit the accuracy threshold while limiting negative impacts to speed and budgets.
Few organizations have the capability or capacity to develop every GenAI product they’d like to build. According to BCG’s recent cross-sector, C-suite survey, 62% of the executives who expressed dissatisfaction with their organization’s progress on AI and GenAI cited a shortage of talent and skills.
To make the most of the GenAI opportunity, therefore, it’s important for companies to determine which use cases to focus on. Resource requirements should be driven by three considerations that focus on value, readiness, and complexity. To scale a GenAI product successfully, companies should be able to answer “yes” to the following questions:
Companies must fight the urge to prioritize based on only one or two of these dimensions. Focus only on value potential, and a company may take on more than it is ready for in terms of technical complexity. Focus only on organizational readiness, and a company may build products that don’t have strong enough market demand. A company that focuses only on complexity may conceptualize solutions that are valuable and feasible but could lack the necessary leadership support to execute.
Best-in-class companies use a balanced scorecard across these dimensions to focus their energy and resources in the right places. Once companies have a few early successes, they are usually able to scale the skills and capabilities they have built within their organization—and build increasingly sophisticated solutions as well.
For example, a financial information services provider built a retrieval-augmented generation (RAG) solution to help answer free-text questions about company financial performance. It took the company nine months to build this product, and only half that time to build subsequent solutions that leveraged a similar approach.
It’s easy to lose sight of the fact that a GenAI solution requires many components besides the GenAI model. The front end, data, enterprise core systems, infrastructure, and security all must be developed to operate optimally, both now and in the future. Companies should start developing these other components early on, focusing on two main considerations.
Interoperability. To support high-quality, secure data sharing, the components of a particular GenAI solution need to be interoperable with the organization’s existing tech stack and with its other GenAI products. This helps ensure that different solutions can “talk” to each other, enabling cohesive workflows.
Adaptability. Given the fast pace of today’s tech cycles, it’s critical to ensure that the solution components can be updated cost-effectively at scale. Companies, therefore, should build components with standardized APIs and services that can be easily swapped out for upgrades when they become available. A standard solution reference architecture can then be used to guide the conformance to enterprise architecture principles and standards.
Often, companies will need to make substantive changes to a solution. For instance, when using an off-the-shelf model, it may be necessary to “remodel” or switch to a newer version of a foundation model when it comes available. That’s what many users of Chat GPT-3.5 did when GPT-4 came out. In other cases, companies may need to rebuild their solution completely, such as when new technology or techniques emerge or a strategic shift necessitates a different approach (e.g., moving from a RAG solution to a fine-tuned model). Although these changes go beyond what we would typically consider “application maintenance,” companies should treat them as part of the ongoing maintenance cycle for GenAI products.
To build GenAI products that meaningfully leverage a company’s data assets, organizations need data readiness. Any data, whether structured or unstructured, needs to be clean, machine readable, and protected from risks of exposure. This means deploying preparation and management techniques specific to GenAI solutions.
Our survey of information services providers demonstrated that data readiness increases with scaling experience: two-thirds of the respondents who are experienced in scaling GenAI products report high data readiness. By contrast, only one-third of the participants who lack scaling experience said they are data-ready. (See Exhibit 3.)
Organizations can take the following steps to improve their data preparation and management:
Prepare the data with new techniques. Three techniques—creating metadata, preprocessing unstructured data, and storing data in GenAI ingestion-ready formats, such as vector databases— are all crucial because they help ensure that the GenAI model can interpret the data correctly. Experienced companies are more comfortable with these steps; training can help inexperienced teams get up to speed.
Mitigating new risks. While info services providers have long been careful to protect their IP and data, GenAI products introduce new pathways for exposure. IP concerns include inadvertently deploying models trained on unlicensed proprietary data or selecting a vendor that retains proprietary company data to train their models, among others. Data exposure concerns include potential release of the information services provider’s own data or the customer’s data.
The methods that info services providers use for mitigating IP and data exposure risk are clearly related to their scaling experience. Some 74% of novice organizations restrict the data that can be used in GenAI solutions. While this approach reduces risk, it also curtails product development and limits product depth.
By contrast, experienced organizations deploy mitigation practices that avoid making this tradeoff. These companies pay close attention to vendor selection, for example, by working only with providers who can demonstrate that they have full rights to the data used to train their models. Moreover, they make careful design choices, such as building all or parts of the solution within the company’s own domain so that proprietary data is never sent to a third party. These companies also establish process checks, such as IP risk reviews and user agreements, and they monitor progress early and often to catch any issues that may arise. Such guardrails allow product development to move forward while mitigating the risks associated with using IP and data in GenAI solutions.
A gap is already opening between the information services providers that started early with GenAI and are gaining critical scaling experience—and those who are waiting on the sidelines. Now is the time for companies to get their GenAI products to market and create a competitive advantage that in due course will become unassailable.
The advent of GenAI has given incumbent information services providers the opportunity to leverage their vast data and content assets like never before. By addressing the challenges of accuracy, capability, systems integration, and data preparation, companies can scale their products with confidence and realize the promise of this remarkable innovation.
We wish to thank Krishnan Ramachandran, Daniele Pariente, John Hajjar, and Lauren Hawkins for their contributions to this article.
ABOUT BOSTON CONSULTING GROUP
Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in 1963. Today, we work closely with clients to embrace a transformational approach aimed at benefiting all stakeholders—empowering organizations to grow, build sustainable competitive advantage, and drive positive societal impact.
Our diverse, global teams bring deep industry and functional expertise and a range of perspectives that question the status quo and spark change. BCG delivers solutions through leading-edge management consulting, technology and design, and corporate and digital ventures. We work in a uniquely collaborative model across the firm and throughout all levels of the client organization, fueled by the goal of helping our clients thrive and enabling them to make the world a better place.
© Boston Consulting Group 2024. All rights reserved.
For information or permission to reprint, please contact BCG at permissions@bcg.com. To find the latest BCG content and register to receive e-alerts on this topic or others, please visit bcg.com. Follow Boston Consulting Group on Facebook and X (formerly Twitter).