
Mapping the AI Toolchain – 2024 Perspective


Written by Matt Bornstein, former principal at Blumberg Capital, in 2018. 

2024 perspective by Roy Lowrance, advisor at Blumberg Capital. Roy helped design New York University’s Center for Data Science, a research and teaching center for artificial intelligence and big data.

Setting the Scene

2018 Perspective:

The first wave of artificial intelligence has been about experts: brilliant technologists doing cutting-edge research and building advanced systems in places like Silicon Valley.

The second wave of AI will be about practitioners: traditional developers becoming AI rockstars and addressing a wide range of business problems.

The Democratization of AI

During this transition, we believe AI — particularly deep learning — will begin to resemble a general-purpose computing platform, a topic we explored in this Forbes article. But a new set of tools will be necessary to make that vision a reality. To help advance the conversation, we published an alpha landscape of the emerging AI Toolchain:

(AI Toolchain landscape; see methodology at end of post)

2024 Perspective:

As predicted, access to AI has been democratized through deep learning models, which gave rise to a new wave of technology exemplified by ChatGPT and similar systems. The advent of large language models is also having a bigger impact than traditional machine learning models, at least in the near term, because they do not require programmers.

For example, with ChatGPT-4, a non-programmer can upload relevant documents and use those documents as the basis for prompt completion. We anticipate that this wave of document generation will be an area for rapid progress in the next several years. Longer-term, we expect that traditional machine learning — creating predictions from data — will have more value than document generation. Fully leveraging these predictions necessitates a redesign of business processes, however, and this redesign is likely to pose a significant constraint as predictions are integrated into business frameworks.
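
The same document-grounded pattern can also be expressed programmatically. Below is a minimal Python sketch of assembling uploaded documents into a prompt; `call_llm` is a hypothetical placeholder for whichever LLM API is available, and the document snippets are invented for illustration.

```python
# Minimal sketch of document-grounded prompt completion.
# `call_llm` is a hypothetical stand-in for an LLM API, not a real library call.

def build_prompt(documents: list[str], question: str) -> str:
    """Concatenate the user's documents into the prompt so the model
    answers from that material rather than from general knowledge."""
    context = "\n\n---\n\n".join(documents)
    return (
        "Answer the question using only the documents below.\n\n"
        f"Documents:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

def call_llm(prompt: str) -> str:  # hypothetical placeholder
    raise NotImplementedError("Wire this to the LLM provider of your choice.")

if __name__ == "__main__":
    docs = ["Q3 revenue grew 12% year over year.", "Churn fell to 3.1% in Q3."]
    print(build_prompt(docs, "How did the business perform in Q3?"))
```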

To decide on the redesign, a team from the business needs to understand the current workings of their products and processes and explore how these could be enhanced using large language models (LLMs). This requires a comprehensive grasp of what LLMs are capable of now and their potential future capabilities. Typically, a Chief Technology Officer understands the technology, while a business expert comprehends why customers value the current products and the rationale behind existing processes. These parties must collaborate to envision the potential tasks for the LLM, including the prompts it would receive and the responses it would generate. They need to thoroughly consider how these changes would modify the products and processes and, most importantly, how customers would perceive and value these changes.

Multiple candidate prompt-response scenarios should be explored due to the rapid advancements in LLM capabilities. A one-time implementation approach is insufficient. Instead, the team should adopt a cycle of designing and deploying solutions, assessing their impact on the business, and continuously monitoring LLM advancements. With significant improvements occurring every few months, this iterative process ensures that the business remains at the forefront of technology integration and customer satisfaction.

Why AI tooling?

2018 Perspective:

Artificial intelligence has captured the world’s attention based on its promise to generate value from big data and to deliver a new breed of intelligent applications. Enterprises, venture capitalists, academic institutions and governments are betting billions of dollars — with good reason — that AI will be a competitive advantage in the coming decades.

Today’s reality, though, is that AI is in the earliest stages of development and adoption — what Kai-Fu Lee calls the “first wave.”1 A recent O’Reilly survey found that half the organizations in its data-savvy audience are still “exploring” or “just looking” at machine learning.2 That number is significantly higher among small and medium-sized organizations.3 Anecdotally, only a select few (e.g., tech giants, hedge funds, intelligence agencies) are seeing real value.

The gap between expectation and reality is driven, in large part, by the difficulty of getting AI to work in practical use cases and at scale. One ML practitioner at Lyft calls this the “primordial soup phase” of AI.4 An IDC survey found that 77 percent of enterprises have “hit a wall” with AI infrastructure, many more than once.5 This problem is made more acute by the shortage of talented AI developers and unclear ownership of AI initiatives.

Delivering on the potential of artificial intelligence will require a dramatic improvement in the systems and tools that help AI developers — from the most experienced to the newly engaged — do their jobs effectively.

2024 Perspective:

Not only must AI tools improve, but so must the proficiency of managers and business analysts. What is required are business professionals adept at discerning which predictions could be made with traditional machine learning and which documents could be created with generative AI. Equally crucial is understanding how these predictions and documents can be seamlessly integrated into products and processes to generate tangible value. The bottleneck is shifting from possessing robust tools to cultivating a deep comprehension of technical possibilities and business relevance across various organizational domains.

Compute: The Gold Rush Is On

2018 Perspective:

Infrastructure has played a central role in the AI revolution from the start.

Landmark results in deep learning, including the 2012 AlexNet paper, relied on the massively parallel computing resources of graphics processing units (GPUs) to train networks that were larger, deeper and more powerful than previously possible.6

Fast-forward to 2018, and AI compute technologies have become an explosive area of development and investment. NVIDIA is the clear leader, powering the majority of AI training workloads and surpassing $100 billion in market capitalization. Intel is playing catch-up by acquiring companies like Nervana, Movidius and Altera, with the company’s first “neural network processor” expected in late 2019.7 The large cloud providers (AWS, Azure, Google) are developing chips as well, but so far have found only niche adoption.

Entrepreneurs believe they can beat the incumbents to the punch. More than a dozen companies in this category have raised over $800 million in venture capital. Most of these companies are pursuing new architectures for data center chips used for training and inference. Some others are experimenting with unique power/performance profiles that target edge devices, a key growth area not yet dominated by large vendors.

Few (if any) startups have delivered a commercial AI chip so far. Going to market will be difficult, requiring substantial support from device manufacturers, framework developers and investors. But the opportunity is lucrative. Competition in AI compute technologies will likely continue to heat up.

2024 Perspective:

The AI hardware market has been largely monopolized by a handful of major players, limiting opportunities for newcomers. A new entrant not only needs a superior approach but also faces the challenge of persuading developers to adopt their new devices — both difficult tasks.

Modeling: Frameworks are Mature, But Development Environments are Works-in-progress

2018 Perspective:

Innovative new chips need intuitive programming interfaces and widespread developer support to succeed.

AI frameworks have already reached a reasonably mature state, in some ways outpacing the development of the underlying chips. Tensorflow is emerging as a de facto standard, with a number of good alternatives including Pytorch and H2O.8 This is part of a long-running trend redefining the idea of a computing platform in both traditional and AI development: where developers once targeted a particular processor or operating system, they now target a framework instead.
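
To make the “framework as platform” point concrete, here is a minimal sketch of a model definition written against PyTorch, one of the frameworks named above. The layer sizes and dummy input are illustrative only; the same code runs on CPU or GPU without change, with the framework handling the hardware mapping.

```python
# Minimal sketch: a small classifier defined against PyTorch rather than
# against any particular chip or operating system.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Flatten(),               # e.g. 28x28 grayscale images -> 784 features
    nn.Linear(28 * 28, 128),
    nn.ReLU(),
    nn.Linear(128, 10),         # 10 output classes
)

device = "cuda" if torch.cuda.is_available() else "cpu"
model = model.to(device)

# A single forward pass on a batch of dummy data, just to show the interface.
x = torch.randn(32, 1, 28, 28, device=device)
logits = model(x)
print(logits.shape)  # torch.Size([32, 10])
```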

Systems to help developers write models, however, have room to grow. Popular workbenches like Domino make it easy to access modeling tools and collaborate on projects. IDEs like RStudio and notebooks like Jupyter enable an iterative model construction process. These tools, though effective for their intended purposes, do not assist with unique AI needs, such as introspecting and interactively adjusting data.9 There is a big opportunity to build the “home screen” for AI developers.

Automation of the modeling process, likewise, is still more tailored to traditional data science, with DataRobot the apparent leader. Pioneering, AI-specific efforts like Google’s AutoML or TPOT are interesting experiments to test the level of automation developers find valuable. There is skepticism among some practitioners, though, that these types of solutions will work reliably or deliver major benefits.10 Automated modeling remains an area of ongoing technology and product research.

2024 Perspective:

In 2024, it seems that everyone who needs to program a traditional machine learning model has access to a software library that makes that possible. Educational institutions are beginning to offer basic machine learning courses to undergraduate students. For example, NYU and others offer a course intended for freshmen that teaches how to use off-the-shelf libraries to program simple machine learning models. Another example is Miami Dade College, which has recently offered a series of certificates leading up to a BA in AI. The courses are designed for practical application of AI ideas, so they are heavy on software development and light on mathematics.
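
For a sense of what such an introductory course covers, here is a minimal sketch using scikit-learn’s off-the-shelf components on a bundled dataset. The choice of dataset and model is illustrative, not drawn from any particular syllabus.

```python
# Minimal sketch: training and evaluating a simple off-the-shelf model
# with scikit-learn, the kind of exercise an introductory course might use.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

print("test accuracy:", accuracy_score(y_test, model.predict(X_test)))
```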

 

Training: The Burning Platform Need

2018 Perspective:

There is an old adage, from sci-fi writer Arthur C. Clarke, that “any sufficiently advanced technology is indistinguishable from magic.”

If there is magic in deep learning, it happens at the training step. Starting with just a few lines of code (a model definition), and without explicit human intervention, training produces a system capable of performing advanced computing tasks (e.g., classification) across a wide range of use cases.
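
A minimal sketch of that “few lines of code” idea: a model definition plus a generic training loop in PyTorch. The synthetic data and hyperparameters are illustrative, not a recommended recipe.

```python
# Minimal sketch: a model definition plus a generic training loop.
# Synthetic data and hyperparameters are illustrative only.
import torch
import torch.nn as nn

# The "few lines of code": the model definition.
model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

# Synthetic binary classification data standing in for a real dataset.
X = torch.randn(512, 20)
y = (X[:, 0] > 0).long()

# Training: repeated forward pass, loss, backward pass, parameter update.
for epoch in range(20):
    optimizer.zero_grad()
    loss = loss_fn(model(X), y)
    loss.backward()
    optimizer.step()

print("final training loss:", loss.item())
```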

Of course, magic is not necessarily a good thing. Training is mostly a trial-and-error process today. Results are notoriously difficult to predict or to reproduce, owing in part to the complex and non-obvious relationships between various layers of a neural network. Experiment management tools like Weights & Biases and Comet address this problem by tracking, comparing and visualizing training runs (“experiments”). This attempt to impose order and standards is much-needed, especially by larger AI teams, but no vendor has established a clear lead.

Training AI models also requires significant computing resources and highly skilled people to optimize the process. Resource management companies like SigOpt and Determined aim to make training more efficient, powerful and automated. Initial features include hyperparameter tuning, GPU sharing, cold/warm-starting and other advanced training paradigms. These companies are addressing some of today’s most difficult AI development problems and several are gaining significant traction with customers.
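
As one simple example of what these tools automate, the sketch below runs a plain cross-validated grid search over two hyperparameters with scikit-learn. It is not the API of SigOpt, Determined or any other vendor; the grid and model are illustrative.

```python
# Minimal sketch of hyperparameter tuning: an exhaustive grid search with
# cross-validation. Commercial tools automate and scale this kind of search.
from sklearn.datasets import load_digits
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_digits(return_X_y=True)

param_grid = {"C": [0.1, 1, 10], "gamma": [1e-3, 1e-4]}
search = GridSearchCV(SVC(), param_grid, cv=3)
search.fit(X, y)

print("best parameters:", search.best_params_)
print("best cross-validated score:", search.best_score_)
```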

Finally, we can’t lose sight of the data. Deep learning and other AI techniques require large, carefully constructed training datasets, and building them often takes the majority of an AI developer’s time. Data generation companies like Figure Eight or Labelbox address this pain point via software tools, sometimes combined with human services, to annotate (or create) data and make it usable for AI training. Organizations also face increasing needs to standardize training data pipelines and address more nuanced issues like bias, auditability and privacy. Data management companies like Pachyderm and Iterative, sometimes called “data version control” or “git for data,” are working to provide this necessary functionality.
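
The “git for data” idea can be illustrated with a hand-rolled sketch: pin the exact dataset a model was trained on by recording a content hash alongside the run metadata. This is not the API of Pachyderm, Iterative or any other tool, just the underlying concept.

```python
# Minimal sketch of dataset versioning: record which data version produced
# which model, so a training run can be reproduced or audited later.
import hashlib
import json
from pathlib import Path

def dataset_fingerprint(path: str) -> str:
    """Return a SHA-256 hash of the dataset file's contents."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()

def record_training_run(dataset_path: str, model_name: str, out: str = "run.json"):
    """Store the dataset fingerprint alongside the model it produced."""
    Path(out).write_text(json.dumps({
        "model": model_name,
        "dataset": dataset_path,
        "dataset_sha256": dataset_fingerprint(dataset_path),
    }, indent=2))
```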

2024 Perspective:

Getting clean enough data for training traditional machine learning models continues to be a problem. While some startups have made strides in tackling aspects of this issue, the advent of LLMs has facilitated training on vast swathes of internet-scale data—encompassing everything from online text to published books. However, the hurdles extend beyond the sheer scale of data processing; there’s also the task of navigating legal issues surrounding the use of data authored by others, for which the model trainer often lacks proper licensing.

Operations: We Don’t Know What We Don’t Know

2018 Perspective:

Traditional software teams have a robust set of tools to manage the operational aspects of the application lifecycle: test automation, code analysis, CI/CD, APM, A/B testing and many others.

Sophisticated companies like Google, Facebook and Uber have built similar capabilities for AI applications with in-house platforms. The average enterprise, however, lacks the resources and expertise to build these tools and will look to vendors to fill the gap.

Most startups in this category focus on the deployment and monitoring of models in a production setting. This is an area of clear need, analogous to the traditional programming world, that will likely attract substantial IT budgets in the future. The market, however, is still nascent. Many organizations are focused today on modeling and training. Nearly one third report having “no methodology” in place for AI development or lifecycle management.11 Vendors may need to educate their potential customers.

Looking further ahead, AI will also require unique operations systems to reach scaled production in many industries.

Model explainability addresses the “black box” nature of AI applications, helping users understand why a loan was rejected or a news item was promoted. This functionality is fast becoming a legal and/or market requirement across industries, particularly in financial services. Several data management and model deployment companies have made inroads on explainability via sophisticated management of AI pipelines. Finding a general solution, however, is considered an open area of research. DARPA has committed $75 million to an explainable AI (“XAI”) grand challenge that will report its first results in 2019. In the meantime, more practical systems around debugging, model introspection and general visibility represent nearer-term opportunities.
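
As an example of the nearer-term, practical end of this spectrum, the sketch below computes permutation importance with scikit-learn: a simple, model-agnostic way to see which input features a prediction depends on. It is one generic technique, not the approach of any vendor named above; the dataset and model are illustrative.

```python
# Minimal sketch of one simple explainability technique: permutation
# importance measures how much the model's score drops when each input
# feature is shuffled, giving a rough ranking of feature influence.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, random_state=0
)

model = GradientBoostingClassifier(random_state=0).fit(X_train, y_train)

result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=0)
for idx in result.importances_mean.argsort()[::-1][:5]:
    print(f"{data.feature_names[idx]}: {result.importances_mean[idx]:.3f}")
```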

AI security or verification is another non-trivial, unsolved problem. Researchers have found that many AI models are relatively easy to deceive with adversarial data inputs. One team fooled self-driving car algorithms into ignoring stop signs using just a few stickers.12 Others showed this type of technique to be robust across various data types.13 Widespread deployment and acceptance of AI applications by regulatory authorities will require applications to behave as expected and in a safe way, within appropriate guardrails. This is a potentially lucrative opportunity for startups, whether in a horizontal or vertical business model.
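
The stop-sign result builds on the same idea as the classic fast gradient sign method (FGSM), sketched below in PyTorch: nudge each input dimension in the direction that increases the model’s loss. The toy model here is untrained, so the prediction may or may not actually flip; the point is the mechanics of the attack.

```python
# Minimal sketch of the fast gradient sign method (FGSM) for crafting an
# adversarial input. Model, data and epsilon are illustrative only.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 2))
loss_fn = nn.CrossEntropyLoss()

x = torch.randn(1, 10, requires_grad=True)   # a "clean" input
y = torch.tensor([0])                        # its true label
epsilon = 0.1                                # perturbation budget

loss = loss_fn(model(x), y)
loss.backward()

# Perturb the input in the direction of the sign of the loss gradient.
x_adv = (x + epsilon * x.grad.sign()).detach()

print("clean prediction:      ", model(x).argmax(dim=1).item())
print("adversarial prediction:", model(x_adv).argmax(dim=1).item())
```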

2024 Perspective:

Managing the operational aspects of deployed AI-based applications is starting to be addressed by the startup community. In addition to explainability and security, deployed models need to be retrained periodically, and many organizations are now customizing off-the-shelf LLMs for their own products and processes.

However, simple customization is not enough, as competitors can easily replicate these efforts. To create a competitive advantage, startups need to leverage unique data that their competitors do not have access to. This often involves implementing a feedback loop: deploying a customized LLM in a product or process, observing its effectiveness, determining improvements based on collected data, and then incorporating this data into the next iteration of LLM customization. 

ML-as-a-Service: Does One Size Fit All?

2018 Perspective:

AI tooling companies face a persistent question from skeptics: “Why can’t Amazon/Google/Microsoft do the same thing?”

The cloud providers have a credible play in the ML-as-a-service category, including pre-trained models and tightly-integrated toolchains. These companies are well-positioned to sell tooling as a bundled service, since they already provide the infrastructure for many AI applications. They also have massive resources and demonstrated commitment to the AI market.

Cloud provider offerings have several downsides, though:

  1. Lock-in. Most tools published by cloud providers are designed to be used as a complete set on a particular cloud. This is at odds with many AI developers, who prefer to use bespoke stacks with best-in-class tools, and many corporate IT departments, who are pursuing hybrid or multi-cloud IT strategies.
  2. Wrong incentives. Cloud providers – especially Amazon and Google – have the primary incentive to sell more cloud services. Commercial software is not their main line of business and is unlikely to become so. Microsoft may prove the exception to this rule, given their past wins with developer tools and recent acquisition of Github.
  3. One size fits all. Market feedback suggests AI tooling from cloud providers is designed for simpler, more homogenous use cases. This is an advantage today, while most customers are early in their AI capabilities, but may become a liability as AI competency grows and deep learning moves closer to a general computing platform.

It would be a mistake to count out Amazon, Microsoft and Google. They will continue to expand and improve their offerings, addressing some of the current issues. But structural factors suggest they are unlikely to capture the full market, especially as AI becomes more popular and talent more evenly distributed.

ML-as-a-service startups like Floyd and Machinify, meanwhile, aim to beat the cloud providers at their own game, delivering ease-of-use without lock-in. This is an attractive value proposition for many developers and organizations just beginning their AI journeys. Whether these customers continue with an end-to-end approach as they become more sophisticated is a key question.

2024 Perspective:

Eventually, all surviving data processing systems will come with prediction components, where that is feasible. Many things that can be reported on can also be predicted. As these augmentations emerge, the need for stand-alone prediction systems will diminish. 

We see two obstacles to predicting the kinds of data held in accounting and ERP-like systems. The first is the will to do the work: auto-trained machine learning systems available from cloud computing vendors and others are often good enough to make the initial round of predictions, so capability is rarely the barrier. The second obstacle is confidence that the predictions can be monetized. Overcoming this second obstacle requires an understanding of how the business’s products and processes could be modified with accurate-enough predictions.
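
To illustrate the claim that many things that can be reported on can also be predicted, the sketch below reframes a reporting-style monthly series as a one-step-ahead prediction problem with an off-the-shelf regressor. The figures are synthetic and the model deliberately simple.

```python
# Minimal sketch: the same monthly figures a report would show can also be
# used to predict the next month. The series is synthetic; units are arbitrary.
import numpy as np
from sklearn.linear_model import LinearRegression

revenue = np.array([100, 104, 109, 113, 118, 124, 129, 135], dtype=float)

# Supervised framing: predict each month from the previous month.
X = revenue[:-1].reshape(-1, 1)
y = revenue[1:]

model = LinearRegression().fit(X, y)
next_month = model.predict(revenue[-1:].reshape(1, 1))
print("predicted next month:", round(float(next_month[0]), 1))
```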

Conclusions

2018 Perspective:

Seven of the world’s ten largest public companies are tech companies: Apple, Amazon, Microsoft, Alphabet, Facebook, Alibaba, Tencent.14 They have revolutionized industries like media, commerce and communications by building (or building on top of) today’s massive, global digital infrastructure.

Not coincidentally, these companies are also among the world’s most sophisticated AI organizations. They have built expert AI teams, absorbing leading academic labs in the process, and have demonstrated a substantial multiplier effect of AI on their core businesses. Most of these companies are also entering new markets, like transportation and home control, based on proprietary AI research.

For traditional enterprises, it is no longer enough to be “digital.” Software and online services are necessary, but not sufficient, to remain competitive. AI is the next battleground.

A rich ecosystem of startups is forming to address this need. Intelligent applications, built by vendors for a wide range of industries and corporate functions, are an important part of the equation. But enterprises must also create strong internal AI capabilities to generate long-term value.

The AI toolchain is ripe with opportunities to support business leaders – and engineers – in this journey, including:

  • Bringing standards, scale and efficiency to the training process
  • Creating the “home screen” for AI developers
  • Explaining the predictions of advanced AI models and verifying behavior
  • Building better chips for AI inference, especially at the edge
  • Defining the product and GTM strategy to stitch these pieces together

2024 Perspective:

It’s imperative to recognize that the most pressing constraint isn’t technological but human. Successful transformational initiatives hinge on organizational leaders’ comprehension of both the technical possibilities and the realistic outcomes achievable for stakeholders.

_________________________________________________

 

Methodology:

  • The AI Toolchain focuses on new tools designed specifically to support AI applications and development workflows. It avoids conventional data science and big data companies, which play a critical role in AI development but are already well-known and are beginning to consolidate. Some conventional tools (e.g., Domino) are included due to a lack of new, purpose-built alternatives.
  • The landscape also focuses primarily on commercial companies to illustrate the growth of AI tooling as an industry. Some open-source projects (e.g., Tensorflow) are too important to ignore, and other areas (e.g., explainability) have few commercial entrants thus far.
  • Companies are clustered based on their core technological innovations and their place within the AI development workflow. Since this is a nascent market, the categories are not officially recognized by IT analysts like Gartner or Forrester. This is more art than science, and categories will likely evolve as the market matures. We included all the activity we know about, but please let us know if you have a suggestion or proposed update.

Citations:

  1. https://singularityhub.com/2018/09/07/the-4-waves-of-ai-and-why-china-has-an-edge/
  2. https://www.oreilly.com/data/free/state-of-machine-learning-adoption-in-the-enterprise.csp
  3. https://info.algorithmia.com/enterprise-blog-state-of-machine-learning
  4. https://venturebeat.com/2017/10/24/lyfts-biggest-ai-challenge-is-getting-engineers-up-to-speed/
  5. https://www-01.ibm.com/common/ssi/cgi-bin/ssialias?htmlfid=POW03210USEN
  6. https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
  7. https://www.top500.org/news/intel-lays-out-new-roadmap-for-ai-portfolio/
  8. https://towardsdatascience.com/deep-learning-framework-power-scores-2018-23607ddf297a
  9. https://medium.com/@karpathy/software-2-0-a64152b37c35
  10. https://www.fast.ai/2018/07/23/auto-ml-3/
  11. https://www.oreilly.com/data/free/state-of-machine-learning-adoption-in-the-enterprise.csp
  12. https://arxiv.org/abs/1707.08945
  13. https://blog.openai.com/adversarial-example-research/
  14. https://en.wikipedia.org/wiki/List_of_public_corporations_by_market_capitalization#2018

 
