• Skip to main content
  • Skip to primary sidebar
  • Skip to footer
  • Home
  • Services
    • Vendor Advisory Services
    • IT Advisory Services
    • Business Advisory Services
    • Serious Insights Agile Thinking Workshops
    • Innovation Workshops
    • Serious Insights Keynotes
    • Strategy Advisory Services
    • Thought Leadership & Content Marketing
  • Reviews
    • All Hardware Reviews
    • Headphone Reviews
    • USB-C Hub Reviews
    • SeriousPop.Tech
    • Software Reviews
  • Advisory Research
    • Serious Insights on AI
    • Serious Insights Interviews
    • Strategy & Scenario Planning
    • Serious Insights on Collaboration
    • Hybrid Work
    • Knowledge Management
    • Management
    • Learning Reimagined
    • Serious Insights: The 10s
    • Special Reports
    • Sponsored Research
    • USG Scenario Planning Videos
  • About Us
    • About Serious Insights
    • About Daniel W. Rasmus
    • Daniel W. Rasmus Appearances
    • Daniel W. Rasmus Videos
    • Clients
    • Headshots
    • Books
      • Management by Design
      • Listening to the Future
      • Twelve Ways to Escape an Alien
      • Older Books
    • Daniel W. Rasmus World Travel
    • Dan’s Quotes
    • Community
    • Site Disclaimer
    • Privacy Policy
  • News
  • Contact Us
    • Contact Us
    • Book Daniel W. Rasmus
    • Serious Bookkeeping
    • Product Evaluation Request Form
    • Wedding Ceremonies
Serious Insights

Serious Insights

Research and reviews from strategist, futurist and analyst Daniel W. Rasmus

Follow Us

  • Facebook
  • X
  • LinkedIn
  • YouTube
  • Instagram

LLM Proliferation Will Challenge Emerging Testing Market

June 27, 2024 by Daniel W. Rasmus Leave a Comment

LLM Proliferation Will Challenge Emerging Testing Market

Cover image from Meta’s llama 3 via a prompt by the author.

The advent of smaller, more efficient LLMs will result in an even more rampant LLM proliferation as they become available to run on smaller and smaller devices. At the same time, problems with models, from safety to misinformation and bias, have created a nascent market for testing LLMs. Most testing will be aimed at large platforms and significant enterprise models. Most small models will likely evade testing, which may open new threat vectors as they embed and spread.

Tiny LLM Proliferation

Researchers at UC Santa Cruz found a way to run LLMs with the power equivalent to a light bulb. Microsoft Research in Asia also announced energy-efficient 1-bit LLMs.

These small LLMs, and all of the other non-web-based LLMs that can easily be downloaded and run in tools like LLM Studio, demonstrate the multiplication of models disconnected from infrastructure.

What I mean by that is that these tools will be deployed as local, one-off instances, often by individuals, who will trust, at least to some degree, any claims made by those from whom they acquired the code (if any claims are made at all).

Let the testing begin

As these small LLMs start their inevitable spread, companies like Haize Labs are promising to test LLMs into submission, finding all the flaws in their many nooks and crannies. I’m skeptical of that claim, given that even the developers aren’t sure where all the nooks and crannies are located. And new models will change even the known flaws.

LLM Proliferation Will Challenge Emerging Testing Market
But we don’t want to be tested!
via Dalle-2 and Microsoft Copilot Designer

For enterprises, an investment in testing will require a target environment employed for mission-critical systems. Test it. Ensure safety. Use it day after day as it is. If it changes, it needs to be retested. Testing will become an ongoing cost associated with enterprise-quality AI systems.

Testing a target platform, however, is very different than testing hundreds or thousands of small, often open-source LLMs. Sure, some forms of automation will evolve, but testing will be voluntary. Many will likely never be tested before they become obsolete, and even when they do become obsolete, they will still be available for download. Some will avoid testing because they specifically violate the norms that testing firms seek to enforce.

I have pointed out the issue of LLM metadata and management before. Many think about the big platforms as being “The AI”—services that can be tested by developers who can be held accountable if they violate trust or law. The small LLMs are completely decoupled. They do and will significantly outnumber the large platform services from Microsoft, OpenAI, Antropic, Google, Apple, and others. Their authors may be hard to identify. They may be stored in places that don’t require metadata about how or if they have been tested.

Testing will only work with AI that stands still long enough to be tested

As the AI community deals with this two-sided conundrum, small LLMs are already common. Very common. While they may safely execute on local machines without much threat to enterprise applications, they still create information that may be copied and pasted into enterprise documents. 

AI Training programs need to take these tools into account. On one hand, they can be inexpensive tools for learning. On the other hand, they can be risky partners that may offer up incorrect business information or expose unknowing end users to disturbing experiences and false information. Businesses and users should assume that most models have not been tested.

These “tiny” or “micro” AI models will appear everywhere over the next few months. I speculate that AI testing certification programs will emerge. These testing services will put a stamp on a model instance. This is likely a new business segment for AI startups. Eventually, revenue models of tested models may drive some adoption behavior, but model development will likely be wild and rampant over the next several years. Those on the bleeding edge will not want tested models any more than they want models constrained by guardrails.

Innovation in AI will continue to run afoul of safety, and the innovation of tiny, easily downloaded models that run on the smallest devices will make understanding those models’ features, capabilities, and safety a growing source of concern and confusion.

Created by Siipkan Creativefrom the Noun Project

AI icon by Siipkan Creative from Noun Project (CC BY 3.0)

Did you enjoy LLM Proliferation Will Challenge Emerging Testing Market? Please leave a comment, ask a question or like the post.

For more serious insights on AI, click here.

Share this post:

  • Share on X (Opens in new window) X
  • Share on LinkedIn (Opens in new window) LinkedIn
  • Share on Facebook (Opens in new window) Facebook
  • Email a link to a friend (Opens in new window) Email
  • Print (Opens in new window) Print
  • Share on WhatsApp (Opens in new window) WhatsApp
  • Share on Bluesky (Opens in new window) Bluesky
  • More
  • Share on Tumblr (Opens in new window) Tumblr
  • Share on Pinterest (Opens in new window) Pinterest

Like this:

Like Loading…

Related

Filed Under: AI

Reader Interactions

Leave a ReplyCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

Subscribe to Serious Insights

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 7,849 other subscribers

Download the 2026 State of AI Report

Amazon Associate

As an Amazon Associate, I earn from qualifying purchases.

Hit Amazon Haul for Amazing Discounts.

Also, take a look at these links for additional Amazon discounts.

Today’s Deals.
Up to 80% Off
Crazy Low-Priced Finds
Under $5
Brand Scores

Dan’s poetry. Only on Kindle. Read today!

Top Posts

  • JBL Tour Pro 2 Review: Excellent Headphones That Crush With Their NextGen Case
    JBL Tour Pro 2 Review: Excellent Headphones That Crush With Their NextGen Case
  • JLab Epic Air Sport ANC Gen 2 Review: Sports Earbuds that Go the Extra Mile
    JLab Epic Air Sport ANC Gen 2 Review: Sports Earbuds that Go the Extra Mile
  • Tozo HT2 ANC Headphones Review: Inexpensive Headphones That Impress for the Price
    Tozo HT2 ANC Headphones Review: Inexpensive Headphones That Impress for the Price
  • Jabra Elite 10 Earbuds Review: The Jabra Flagship Continues to Improve on Comfort and Features
    Jabra Elite 10 Earbuds Review: The Jabra Flagship Continues to Improve on Comfort and Features
  • 12 Hybrid Work Fears Managers Must Face
    12 Hybrid Work Fears Managers Must Face

Buy my space adventure only on Kindle.

Recent Comments

  • JBL Tour Pro 2 Review: Worth It? Specs, Comparison & More - Coastal Journal on JBL Tour Pro 2 Review: Excellent Headphones That Crush With Their NextGen Case
  • AI PCs Want Higher Labels Than AI PC – blog.aimactgrow.com on Acer Aspire 16 AI Qualcomm Review: Snapdragon X Value Laptop with Copilot+ Trade-offs
  • AI PCs Need Better Labels Than AI PC on Acer Aspire 16 AI Qualcomm Review: Snapdragon X Value Laptop with Copilot+ Trade-offs
  • OWC Thunderbolt Dock (14-Port) Review: One Dock, and One Cable, to Rule Them All on EZQuest USB-C Slim Gen 2 Hub Adapter 6-in-1 Review: A Speedy Modern Hub for Modern Work
  • Lenovo’s Qira is a Bet on Ambient, Cross-device AI—and on a New Kind of Operating System on “The Future of AI Isn’t What You Think” from Foxit Featuring a Daniel W. Rasmus Interview

Footer

Sitemap

  • Blogs
  • Book Daniel W. Rasmus
  • About Daniel W. Rasmus
  • Serious Insights LLC Disclaimer
  • Privacy Policy

Archives

Tag Cloud

ABC Apple AR artificial intelligence Big Data Buffy the Vampire Slayer BusinessWeek Cengage CIO Magazine CIOs Cisco context coronavirus Customer Service Dell Disney Disneyland earbud review Enterprise 2.0 facebook Fast Company Feedback loops Harvard Business Review HBR HP IBM Innovation Instagram iPhone case JBL Kindle Knowledge Management life-long learning Logitech Management By Design Microsoft mission statement Netflix New Scientist Nokia scenario planning Star Trek Stephen Elop Thought Leadership VR

Copyright 2009-2026 Serious Insights LLC | Log in

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

%d
    Powered by  GDPR Cookie Compliance
    Privacy Overview

    This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

    Strictly Necessary Cookies

    Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.