• Skip to main content
  • Skip to primary sidebar
  • Skip to footer
  • Home
  • Services
    • Vendor Advisory Services
    • IT Advisory Services
    • Business Advisory Services
    • Serious Insights Agile Thinking Workshops
    • Innovation Workshops
    • Serious Insights Keynotes
    • Strategy Advisory Services
    • Thought Leadership & Content Marketing
  • Reviews
    • All Hardware Reviews
    • Headphone Reviews
    • USB-C Hub Reviews
    • SeriousPop.Tech
    • Software Reviews
  • Advisory Research
    • Serious Insights on AI
    • Serious Insights Interviews
    • Strategy & Scenario Planning
    • Serious Insights on Collaboration
    • Hybrid Work
    • Knowledge Management
    • Management
    • Learning Reimagined
    • Serious Insights: The 10s
    • Special Reports
    • Sponsored Research
    • USG Scenario Planning Videos
  • About Us
    • About Serious Insights
    • Daniel W. Rasmus
    • Daniel W. Rasmus Appearances
    • Daniel W. Rasmus Videos
    • Clients
    • Headshots
    • Books
      • Management by Design
      • Listening to the Future
      • Twelve Ways to Escape an Alien
      • Older Books
    • Daniel W. Rasmus World Travel
    • Danโ€™s Quotes
    • Community
    • Site Disclaimer
    • Privacy Policy
  • News
  • Contact Us
    • Contact Us
    • Book Daniel W. Rasmus
    • Serious Bookkeeping
    • Product Evaluation Request Form
    • Wedding Ceremonies
Serious Insights

Serious Insights

Research and reviews from strategist, futurist and analyst Daniel W. Rasmus

Follow Us

  • Facebook
  • X
  • LinkedIn
  • YouTube
  • Instagram

Access Innovations Leaders on Semantic Enrichment and Why The Scholarly Publishing Model Is The Blueprint for AI Readiness: A Serious Insights Interview

April 21, 2026 by Sheri McLeish Leave a Comment

Access Innovations Leaders on Semantic Enrichment and Why The Scholarly Publishing Model Is The Blueprint for AI Readiness

And the days of XML Are Numbered

Marjorie (Margie) Hlava and Veronica Showers. Access Innovations Leaders on Semantic Enrichment and Why The Scholarly Publishing Model Is The Blueprint for AI Readiness: A Serious Insights Interview
Marjorie (Margie) Hlava and Veronica Showers from provided portraits via a ChatGPT prompt written by Daniel W. Rasmus.

The AI revolution is exposing a fundamental truth that the scholarly publishing world has known for decades: unstructured data is a liability, not an asset. As enterprises race to build AI-powered tools and chatbots, many are discovering, often painfully, that dumping raw content into a language model produces hallucinations, policy violations, and fabricated citations.

In this Serious Insights interview, Access Innovations founder and Chief Scientist Marjorie (Margie) Hlava and VP of Business Development Veronica Showers make the compelling case that the rigorous content architecture long practiced in scholarly publishing โ€” semantic enrichment, controlled vocabularies, and low-level chunking โ€” is exactly the blueprint every organization needs to make AI work reliably. If you are wrestling with AI readiness or wondering why your early AI pilots underperformed, this conversation delivers both the diagnosis and the prescription.

Top 3 Takeaways

  • AI readiness requires chunking content into small units (200โ€“800 tokens), tagging them with controlled vocabulary, and storing them in vector databases before any AI tool can reliably retrieve the right information.
  • Controlled vocabularies and semantic enrichment solve the disambiguation problem AI cannot handle alone โ€” preserving meaning, context, and provenance through the chunking and ingestion process.
  • Humans remain essential stewards of their knowledge domains; removing them from the semantic enrichment process risks AI outputs built on poorly grounded, domain-agnostic representations.

The Access Innovation Interview

Sheri McLeish: I’ve referred to the scholarly publishing model as a blueprint that enterprise marketers and other publishers must adopt to survive the AI revolution. Could you provide a high-level overview of what this model actually entails and why it works so well?

Veronica Showers: Because of GenAI, we are transitioning from a resource economy to an answer economy. In an answer economy, it’s not about just finding the right documents; it’s about finding the right sections within each document to determine an answer. In this new economy, we need to learn how to function in ways that are optimized for language models, and they don’t like to think in big documents.

They really think in smaller units, anywhere between 200 and 800 tokens. Anything beyond that introduces a lot of noise, so you want to keep things small. The data you have on hand needs to be pre-processed into those smaller units, tagged properly, and stored in a vector database. That is what I call the โ€œbrainโ€, first, before you can build the tools.

Margie Hlava: People are thinking, “Well, I just got my data into XML, and it’s costing me a mint, and now you want me to chunk that up?” Well, yeah, I do. XML provided a structural backbone for production, but it described the form rather than the actual meaning of the content, and you actually didn’t do very much in terms of tagging it with subject metadata from a controlled vocabulary… We’re chunking our data, we’re tagging it at a low level so that we know how to attribute it, and we know what the meaning is.


Sheri McLeish: Why did simply dumping unstructured data into early large language models cause so many headline-making failures?

Veronica Showers: When organizations dumped their data without any real training, the language models had to rely on surface-level semantic similarity, which resulted in chatbots telling users how to break policies or making up falsified references. The models lacked the structured instruction needed to successfully retrieve and apply the right information.


Sheri McLeish: If chunking and low-level semantic tagging are done correctly, does that eliminate the need for heavy XML overhead?

Margie Hlava: Yes, I think if these processes go right, there might not be a need for all that XML overhead. There is a significant movement toward linked data and content profiles that prioritize this lower-level chunking and tagging instead.

Starting Your AI Readiness Efforts

Sheri McLeish: For organizations overwhelmed by a massive backlog of unstructured content, where is the best place to start proving value?

Margie Hlava: You should look at where your unstructured information is costing you the most in manpower or exposing you to litigation liabilities because you cannot easily access your data. Automatically indexing that high-cost data hits the organization in the pocketbook and is a serious place to start.


Sheri McLeish: Is there a different approach to prioritize content if their goal is to build a specific AI tool, like a client-facing chatbot?

Veronica Showers: In that case, you have to work backward from your specific goal. Determine exactly what kind of queries the chatbot needs to answer, and then figure out what specific data needs to be fed into the tool to fulfill it.


Sheri McLeish: In the agency world, we encourage defining content models for clients so their content marketing, product information and customer service materials can dynamically flow into different outputs without manual revisions. How does this modular approach fit into AI readiness?

Veronica Showers: It fits perfectly because the smaller components allow language models to package and repackage that data in many different ways. Tagging data correctly at the chunk level establishes the semantic structure that allows those modular building blocks to be dynamically aggregated and served back to people effectively.

The Importance of Semantic Enrichment and Vocabulary

Sheri McLeish: Taxonomies constantly evolve, so how do you keep these controlled vocabularies accurate over time?

Margie Hlava: You monitor incoming content streams to see if new terms were indexed appropriately or if there was nothing there that the taxonomy could latch onto, which indicates a gap. Terms need to be added to the taxonomy to cover the gaps. For rapidly moving fields like news, you have to concentrate on the topical areas and cannot afford to fall a day behind.


Sheri McLeish: Why is establishing a structured vocabulary so important for disambiguation when feeding AI?

Margie Hlava: To a computer, homonyms look exactly the same. Structuring your vocabulary ensures that the different meanings of words are recognized in their proper context. Adding tags, keywords, or concept labels to the content provides context, enables discovery, and ensures that the meaning in the writing is preserved. English has words with many meanings, and words taken out of context lead to sometimes amusing, but often incredibly incorrect, interpretations of the information presented.

Words have different meanings in different domains. โ€œMercuryโ€, for example, can be an element in chemistry, a planet in astronomy, a god in mythology, an automobile, a messenger, a plant, etc. โ€œLeadโ€ can be a management term, something you use to walk the dog, the inlet of a river to a larger body of water, or an element on the periodic table.

Words also often change labels or meanings quickly in modern discourse, leaving the earlier writings unfindable, buried in old terminology. Take the case of homeless, unsheltered, unhoused, street people, and earlier, hobos, drifters, vagrants. We came up with at least 57 synonyms for this. Laws and research exist for every one of those terms. That is a big search parameter! Or look at when COVID appeared on the scene as Coronavirus, SARS, SARS-CoV-2, Covid-19, etc. How do we keep track of these changes and ensure that we are really doing a full scan of the available research data?

The value of semantic enrichment within AI is that when the data is chunked, tokenized, and fed into vector databases, all links to the word usage (meaning and context) are lost unless we tag that data in the beginning so it is held together throughout the ingestion process by the terminology control that the semantic enrichment provides. The prediction of which word might come next is powerful: it is more powerful with guardrails of a taxonomy or other vocabulary control.


Sheri McLeish: Why is establishing the provenance of these information chunks becoming so critical?

Margie Hlava: Establishing the source or provenance of information, such as using a DOI, is becoming increasingly important for overall data accuracy. It prevents misinterpretation and ensures the AI outputs are grounded in authoritative expertise rather than generalized automation. When items are linked back and attributed to an author, it helps preserve their original intent and keeps the meaning intact. Throughout my career, including my work with the original Dublin Core group, I have focused on establishing syntax like DOIs and contributor role designations (CRediT) to ensure we know exactly who contributed to a paper and why their name is on it.

When items are linked back and attributed to an author, it helps preserve their original intent and keeps the meaning intact.


Sheri McLeish: What are your impressions of the market demand for structured content, considering its differing maturity in domains like marketing compared to technical publishing?

Veronica: The work itself is universal for any organization that wants to create a product related to AI. No matter whether you’re a publisher or a marketing firm, you still have to go through that process of taking every document, breaking it up into small components, and then tagging that document properly to instruct the language model on how to retrieve the right portions of documents and how each component should be understood.


Sheri McLeish: With AI automating so many repetitive tasks, what is the ongoing role of the human in the loop?

Margie Hlava: AI is a sophisticated pattern recognition system, but it lacks deductive reasoning and the ability to understand if information is truly complete. While automated indexing tools can certainly handle repetitive tasks, keeping a human in the loop is essential because human expertise is absolutely required to make intellectual decisions and apply common-sense reasoning.

When you are preparing content for AI, the semantic enrichment and structuring really need to be done by the people who own and understand the knowledge domain. It is human expertise that preserves the conceptual architecture, logic, and complex relationships that a specific discipline depends on. If you remove the human from this process and let a vendor’s model guess at the meaning of your field, you run the severe risk that AI outputs will be derived from poorly grounded representations that lack true domain expertise.

Ultimately, in a world where AI is increasingly mediating how knowledge is discovered and interpreted, keeping a human in the loop to safeguard the true meaning of the discipline remains our most important responsibility.

Yes, there are several additional insights from the sources that expand on this topic. Both Margie and Veronica emphasized that the shift toward AI is changing how organizations capture internal expertise, turning everyday work into structured data.

The (Past) and Future of Knowledge and the Human Role

Sheri McLeish: Margie, having followed technology changes from nine-track tapes to where we are today, do you see historical similarities in how people are reacting to AI?

Margie Hlava: Yes, technology is currently acting as either a positive or a disruptive influence, depending on how you look at it. Right now, people are curious, cautious, and quite afraid of what it will do to their publishing models. Interestingly, back in 1964, in response to the space race after Sputnik went up in 1957, the Council on Scientific and Technical Information (COSATI) report outlined a great deal of the information structures we are seeing todayโ€”they just didn’t have the computing horsepower back then.


Sheri McLeish: How does the rapid emergence of AI compare to past technological shifts we’ve experienced?

Veronica Showers: It reminds me of the dot-com era. In both cases, the technology was here to stay, and those who learned how to use it were going to succeed. I started learning how AI processes information and how to build agents, which brought me to Access Innovations. Because of how crucial data structuring is for these systems, I predict there will be an absolute boom in knowledge management roles specifically related to AI projects to ensure AI is grounded in structured knowledge rather than unstructured content.

Margie Hlava: We are entering a moment where organizations and publishers must stop thinking of themselves as simply providers of articles and start recognizing themselves as stewards of knowledge. Furthermore, having this structured knowledge provides a massive advantage for new, junior employees, allowing them to soak up all available internal documentation to quickly become a reliable contributing part of the organization.


Sheri McLeish: How is this shift toward semantic enrichment changing internal knowledge management, especially when it comes to retaining expertise as employees are let go or retire?

Margie Hlava: Capturing the knowledge of the people who are currently working, or those who are leaving or retiring, is an incredibly important field. The focus has to be on structuring the metadata rather than just the data itself, as this information is now being packaged and repackaged in many different ways.

Veronica Showers: Organizations are taking a taxonomic or tagging approach to move away from big blobs of text toward structured outlines. The best framework for capturing this knowledge is for employees to document what they are doing as they go, similar to how research and development firms use lab manuals. Even everyday internal assets like transcripts, emails, and podcasts can be tagged and chunked in an AI setting to be incorporated into an internal knowledge base.


Sheri McLeish: This was a great conversation, and I appreciate the time that you have spent with me. I think there is a lot that we were able to dig into, and hopefully, this assists those who are looking for this type of expertise to find you.

Veronica Showers: Good to talk to you, Sheri, thank you.

About Marjorie (Margie) Hlava and Veronica Showers
Access Innovations

Marjorie (Margie) Hlava

Marjorie (Margie) Hlava is the founder, Chairman, and Chief Scientist of Access Innovations, who began her career as an information engineer at NASA and has helped develop around 600 taxonomies.

Veronica Showers is the VP of Business Development at Access Innovations, bringing over 20 years of experience in scholarly publishing and specializing in preparing content for generative AI through ontology application.

Access Innovations has long been a powerhouse in the scholarly publishing industry, a sector where structured content, semantic enrichment, and rigorous taxonomies have been acknowledged as necessities for decades. In this Serious Insights interview, we discuss why the scholarly publishing model is the blueprint that modern marketers and enterprise publishers must adopt to survive the AI revolution.

About Sheri McLeish

Sheri McLeish is an Associate Analyst at Serious Insights, specializing in Content Strategy and AI Search. With a background as a Forrester Analyst and digital agency leader, she helps organizations navigate the shift from traditional SEO to Authority Architecture.

For more serious insights on AI, clickย here.

Did you find thisย interview with Marjorie Hlava and Veronica Showers useful? If so, please like, share or comment. Thank you!

The cover image is AI-generated (Adobe Firefly) from a Serious Insights prompt referencing source photos provided by the participants.

Share this post:

  • Share on X (Opens in new window) X
  • Share on LinkedIn (Opens in new window) LinkedIn
  • Share on Facebook (Opens in new window) Facebook
  • Email a link to a friend (Opens in new window) Email
  • Print (Opens in new window) Print
  • Share on WhatsApp (Opens in new window) WhatsApp
  • Share on Bluesky (Opens in new window) Bluesky
  • More
  • Share on Tumblr (Opens in new window) Tumblr
  • Share on Pinterest (Opens in new window) Pinterest

Like this:

Like Loadingโ€ฆ

Related

Filed Under: AI, Interview, Knowledge Management

Reader Interactions

Leave a ReplyCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

Subscribe to Serious Insights

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 7,849 other subscribers

Download the 2026 State of AI Report

Amazon Associate

As an Amazon Associate, I earn from qualifying purchases.

Hit Amazon Haul for Amazing Discounts.

Also, take a look at these links for additional Amazon discounts.

Todayโ€™s Deals.
Up to 80% Off
Crazy Low-Priced Finds
Under $5
Brand Scores

Danโ€™s poetry. Only on Kindle. Read today!

Top Posts

  • JBL Tour Pro 2 Review: Excellent Headphones That Crush With Their NextGen Case
    JBL Tour Pro 2 Review: Excellent Headphones That Crush With Their NextGen Case
  • JLab Epic Air Sport ANC Gen 2 Review: Sports Earbuds that Go the Extra Mile
    JLab Epic Air Sport ANC Gen 2 Review: Sports Earbuds that Go the Extra Mile
  • Tozo HT2 ANC Headphones Review: Inexpensive Headphones That Impress for the Price
    Tozo HT2 ANC Headphones Review: Inexpensive Headphones That Impress for the Price
  • Jabra Elite 10 Earbuds Review: The Jabra Flagship Continues to Improve on Comfort and Features
    Jabra Elite 10 Earbuds Review: The Jabra Flagship Continues to Improve on Comfort and Features
  • 12 Hybrid Work Fears Managers Must Face
    12 Hybrid Work Fears Managers Must Face

Buy my space adventure only on Kindle.

Recent Comments

  • JBL Tour Pro 2 Review: Worth It? Specs, Comparison & More - Coastal Journal on JBL Tour Pro 2 Review: Excellent Headphones That Crush With Their NextGen Case
  • AI PCs Want Higher Labels Than AI PC – blog.aimactgrow.com on Acer Aspire 16 AI Qualcomm Review: Snapdragon X Value Laptop with Copilot+ Trade-offs
  • AI PCs Need Better Labels Than AI PC on Acer Aspire 16 AI Qualcomm Review: Snapdragon X Value Laptop with Copilot+ Trade-offs
  • OWC Thunderbolt Dock (14-Port) Review: One Dock, and One Cable, to Rule Them All on EZQuest USB-C Slim Gen 2 Hub Adapter 6-in-1 Review: A Speedy Modern Hub for Modern Work
  • Lenovoโ€™s Qira is a Bet on Ambient, Cross-device AIโ€”and on a New Kind of Operating System on “The Future of AI Isnโ€™t What You Think” from Foxit Featuring a Daniel W. Rasmus Interview

Footer

Sitemap

  • Blogs
  • Book Daniel W. Rasmus
  • About Daniel W. Rasmus
  • Serious Insights LLC Disclaimer
  • Privacy Policy

Archives

Tag Cloud

ABC Apple AR artificial intelligence Big Data Buffy the Vampire Slayer BusinessWeek Cengage CIO Magazine CIOs Cisco context coronavirus Customer Service Dell Disney Disneyland earbud review Enterprise 2.0 facebook Fast Company Feedback loops Harvard Business Review HBR HP IBM Innovation Instagram iPhone case JBL Kindle Knowledge Management life-long learning Logitech Management By Design Microsoft mission statement Netflix New Scientist Nokia scenario planning Star Trek Stephen Elop Thought Leadership VR

Copyright 2009-2026 Serious Insights LLC | Log in

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

%d
    Powered by  GDPR Cookie Compliance
    Privacy Overview

    This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

    Strictly Necessary Cookies

    Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.