Categories
Content Engineering

Content models: lessons from LEGO

Content models can be either a friend or foe.  A content model can empower content to become a modular and flexible resource when developed appropriately. Models can liberate how you develop content and accelerate your momentum.

However, if the model isn’t developed correctly, it can become a barrier to gaining control over your content.  The model ends up being hard to understand, the cause of delay, and eventually an albatross preventing you from pivoting later.  

Models are supposed to be the solution, not the problem

Content modeling can be challenging. Those involved with content modeling have likely heard stories about teams wrestling with their content model because it was too complex and difficult to implement. 

Some common concerns about content modeling include:

  • There’s not enough time to develop a proper content model.
  • We don’t know all our requirements yet.
  • People don’t understand why we need a content model or how to develop one.
  • Most of the content model doesn’t seem relevant to an individual writer.
  • Someone else should figure this out.

These concerns reflect an expectation that a content model is “one big thing” that needs to be sorted out all at once in the correct way, what might be called the monolithic school of content modeling. 

Rather than treat content models as monolithic plans, it is more helpful to think of them as behaving like LEGO. They should support the configuration of content in multiple ways.

Yet, many content models are designed to be monolithic. They impose a rigid structure on authors and prevent organizations from addressing a range of needs.  They become the source of stress because how they are designed is brittle.

In an earlier post, I briefly explored how LEGO’s design supports modularity through what’s called “clutch power.” LEGO can teach us insights about bringing modularity to content models. Contrary to what some believe, content models don’t automatically make content modular, especially when they are missing clutch power. But it’s true that content models can enable modularity. The value of a content model depends on its implementation. 

A complex model won’t simplify content delivery.  Some folks mistakenly think that the content model can encapsulate complexity that can then be hidden from authors, freeing them from the burdens of details and effort. That’s true only to a point.  When the model gets too complex for authors to understand and when it can’t easily be changed to address new needs, its ability to manage details deteriorates. The model imposes its will on authors rather than responding to the desire of authors.

The trick is to make model-making a modular process instead of a top-down, “here’s the spec, take it or leave it” approach. 

Don’t pre-configure your content

LEGO pioneered the toy category of multipurpose bricks. But over time, they have promoted the sale of numerous single-purpose kits, such as one for a typewriter that will “bring a touch of nostalgia to your home office.”  For $249.99, buyers get the gratification of knowing exactly what they will assemble before opening the package.  But they never experience the freedom of creating their own construction.

LEGO typewriter, 2079 pieces (screenshot)

The single-purpose kits contain numerous single-purpose bricks. The kit allows you to build one thing only. Certain pieces, such as the typewriter keys and a black and red ink spool ribbon, aren’t useful for other applications. When the meme gets stale, the bricks are no longer useful.

One of the most persistent mistakes in design is building for today’s problems with no forethought as to how your solution will accommodate tomorrow’s needs. 

Many content models are designed to be single-purpose.  They reflect a design frozen in time when the model was conceived – often by an outside party such as a vendor or committee. Authors can’t change what they are permitted to create, so the model will often be vague and unhelpful to not overly constrain the author. They miss out on the power of true modularity. 

When you keep your model fluid, you can test and learn.

Make sure you can take apart your model (and change it)

Bent Flyvbjer and Dan Gardner recently published a book on the success or failure of large projects called How Big Things Get Done.  They contrast viewing projects as “one big thing” versus “many small things.”

Flyvbjer and Gardner cite LEGO as a model for how to approach large projects. They say: “Get a small thing, a basic building block. Combine it with another and another until you have what you need.”

The textbook example they cite of designing “one big thing” all at once is a nuclear power plant.  

Flyvbjer and Gardner comment: “You can’t build a nuclear power plant quickly, run it for a while, see what works and what doesn’t, then change the design to incorporate the lessons learned. It’s too expensive and dangerous.”

Building a content model can seem like designing a nuclear power plant: a big project with lots of details that can go wrong. Like designing a nuclear power plant, building a content model is not something most of us do very often.  Most large content models are developed in response to a major IT re-platforming.  Since re-platformings are infrequent, most folks don’t have experience building content models, and when a re-platform is scheduled, they are unprepared. 

Flyvbjer and Gardner note that “Lacking experimentation and experience, what you learn as you proceed is that the project is more difficult and costly than you expected.” They note an unwanted nuclear power plant costs billions to decommission.  Similarly, a content model developed all at once in a short period will have many weaknesses and limitations. Some content types will be too simple, others will be too complex.  Some will be unnecessary ultimately, while others will be overlooked.  In the rush to deliver a project, it can be difficult to even reflect on how various decisions impact the overall project.

You will make mistakes with your initial content model and will learn what best supports authoring or frontend interaction needs, enables agile management of core business messages and data, or connects to other IT systems.  And these requirements will change too. 

Make sure you can change your mind.  Don’t let your IT team or vendor tell you the model is set in stone.  While gratuitous changes should be avoided – they generate superfluous work for everyone – legitimate revisions to a content model should be expected.  Technical teams need to develop processes that can address the need to add or remove elements in content types, create new types, and split or merge types. 

Allow authors to construct a range of outputs from basic pieces

In LEGO, the color of bricks gives them expressive potential. Even bricks of the same size and shape can be combined to render an experience.  I recently visited an exhibit of LEGO creations at the National Building Museum in Washington, DC.  

Mona Lisa in LEGO by Warren Elsmore at National Building Museum

Much online content is developed iteratively.  Writers pull together some existing text, modify or update it, add some images, and publish it.  Publishing is often a mixture of combining something old with something new.  A good content model will facilitate that process of composition. It will allow authors to retrieve parts they need, develop parts they don’t yet have, and combine them into a content item that people read, watch, or listen to.

Content modeling is fundamentally about developing an editorial schema.  Elements in content types represent the points authors want to make to audiences. The content types represent larger themes. 

A content model developed without the input of authors isn’t going to be successful. Ideally, authors will directly participate in the content modeling process. Defining a content type does not require any software expertise, and at least one CMS has a UI that allows non-technical users to create content types. However, it remains true that modeling is not easy to understand quickly for those new to the activity. I’m encouraged by a few vendor initiatives that incorporate AI and visualization to generate a model that could be edited. But vendors need to do more work to empower end-users so they can take more control over the content model they must rely on to create content. 

Keep logic out of your model 

LEGO is most brilliant when users supply the creativity rather than rely on instructions from the LEGO corporation.  

The DIY ethos of LEGO is celebrated on the Rebrickable website, where LEGO enthusiasts swap concepts for LEGO creations.

Rebrickable website (screenshot)

Rebrickable illustrates the principle of decoupling content from its assembly. LEGO bricks have an independent existence from assembly instructions. Those who develop plans for assembling LEGO are not the same people who created the bricks.  Bricks can be reused to be assembled in many ways – including ways not foreseen. Rebrickable celebrates the flexibility of LEGO bricks.

A content model should not contain logic. Logic acts like glue that binds pieces together instead of allowing them to be configured in different ways. Decouple your content from any logic used to deliver that content. Logic in content models gets in the way: it complicates the model, making it difficult for authors (and even developers) to understand, and it makes the model less flexible.  

Many Web CMS and XML content models mix together the structure of types with templating or assembly logic. When the model includes assembly instructions, it has already predetermined how modules are supposed to fit together, therefore precluding other ways they might connect. Content models for headless CMS implementations, in contrast, define content structure in terms of fields that can be accessed by APIs that can assemble content in any manner.  

A brittle content model that can’t be modified is also expensive.  Many publishers are saddled with legacy content models that are difficult to change or migrate. They hate what they have but are afraid to decommission it because they are unsure how it was put together. They are unable to migrate off of a solution someone built for them long ago that doesn’t meet their needs.  This phenomenon is referred to as “lock-in.”

A flexible content model will focus only on the content, not how to assemble the content. It won’t embed “views” or navigation or other details that dictate how the content must be delivered. When the model is focused solely on the content, the content is truly modular and portable.  

Focus on basics and learn

LEGO encourages play.  People don’t worry about assembling LEGO bricks the wrong way because there are many valid ways to connect them. There’s always the possibility of changing your mind.

Each LEGO brick is simple.  But by trying combinations over time, the range of possibilities grows.  As Flyvbjer and Gardner say in their book, “Repetition is the genius of modularity; it enables experimentation.”

Start your model with something basic. An author biography content type would be a good candidate. You can use it anytime you need to provide a short profile of an author. It seems simple enough, too.  A name, a photo, and a paragraph description might be all you need.  Congratulations, you have created a reusable content type and are on the path toward content modularity.

Over time, you realize there are other nuances to consider. Your website also features presenters at live events.  Is a presenter biography different from an author biography?  Someone in IT suggests that the author bio can be prepopulated with information from the employee directory, which includes the employee’s job title. The HR department wants to run profiles of employees on their hiring blog and wonders if the employee profile would be like the author bio.  

As new requirements and requests emerge, you start to consider the variations and overlaps among them. You might try to consolidate variations into a single content type, extend a core content type to handle specialized needs or decide that consolidating everything into a single content type wouldn’t simplify things at all. Only through experimentation and learning will the correct choice become apparent.

It’s a good idea to document your decisions and share what you’ve learned, including with outside teams who might be curious about what you are working on – encourage them to steal your ideas.  You can revisit your rationale when you are faced with new requirements to evaluate.

Evolve your model

Embrace the LEGO mindset of starting small and thinking big.

The more experience your team gains with content modeling, the more comprehensive and capable your content model will become.

Flyvbjer and Gardner note: “Repetition also generates experience, making your performance better. This is called ‘positive learning.’”  They contrast it with negative learning, where “the more you learn, the more difficult and costly it gets.” When the model starts off complex, it gets too big to understand and manage. Teams may realize only one person ever understood how the model was put together, and that person moved to another company. 

Content modeling is about harnessing repeating patterns in content. The job of content modeling is to discern these patterns.

Content modeling should be a continuous activity. A content model isn’t a punch-list task that, once launched, is done. The model isn’t frozen. 

The parts that work right will be second nature, while the parts that are still rough will suggest refinement.

While content modeling expertise is still far from mainstream, there are growing opportunities to gain it. Teams don’t have to wait for a big re-platforming project.  Instead, they should start modeling with smaller projects. 

New low-cost and/or low-code tools are making it easier to adopt modular approaches to content. Options include static site generators (SSGs) like Jekyll or Hugo, open-source headless CMSs like Strapi, and no-code web-builders like Webflow. Don’t worry if these tools don’t match your eventual needs.  If you build a model that supports true modularity, it will be easy to migrate your content to a more sophisticated tool later. 

With smaller projects, it will be more evident that the content model can change as you learn new things and want to improve it. Like other dimensions of software in the SaaS era, content models can continuously be released with improvements.  Teams can enhance existing content types and add new ones gradually. 

The evolution of content models will also include fixing issues and improving performance, similar to the refactoring process in software.  You may find that some elements aren’t much used and can be removed. You can simplify or enrich your model.  Sometimes you’ll find similar types that can be folded into a single type – simplification through abstraction.  Other times, you’ll want to extend types to accommodate details that weren’t initially identified as being important. 

Continual iteration may seem messy and inefficient, and in some ways, it is. Those who approach a content model as one big thing wish the model to be born fully mature, timeless, and resistant to aging. In practice, content models require nurture and maintenance. Changing details in a large content model will seem less scary once you’ve had experience making changes on a smaller model.  And by working on them continuously, organizations learn how to make them better serve their needs. 

Regard changes to content models as a learning experience, not a fearful one.

Allow your model to scale up

Flyvbjer and Gardner note: “A block of Lego is a small thing, but by assembling more than nine thousand of them, you can build one of the biggest sets Lego makes, a scale model of the Colosseum in Rome. That’s modularity.”

My visit to the National Building Museum revealed how big a scale small LEGO bricks can build, as shown in this model of London’s St Pancras rail station. 

St Pancras rail station in LEGO by Warren Elsmore at National Building Museum

The magic of LEGO is that all bricks can connect to one another. The same isn’t necessarily true of content models. Many models reflect the idiosyncrasies of specific CMS platforms.  Each model becomes an isolated island that can’t be connected to easily.  That’s why content silos are a pervasive problem.

However, a modular content model focused on content and not assembly logic can easily connect to other modular models. 

A content model can enable content to scale.  But how does one scale the content model?

The good news is that the same process for developing a small model applies to developing a larger one.  When you embrace an iterative, learning-driven approach to modeling, scaling your model is much easier.  You understand how models work: which decisions deliver benefits and which ones can cause problems. You understand tradeoffs and can estimate the effort involved with alternatives.

One key to scaling a model is to play well with other models. In large organizations, different teams may be developing content models to support their respective web properties.  If these models are modular, they can connect.  Teams can share content.

It’s likely when there are two models, there will be overlaps in content types. Each model will have a content type defining a blog post, for example. Such situations offer an opportunity to rationalize the content type and standardize it across the teams. Eventually, separate teams may use a common content model supporting a single CMS. But until then, they can at least be using the same content type specifications.  They can learn from each other.

– Michael Andrews

Categories
Content Engineering

All models reflect a point of view

We sometimes talk about having a “bird’s eye” perspective of a place or about a topic.  We can see how details connect to form a larger whole.  The distance highlights the patterns that may be hard to see up close.  In many ways, a content model is a bird’s eye view of content addressing topics or activities.  It’s a map of our content landscape. 

This year, many of us — having been shut indoors involuntarily and had our travel limited —  have marveled at the freedom of travel that birds enjoy.  Bird watching has become a popular pastime during the pandemic, encouraging people to consult content about birds to understand what they are newly noticing.

Content about birds provides an excellent view into the structure of content — even for people not interested in birds. Nearly 20 years ago JoAnne Hackos discussed what field guides to birds can teach us about structuring content in her book, Content Management for Dynamic Web Delivery.  Information about birds offers a rich topic to explore content structure.

 Last year, I conducted a workshop exploring content structuring decisions by comparing how field guides describe bird species.  I have a collection of field guides about Indian birds from my time living in India.  They all broadly cover the same material, but the precise coverage of each varies.  Some will talk about habitats, others will discuss diet.  They will get into different levels of detail.

Various guides to Indian birds
Various guides to Indian birds

As someone interested in birds, I noticed how these field guides differed when discussing the same bird species.  Even the images of a bird varied greatly: whether they are paintings or photos, if they are in context or not, and whether distinguishing features are described in the text or as annotations.  Image choices have profound implications for what information is conveyed.

All this variation reveals something that’s obvious when you see it: there’s no one way to describe a bird.  People make editorial choices when they structure content.  That’s a very different way of thinking about content structure than the notion of domain modeling, which assumes that things we describe have intrinsic characteristics that can be modeled objectively.  Domain modeling presumes there’s a platonic ideal that we can discover to describe things in our world. It can provide some conceptual scaffolding for identifying the relationships between concrete facts associated with a topic. Domain modeling is best approached from the viewpoint of different personas.  Why do they care about any of these facts?  And importantly, might they care about different facts?

Without doubt, it’s valuable to get outside of our subjective ways of seeing to consider other viewpoints.  But that doesn’t imply there’s a single viewpoint that is definitive or optimal.  Content modeling is not the same thing as data modeling, just as content isn’t simply data.  The limitation of domain modeling is that it doesn’t provide any means to distinguish more important facts from less important ones.  And it treats content as data, where facts are readily reduced to a few concrete objective statements rather than involving descriptive interpretations or analysis.  With a domain model, it’s not obvious why people care about any of the data.  Before we can build experiences from content, we need the basic content to be interesting and relevant. Relevance and interest have been overlooked in many discussions about content modeling.   

While I see richness in small editorial decisions among different field guides, a person not interested in birding may find these distinctions as unimportant.  To a casual viewer, all field guides look similar.  There’s a generic template that field guides seem to follow to describe birds.  Here, the editorial decisions have emerged over time as a common framework that’s widely accepted and expected.  Some people will view the task of structuring content as one of finding an existing framework that’s known, and copying it.  Instead of searching for the intrinsic properties of things that can be described, the search is finding patterns already used in the content.  Adopting vernacular structural patterns has much to commend it.  These patterns are familiar, and in many cases work fine.  But relying on habit can also cause us to miss opportunities to enrich how to describe things.  They may satisfy a basic need, but they don’t necessarily do so optimally.  

The most disruptive development to influence field guides is the smartphone app.  These apps shake up how they approach birds and can incorporate image and audio recognition to aid in identifying a species.   They can be marvelously clever in what they can do, but they too represent editorial decisions: a focus on a transactional task rather than a deeper look into context and comparison.  The more that content exists to support a repetitive transactional task, the more tightly prescribed the content will be in a model.  If you consider content as existing only to support narrow transactional needs, the structure of the model might seem obvious, because what else would someone care about?  Such a model presumes zero motivation on the part of the reader: they only want to view content that is necessary for completing a task.  

Field guides exist to aid the identification of birds.  Deciding what information is important — or is readily available — to aid the identification of birds still involves an editorial judgment.  

Topics don’t have intrinsic structures.  The structure depends in part on the tasks associated with the topic.  And the more that a topic requires the motivation of the reader — their interest, preferences, and choices outside of a narrow task supported within the UI — the more important the editorial dimensions of the content model become.    The model needs to express elements that tell readers why they would care about the topic: what they should learn or see as important and relevant.

Importantly, the discussion of a topic is not limited to a specific genre.  Species of birds can be discussed in ways other than field guides.  Within a genre,  you can change the structure associated with it.  But you can switch genres as well, and embrace a different structure entirely.

When we consider structures within genres, we may be inclined to confuse its form with its intent.  The importance of genre is that it provides a point of view to elucidate a topic.

Picture books provide an alternative genre to present content about birds.  They involve the tight juxtaposition of words and images, often to provide a richer narrative about a topic.  Beyond that, the structure involved is not standardized or well defined.  The genre is most often associated with children’s books and exemplified by outstanding writer-illustrators such as Maurice Sendak, Dr. Seuss, and Quintin Blake.

But picture books are not limited to children’s entertainment.  They can potentially be used for any sort of topic and any audience.

This spring a new picture book, What it’s like to be a Bird, was released that became an instant bestseller.  It was produced by David Sibley, a renowned ornithologist and illustrator.  “My original idea, in the early 2000s, was to produce a bird guide for kids. Then I started thinking about it as a bird guide for beginners of any age.  But having created a comprehensive North American bird guide, the concept of a ‘simplified’ guide never clicked for me. Instead, I wanted to make a broader introduction to birds.”

His goal is to “give readers some sense of what it’s like to be a bird…My growing sense as I worked on this book is that instinct must motivate a bird by feelings — of satisfaction, anxiety, pride, etc…how else do we explain the complex decision that birds make everyday…”  Sibley wants to capture the bird’s experience making decisions on its life journey.  A wonderful backdrop as we think about the reader’s experience on the journey through his book.

“Each essay focuses on one particular detail…they are meant to be read individually, not necessarily in sequence — everything is interconnected, and there are frequent cross-references suggesting which essay to read next.”    We can see how Sibley has planned a content model for his material.

Bolstered by his talents as a subject expert, writer, and illustrator, he’s been able to rethink how to present content about birds.  While he’s also written a conventional field guide to birds, his new book explores species through their behavior.  His is not the only recent book looking at bird behavior, but the perspective he offers is unique.  Rather than organizing content around behavior themes such as mating, he focuses the content on species of birds and then talks about two or three key behaviors they have that are interesting.  The birds act as protagonists in stories about their life situation.  

Sibley’s book profiles various species of birds — something field guides do as well. But Sibley’s profiles explain the bird from its own point of view, instead of from an external viewpoint of concrete properties such as feather markings or song calls.  The stories of these species incapsulate actions that happen over a period involving a motivation and outcome.  They aren’t data.  

He introduces species of birds by presenting two or three stories about each.  Each story is a short essay explaining an illustration of an activity the bird is engaged in.  Frequently, the illustration is puzzling, prompting the reader to want to understand it.  In some cases, he breaks the story into several small paragraphs, each with its own illustration, when he wants to describe a sequence of events over time.

A simplified content model will show how Sibley explains birds.  The diagram reveals a highly connected structure.  It doesn’t look like the hierarchical structure of a book. There’s no table of contents or index.  Though the content is manifested as a book, it could be delivered to alternative platforms and channels.  

A simplified content model for Sibley's What it's like to be a bird
A simplified content model for Sibley’s What it’s like to be a bird

On the far left of the diagram, we see themes about birdlife that are explored.  These themes may be broken into sub-themes.  For example, the theme of survival has two sub-themes, which has even more specific sub-themes:

  • Survival
    • Birds and weather
      • Keeping cool
      • Keeping warm
    • Avoiding predators
      • Be inconspicuous
      • Be alert
      • Create a distraction

Each theme or sub-theme presents a range of related factual statements.  For example, he presents a series of facts about how birds create distractions.  These facts represent some important highlights about a theme: an index of knowledge that offers a range of perspectives.  Each fact points to a profile of a bird specifies, where a story essay will provide context about the statement and make it more understandable.

In this example, we see how the theme of how birds use smell is revealed through a series of facts that point to essays about how different species use of smell.

Thematically grouped factual highlights about birdlife (source: Sibley's What it's like to be a Bird)
Thematically grouped factual highlights about birdlife (source: David Sibley’s What it’s like to be a Bird)

When we visit a profile of a bird species, we encounter several stories, which are a combination of picture and essay.  The illustration shows  a starling holding a cigarette in its beak.  The situation has the makings of a story — we want to know more.  The essay tells us.

Illustration and essay providing a story relating to a behavior of a bird species (source: Sibley's What it's like to be a Bird)
Illustration and essay providing a story relating to a behavior of a bird species (source: David Sibley’s What it’s like to be a Bird)

Even if the story is enjoyable, a part of us may wonder if it’s just an entertaining yarn. Picture books are most often associated with fantasy, after all.  And unlike a nature program on TV, we don’t see a museum expert in a talking head interview to make it seem more credible.  Instead, we get a list of recent scientific references relating to the issue.  All these references are arranged thematically, like our facts.  They provide an overview of the focus on recent scientific research about birds, giving us a sense of how much scientists are still learning about these ubiquitous creatures that are as old as dinosaurs. 

Source references of recent discoveries from scientific research relating to birds, arranged thematically (source: Sibley's What it's like to be a Bird)
Source references of recent discoveries from scientific research relating to birds, arranged thematically (source: David Sibley’s What it’s like to be a Bird)

The structuring of the content helps us to understand and explore.  The stories make each bird more real: something living we can become interested in.  We can understand their dilemmas and how they seek to solve them.  We are up-close.  But we can also step back and understand the broader behaviors of birds that influence their lives.  

While each species is described through stories, we are not limited to those.  How do other birds work with smell?  What have we learned recently about smell and birds?  These other pathways allow us to follow our interests and find different connections.  

Sibley’s model shows how to transcend the top-down hierarchies that force how to learn about a topic and the bottom-up collections of random facts that leave us with no structure to guide us.  Sibley’s model of content can be approached in different ways, but it is always deliberate.  There’s no feeling of being lost in hyperlinks.

Content models should reflect what readers want to get and how they might want to get it.  They are more than a technical specification.  They are an essential tool in editorial planning.  Developing a content model can be a creative act.

— Michael Andrews