Understanding the Web's Missing Structure: A Q&A on the Block Protocol and Semantic Web

Since the 1990s, the web has primarily been a place for human-readable documents. But as we'll explore in this Q&A, the lack of structure in standard web pages limits what computers can do with that information. Let's dive into the challenges and the vision for a more intelligent web.

What was the original web like?

The early web, beginning in the 1990s, was designed as a publishing platform for documents intended to be read by people. These documents were written in HTML, which gave a basic level of structure—like marking a paragraph or emphasizing a word. Then, CSS was added to make things look appealing, such as styling paragraphs with tiny gray text. While this worked for human readers, the structure remained superficial. A computer program could see that something was in a paragraph or bolded, but it couldn't understand the meaning behind the content. That limited the web to being just a collection of pretty pages, not a rich source of data that machines could process automatically.

Understanding the Web's Missing Structure: A Q&A on the Block Protocol and Semantic Web — Source: www.joelonsoftware.com

What is the core problem with HTML's limited structure?

HTML tells a browser how to display content, but it doesn't tell a computer what the content means. For instance, you can make text bold or turn it into a heading, but there's no way to say “this is a book title” or “this is an author's name.” Without that deeper meaning, automated programs—whether simple bots or advanced AI—struggle to extract and use the information reliably. The problem becomes clear when you want to do things like aggregate data from multiple pages or power intelligent assistants. The web is full of human-friendly formatting, but very little machine-friendly semantics.

Can you give an example of a typical book citation lacking structure?

Imagine you write a blog post mentioning Goodnight Moon by Margaret Wise Brown, illustrated by Clement Hurd, published by Harper & Brothers in 1947, with ISBN 0-06-443017-0. On a standard web page, you might just make the title bold. A computer reading that page sees only formatting: a string of text with a bold tag. It cannot automatically identify this as a book, or extract the author, illustrator, publisher, or ISBN—even though that information is right there. In contrast, if the page used semantic markup from schema.org, a program could say, “Aha! This is a Book entity with these properties.” Without such structure, the web remains largely blind to the data it contains.

What was Tim Berners-Lee's vision for the Semantic Web?

As early as 1999, Tim Berners-Lee dreamed of a web where computers could analyze all data—content, links, and transactions. In his book Weaving the Web, he described a “Semantic Web” that would make it possible for machines to talk to machines, handling trade, bureaucracy, and daily life through intelligent agents. The core idea was to add explicit meaning to web content using standards like RDF and JSON-LD, often based on shared vocabularies such as schema.org. That way, a computer could understand that a page mentions a book, a person, a recipe, or an event—not just display it. Berners-Lee believed this would unlock huge potential for automation and interoperability.

How would the Semantic Web add structure to a page?

To make a page machine-readable, you would start by looking up a suitable vocabulary, such as schema.org's definition for a Book. Then you would add extra markup to your HTML using formats like JSON-LD or RDFa. For example, you might embed a snippet of JSON-LD that explicitly says: “This is a Book with title X, author Y, ISBN Z.” When a computer reads the page, it sees not just a paragraph but a structured data object. This approach allows search engines, smart assistants, and other tools to extract precise information. However, implementing it correctly requires extra effort after writing the human-readable content, which has been a major barrier to widespread adoption.

Why has adoption of semantic markup been so slow?

Despite the promise of the Semantic Web, adoption remains minimal. The main reason is the effort required: once you've written a beautiful blog post or article for people, adding semantic markup feels like extra homework. It's time-consuming to learn the vocabularies and embed the correct code. Also, without immediate payoff—like a search engine that rewards rich snippets—many authors give up. As a result, even decades after Berners-Lee's dream, very few pages include semantic annotations. The web remains mostly flat data from a computer's perspective, and the envisioned “intelligent agents” have not materialized because there's so little structured information available for them to use.

Why is fixing this important for human progress?

Adding semantic structure to web pages would dramatically improve how information is shared and processed. Human progress depends on making data accessible not only to people but also to AI systems and traditional programs. For instance, a structured web would allow smart assistants to answer complex questions, researchers to aggregate data automatically, and businesses to streamline operations. Without this foundation, we limit the potential of machine learning and automation. Making semantic markup easy and rewarding is a crucial step toward a web where both humans and machines can truly understand and use the vast amount of information available.

What would encourage people to add semantic markup?

In my view, people will only add semantic markup to their web pages if the process becomes as simple as writing in WordPress or using a simple plugin—essentially, if it's a natural part of their workflow instead of an extra chore. The Block Protocol aims to solve exactly this: by standardizing how content blocks carry structured data, authors can enrich their pages without learning complex formats. When semantic richness is built into the tools we already use, adoption can skyrocket. That's the path to finally realizing Berners-Lee's vision: a web that is both human-friendly and machine-readable, unlocking new levels of automation and discovery.