Schema Markup

RDF and Schema Markup: The Power of Relationships in the Age of Intelligent Systems

Reading Time: 9 minutes

As AI-driven search and machine learning continue to evolve, modern systems are better than ever at processing and interpreting data. This evolution raises an important question: how can we ensure data is meaningful and understandable for machines?

As machines become more contextually aware, one thing is certain—articulating relationships between entities is more vital than ever.

Schema Markup, aka structured data, powered by the Resource Description Framework (RDF), is the foundation of this capability. It enables us to move beyond isolated data points to create interconnected Knowledge Graphs, where entities and their relationships are expressed with semantic richness.

These graphs are not only essential for search visibility but also critical for empowering AI systems to infer, reason, and deliver accurate, contextual insights.

In this blog, we’ll explore why articulating property relationships through Schema Markup and RDF is a cornerstone of effective SEO and AI readiness.

What Is RDF?

The Resource Description Framework (RDF) is a standardized method for expressing data as a directed graph using subject-predicate-object statements, commonly referred to as “triples.”

RDF Triples: The Building Blocks of Knowledge Graphs

The foundational unit of a Knowledge Graph is the triple. Each triple consists of two nodes (representing entities) connected by a single edge (representing a relationship). This structure, articulated as “subject-predicate-object” statements, illustrates how one entity (subject) connects to another entity or value (object) through a specific property (predicate).

RDF example using "Mark van Berkel" as the subject, connecting using "founded" as the predicate, to "Schema App" which is the object.

When triples combine, they form an interconnected graph of resources, which serves as the foundation of a comprehensive Knowledge Graph.

Mark van Berkel's Knowledge Graph

To make these triples meaningful for machines, they must be expressed in a machine-readable format.

RDF Formats

When building or working with applications that consume or produce linked data, there are various formats that RDF triples can be expressed in, including:

However, if you’re doing RDF for your webpage, these are Schema.org formats that can be used:

At Schema App, we store all of our customers’ data in a Knowledge Graph Database as RDF data. However, when we export or publish data to websites, we transform it into JSON-LD for compatibility with search engines like Google. While JSON-LD (like many other formats) is machine-readable, this format makes it easier for marketers (who are not experts in RDF) to understand the RDF statement.

Let’s illustrate this with an example.

How Does Schema Markup Look as RDF?

Let’s look at a scenario of how the same Schema.org data is represented in JSON-LD, RDFa and RDF Turtle syntax.

Scenario: You have an eCommerce product detail page for wireless headphones. The brand of this headphone is TechSound, and you want to clearly indicate the brand of the product within your Schema Markup.

A table showing how Schema.org data would look if represented as JSON-LD, RDFa, and RDF Turtle Syntax.

Note: According to the World Wide Web Consortium (W3C), one of the main design goals of JSON‑LD is simplicity.

“No extra processors or software libraries are necessary to use JSON‑LD in its most basic form. The language provides developers with a very easy learning curve. Developers not concerned with Linked Data only need to understand JSON, and know to include but ignore the @context property, to use the basic functionality in JSON‑LD.”

World Wide Web Consortium. (2014). JSON‑LD 1.0.

JSON-LD specifies a number of syntax tokens and keywords that are a core part of the language which add additional semantic meaning and bring it closer to RDF, such as:

  • @context defines the vocabulary and meaning of terms
  • @id provides a unique identifier (URI) for an entity
  • @type specifies the entity’s classification within that vocabulary (e.g., schema:Product).

How RDFa Became a Web Standard

1. The Need for a Machine-Readable Web

In the early 2000s, the Semantic Web vision, championed by Tim Berners-Lee, aimed to make web content machine-readable. However, most web pages contained unstructured or semi-structured data, making it difficult for search engines and applications to extract meaningful relationships between entities.

At the time, RDF had already been established as a powerful way to represent linked data. But there was no easy way to embed RDF data directly into HTML documents without breaking web page layouts.

2. The Birth of RDFa (2004–2008)

To solve this, researchers at the W3C (World Wide Web Consortium) started developing a new attribute-based syntax to incorporate RDF directly into HTML.

The result was RDFa (RDF in Attributes), first introduced in 2004 as part of a W3C proposal. The main goals were:

  • Allow RDF triples to be embedded inside HTML documents.
  • Use HTML attributes (typeof, property, about, etc.) to structure RDF data.
  • Make it compatible with existing web standards (HTML, XHTML, XML).
  • Enable web pages to contribute to linked data and the Semantic Web.

RDFa was formally standardized by the W3C in 2008 with the release of RDFa 1.0, which worked with XHTML.

3. Evolution & Widespread Adoption (2010–2015)

In 2010, RDFa 1.1 was released, improving support for HTML5 and simplifying its syntax. Around the same time in 2011, Schema.org was introduced by Google, Bing, and Yahoo! to promote structured data for search engines.

RDFa became one of the three officially supported formats for Schema.org markup, alongside Microdata and JSON-LD.

The European Union and Governments started using RDFa to publish open data in a machine-readable way.

4. Google Prefers JSON-LD (2015–Present)

While RDFa remains a W3C standard, its use in SEO declined when Google announced its preference for JSON-LD in 2015. JSON-LD offered a simpler, script-based format that was easier for developers to implement compared to inline RDFa.

However, RDFa is still widely used in government, academia, and linked open data projects where RDF compatibility is essential.

Benefits of Articulating Property Relationships Using Schema Markup and RDF

In essence, Schema Markup acts as a structured data layer that explicitly defines how entities relate to one another on your website. By having this schema data stored in a graph database, it provides a rich semantic data layer that is crucial for building web aware AI systems, search engines, large language models (LLMs), and internal chatbots, allowing them to interpret your content with greater accuracy.

By leveraging Schema Markup to define these relationships, you create a semantic foundation that enhances how intelligent systems process, connect, and retrieve your data, ensuring your content is accurately represented in an increasingly AI-driven landscape.

Schema Markup transformed into RDF triples are central to the power of Knowledge Graphs, enabling superior information modeling through their unique features and capabilities.

Here’s why RDF frameworks make Knowledge Graphs exceptional for modeling complex data relationships:

1. Flexible and Extensible Schema

RDF properties allow data models to evolve organically, as relationships can be added without altering the existing structure. Unlike relational databases, where schema changes often require significant restructuring, RDF properties enable dynamic schema evolution.

For example, a Knowledge Graph for an eCommerce site might start with schema:Product and later add relationships like schema:brand or schema:offers without requiring a schema redesign.

2. Semantic Richness

RDF properties inherently carry meaning (e.g., rdfs:subClassOf, owl:sameAs), allowing relationships to express complex semantics. These semantics enable advanced reasoning and inference, which relational databases lack.

For instance, using schema:knows to represent relationships between people in a social graph inherently expresses the concept of “knowing someone.”

Graphic example of a knowledge graph showing Person 1 knows Person 2, Person 1 likes Hiking, and Person 1 likes National Geographic.

3. Semantic Interoperability

RDF properties use standardized vocabularies (e.g., Schema.org, FOAF, SKOS) that ensure compatibility across systems and datasets. This ensures the data remains universally understood, making it easy to integrate data from diverse sources while preserving the semantic integrity of relationships.

By using Schema.org and RDF, you provide a shared, machine-readable structure that eliminates ambiguity and enables seamless integration, automation, and AI-driven insights.

For example, a Knowledge Graph using schema:Person and schema:Event can easily integrate with other systems that recognize these vocabularies, such as Google or Bing, ensuring widespread compatibility.

4. Contextual Relationships

RDF properties support reification. Reification is the act of representing something abstract into something real and concrete. RDF properties support reification by allowing you to add metadata to relationships (e.g., provenance, confidence, timestamps). This is critical for understanding the “context” of data, which relational databases generally struggle to represent.

For example, schema:Review can be connected to a schema:Product with metadata such as schema:author (who wrote the review) and schema:datePublished (when it was written).

A graphic example of how JSON-LD calling out Review, Product, author, datePublished, and other related markup would be depicted as a Knowledge Graph highlighting these contextual relationships.

5. Graph Traversal

Relationships defined by RDF properties enable graph traversal, making it easier to explore interconnected data. Query languages like SPARQL take advantage of this by allowing path queries that are infeasible or complex in SQL.

For example, starting with a schema:Person who is a schema:attendee at a schema:Event, you can traverse to find related schema:Organization that sponsored the event or schema:Place where it was held.

Graphic example of a knowledge graph showing Person is an attendee at an Event, that is sponsored by Organization and located at Place.

6. Inference and Reasoning

RDF properties can enable inferencing through ontologies, deriving new relationships from existing data (e.g., if A isPartOf B and B isPartOf C, then A isPartOf C). This automated reasoning enhances data discovery and insights.

For instance, if schema:hasPart is used to define that a schema:Book includes chapters as schema:CreativeWork, the system can infer that every chapter is a schema:CreativeWork. Reasoning could also infer relationships like schema:subOrganization between a parent schema:Corporation and its subsidiaries.

7. Linking Open Data

RDF properties are the backbone of Linked Data, enabling the connection of Knowledge Graphs across domains (e.g., linking your data to Wikidata or DBpedia). This supports richer knowledge integration and the reuse of existing datasets.

For example, a Knowledge Graph that uses schema:Person can link to Google Knowledge Graph or Wikidata entities with schema:sameAs to provide richer profiles and additional context about the person.

JSON-LD stating that Person, Ryan Gosling, mentioned on IMBD, is the sameAs the Wikidata entry for Ryan Gosling.

8. Hierarchical and Cross-Domain Relationships

RDF properties like rdfs:subPropertyOf and owl:equivalentProperty allow for hierarchical and cross-domain mappings. These relationships make it easier to unify data under a shared ontology or map similar concepts across different vocabularies.

For example, rdfs:subClassOf allows you to define that schema:NewsArticle is a subclass of schema:Article, automatically inheriting all its properties like schema:author and schema:headline.

9. Machine-Readable and Human-Understandable

RDF properties provide clear, descriptive identifiers, making data comprehensible for both machines and humans. This dual utility improves usability and reduces ambiguity compared to cryptic database column names.

For example, schema:Event is clear and descriptive for both developers and systems, compared to a relational database column like evt_type or e_type.

10. Global Identifiers

RDF properties use Unique Resource Identifiers (URIs), ensuring globally unique identifiers for entities and their relationships. This prevents naming conflicts and ensures unambiguous references, unlike column names in relational databases that may overlap in different contexts.

It’s clear that these benefits highlight how RDF properties empower Knowledge Graphs to surpass relational databases in handling interconnected, evolving, and semantically rich data.

Adding Semantics to SQL Improves AI

A benchmark study conducted by data.world compared the performance of SQL tables and RDF SPARQL databases using the same dataset and concepts. The findings highlighted significant advantages of semantic modeling, particularly as queries increased in complexity:

  • For simple table lookups, SQL and SPARQL performed similarly.
  • When queries involved cross-table joins, SPARQL significantly outperformed SQL.

This performance disparity underscores the importance of how relationships between concepts are defined.

RDF properties allow data to be interlinked and enriched with semantic meaning, providing a richer framework for AI and search applications. These properties enable advanced inference and reasoning capabilities, which are essential for Large Language Models (LLMs) to interpret and connect disparate data points accurately.

By contrast, SQL’s reliance on rigid foreign key structures often leads to fragility in cross-concept queries. This limitation can hinder AI systems that depend on nuanced understanding and interconnected data for optimal performance. As semantic machine understanding becomes critical in modern AI and search engine algorithms, adopting Knowledge Graphs offers organizations a pathway to future-proof their data strategies.

Organizations leveraging these technologies position themselves to achieve better search visibility, deeper insights and inferences of their data, and a competitive edge in the era of intelligent systems.

Future-Proofing Your Data Strategy with RDF and Schema Markup

As search engines and AI systems advance, the ability to communicate the relationships between entities in a machine-readable way will define which organizations thrive in the era of intelligent systems.

RDF and Schema Markup provide a way for humans to articulate these relationships, enabling the creation of rich Knowledge Graphs that power smarter AI interactions and more relevant search experiences.

By embracing these technologies, organizations gain more than just improved search visibility—they unlock the potential for advanced reasoning, interoperability, and data integration that drives competitive advantage.

From creating flexible and semantically rich data models to enabling machine-readable clarity, Schema Markup and RDF empower businesses to control their content’s narrative in the age of AI.

For those questioning the relevance of structured data, the evidence is clear: investing in Schema Markup isn’t just a technical decision; it’s a strategic imperative.

Let Schema App help you harness the power of structured data to future-proof your digital presence and unlock the full potential of interconnected, intelligent systems. Get in touch with us today to get started.

Mark van Berkel, Schema App
Mark van Berkel

Mark van Berkel is the Chief Technology Officer and Co-founder of Schema App. A veteran in semantic technologies, Mark has a Master of Engineering – Industrial Information Engineering from the University of Toronto, where he helped build a semantic technology application for SAP Research Labs. Today, he dedicates his time to developing products and solutions that allow enterprise teams to leverage Schema Markup to boost their AI strategy and drive results.

Menu