November 11, 2025
November 11, 2025

Industry 4.0 systems require seamless communication between different stakeholders including embedded systems engineers, application developers, and system managers. This guide explains how to create semantic digital twin models using Node-RED, enabling knowledge sharing and automated reasoning through standardized semantic modeling and knowledge graphs.
This tutorial is designed for engineers, researchers, and architects developing digital twin solutions that require cross-domain knowledge integration and system interoperability.
The term "digital twin" refers to different concepts depending on automation levels. Three distinct categories exist:
A digital model represents a physical asset with manual data flow in both directions:
Physical to Digital Flow: Updates from physical assets are manually transferred to digital representationsDigital to Physical Flow: Changes in digital models are manually implemented in physical systems
Common Applications: CAD models, design-phase simulation models, engineering specifications
A digital shadow provides automatic updates from physical assets but requires manual feedback:
Physical to Digital Flow: Automatic data collection through sensors and monitoring systemsDigital to Physical Flow: Manual implementation of adjustments based on digital insights
Common Applications: Real-time monitoring dashboards, predictive maintenance alert systems
A digital twin features bidirectional automatic data flow between physical and digital representations:
Physical to Digital Flow: Automatic sensor data collection and system monitoringDigital to Physical Flow: Automatic control commands and system adjustments
Common Applications: Closed-loop control systems, autonomous process optimization
True digital twins represent the highest integration level where physical and digital systems synchronize continuously without manual intervention.
Conventional model-based design follows this sequence:
In this approach, models serve primarily as development artifacts that become disconnected from operational systems after deployment.
Digital twin methodology extends modeling throughout the complete product lifecycle:
The fundamental difference: models remain active throughout the system lifecycle, evolving with physical systems and enabling continuous improvement.
Research literature provides multiple digital twin definitions. This framework synthesizes key concepts:
A digital twin is a combination of models that represent an observable asset from the real world.
Models: Multiple representations providing different perspectivesObservable Assets: Physical entities (equipment, devices) or non-physical elements (processes, workflows)Multiple Perspectives: Behavioral models, 3D geometry, state machines, validation testsContinuous Updates: Real-time tracking of asset properties and statesBidirectional Interaction: Properties that change, actions that execute, and events that signal transitions
Consider a wind turbine as the physical asset. The digital twin includes:
3D Geometry Model: Physical structure, dimensions, and spatial positioningAutomationML Model: Equipment hierarchy and system connectionsState Machine Model: Operating modes including starting, running, stopping, and maintenance statesBehavioral Model: Power generation curves and efficiency characteristicsValidation Tests: Software component testing and verification
These combined models form the complete cyber representation of the physical wind turbine, serving different stakeholders and supporting various use cases.
This semantic modeling approach aligns with W3C (World Wide Web Consortium) Web of Things standards, which define a "Thing" with three core elements:
Properties: Readable and writable attributes such as temperature, speed, and operational statusActions: Invocable operations including start, stop, calibrate, and reset functionsEvents: Notifications about state changes like alarms triggered or cycles completed
Alignment with established W3C standards ensures interoperability with existing platforms and avoids proprietary implementations that limit system integration.
This methodology provides a systematic approach to semantic digital twin creation:
Decompose your system into discrete, observable components with these characteristics:
Production Line Example:
Develop models for each identified asset, defining three core elements:
Establish relationships between assets using semantic descriptions:
Semantic descriptions enable automated reasoning beyond simple data exchange.
Connect additional representation formats:
Establish physical asset connectivity:
Deploy the configured digital twin to operational infrastructure, making it available for use.
Semantic relationships create a knowledge graph—interconnected facts stored as entities and relationships. Knowledge graphs enable:
Inference Capabilities: Deriving new facts from existing relationship dataDiscovery Functions: Finding related information across different domainsIntegration Benefits: Connecting previously siloed stakeholder knowledge
Develop applications leveraging the knowledge graph infrastructure:
Knowledge graphs provide capabilities that complement machine learning approaches.
Knowledge graphs represent facts as interconnected entities and relationships:
Entity Examples: BMW, i-Next Electric Car, Tesla, Model S, Automotive IndustryRelationship Examples:
Knowledge graphs derive implicit knowledge from explicit facts:
Given Facts: BMW is_member_of Automotive Industry, Tesla produces Model S, Model S is_type Electric CarInferred Fact: Tesla is_member_of Automotive Industry
This reasoning capability enables systems to understand unstated relationships, connecting distributed stakeholder knowledge.
Embedded systems engineers possess hardware knowledge. Application developers understand software requirements. Knowledge graphs formally capture and relate this distributed expertise, enabling:
This research provides domain-independent semantic modeling language for representing knowledge across diverse areas:
Manufacturing Domain: Equipment hierarchies, production workflows, quality parametersInformation Technology Domain: Network architecture, software components, data flowsBusiness Operations Domain: Product catalogs, customer requirements, financial constraints
Generic semantic primitives (entities, relationships, properties) enable consistent modeling across all domains, facilitating previously difficult integration scenarios.
Node-RED offers multiple benefits for digital twin implementation:
Visual Programming Interface: Flow-based development accessible to non-programmersExtensive Node Library: Thousands of nodes supporting industrial protocols, databases, and cloud platformsLightweight Architecture: Runs on edge devices, gateways, and cloud infrastructureJavaScript Foundation: Extensible through custom node developmentActive Community: Large industrial IoT user base and contributor network
The Node-RED implementation includes:
Asset Nodes: Represent observable assets with properties, actions, and eventsSemantic Relationship Nodes: Define connections between assetsCommunication Protocol Nodes: Connect to physical systems using OPC UA, MQTT, and RESTKnowledge Graph Nodes: Generate and query semantic representationsStorage Nodes: Persist knowledge graphs to database systems
Digital twin creation workflow in Node-RED:
The visual flow representation displays complete system structure and data flows, making complex relationships understandable.
Full W3C WoT Thing Description specification compliance ensures:
OPC UA integration provides:
NGSI-LD (Next Generation Service Interfaces - Linked Data) support enables:
Planned GAIA-X (European data infrastructure initiative) integration will provide:
GAIA-X enables safe, trusted data sharing across organizational boundaries—essential for industrial digital twin ecosystems spanning multiple companies.
Asset Type: Industrial Temperature SensorProperty Definitions:
Action Definitions:
Event Definitions:
factory/area1/sensor1/temperatureQuery: "Identify all sensors in ProductionLine1 currently reporting fault status"
Knowledge graph traversal process:
This query demonstrates automated reasoning across relationships not explicitly programmed—the core advantage of semantic modeling.
Semantic models based on international standards enable different systems to exchange information without custom integration development.
Formalized semantic knowledge remains accessible despite personnel changes. Semantic models document not only data structures but also meaning and relationships.
Knowledge graphs enable automated relationship discovery, constraint validation, and optimization opportunity identification beyond manual analysis.
Different stakeholders view the same semantic model from domain-specific perspectives, ensuring consistent information while using appropriate tools.
Standards-based implementations adapt more easily to new technologies and requirements compared to proprietary, tightly-coupled systems.
Semantic digital twin models using Node-RED provide a research-driven methodology for connecting diverse stakeholders, systems, and knowledge domains in Industry 4.0 environments. By extending traditional digital twin concepts with semantic modeling and knowledge graphs, this approach enables automated reasoning, cross-domain integration, and standards-based interoperability.
The Node-RED implementation makes semantic modeling accessible through visual programming while maintaining alignment with W3C Web of Things, OPC UA, NGSI-LD, and GAIA-X standards. The property-action-event framework provides consistent asset representation regardless of application domain.
While currently in active research and development phase, the project welcomes community participation to guide its evolution. By formalizing industrial knowledge as semantic models and knowledge graphs, organizations can preserve expertise, enable automated optimization, and bridge operational technology and information technology more effectively than traditional point-to-point integration approaches.