November 2, 2025
November 2, 2025
Scaling industrial AI across multiple manufacturing facilities presents unique challenges that traditional approaches struggle to address. While many organizations successfully implement AI pilots at individual sites, they often encounter significant obstacles when attempting to deploy these solutions across their global operations.
This episode explores federated learning—an approach that enables manufacturers to train AI models across multiple facilities while keeping sensitive data localized. By understanding how federated learning works and where it applies, data and analytics leaders can make informed decisions about scaling AI initiatives across their organizations.
According to Michael Kuhne Schlinkertr, CEO of Katulu, federated learning addresses two fundamental problems that prevent AI scaling in manufacturing: limited access to data and cost-efficient deployment across distributed facilities. Companies like Siemens have already implemented federated learning in production environments, demonstrating its practical viability for industrial applications.
Organizations with multiple manufacturing facilities typically pursue one of two approaches when scaling AI initiatives: site-specific implementations or centralized cloud-based systems. Each approach presents distinct challenges that impact both costs and effectiveness.
In this model, each factory develops and maintains its own AI solutions. While this approach allows customization for local processes, it creates several inefficiencies:
Some organizations attempt to address these issues by centralizing AI development in the cloud. This approach provides access to data from multiple sites but introduces different challenges:
Both approaches struggle to balance the benefits of data aggregation with the practical realities of distributed manufacturing operations.
Federated learning is a machine learning approach that enables multiple facilities to collaboratively train AI models without sharing raw data. Instead of centralizing data or developing isolated models, facilities exchange learned parameters while keeping actual production data local.
The federated learning process follows these steps:
1. Initial Model Distribution: A base model is developed (typically in a lab or pilot facility) and deployed to all participating factories. This initial model serves as the common starting point for collaborative learning.
2. Local Training: Each facility trains the model using its local production data. The data never leaves the facility's systems. During training, the model adjusts its parameters based on local patterns and conditions.
3. Parameter Exchange: After local training, each facility shares only the model parameters (the mathematical weights that represent what the model learned) with a central server. The raw data remains at each site.
4. Model Aggregation: The central system combines parameters from all facilities into an updated global model. This aggregated model benefits from patterns learned across all locations.
5. Updated Model Distribution: The improved global model is sent back to each facility, where it can be further refined with local data. This cycle repeats, with the model continuously improving.
This approach addresses several critical challenges in industrial AI deployment:
Data Privacy and Security: Sensitive production data remains within each facility's control. Trade secrets, proprietary processes, and confidential information never leave local systems.
Regulatory Compliance: Because data doesn't move across borders or between facilities, compliance with data sovereignty laws (GDPR, local privacy regulations) becomes more manageable.
Reduced Data Transfer Costs: Only model parameters are transferred—a fraction of the size of raw production data. This significantly reduces cloud storage and bandwidth expenses.
Access to Distributed Data: Models benefit from learning patterns across all facilities, even when direct data sharing isn't possible due to technical, legal, or strategic constraints.
Improved Model Performance: Training on diverse data from multiple facilities typically produces more robust models than training on data from a single location.
A Siemens facility implemented federated learning for quality prediction in printed circuit board (PCB) manufacturing. Multiple production lines trained a shared model for defect detection. Each line's model learned from its local data, but the parameters were combined to create a model that recognized quality patterns across all lines. The result was more accurate defect prediction without requiring any facility to share detailed production data with other sites.
Beyond multi-site implementations within a single organization, federated learning enables AI collaboration across different companies in a supply chain. This capability addresses a longstanding challenge: optimizing processes that span organizational boundaries without sharing proprietary data.
Consider a manufacturing value chain where different process steps occur at different companies:
Traditional Limitations: A PCB manufacturer wants to optimize quality by understanding how variations in soldering paste affect their production outcomes. However, the soldering paste supplier considers their production process proprietary and won't share detailed manufacturing data.
Federated Learning Solution: Each company trains their portion of an AI model on their own data:
This cross-organizational approach enables several use cases:
Quality Traceability: Track quality issues to upstream processes without suppliers sharing detailed production data
Predictive Maintenance: Equipment manufacturers can improve predictive models using data from multiple customer installations without accessing sensitive operational information
Process Optimization: Tier 1 suppliers and OEMs can jointly optimize processes that span their organizations while maintaining competitive separation
Collaborative Innovation: Companies can jointly develop AI models for industry-wide challenges (such as sustainability metrics or safety improvements) without sharing competitive data
While federated learning offers significant advantages, successful implementation requires addressing several organizational and technical challenges. Understanding these challenges helps organizations plan effectively and set realistic expectations.
Local Autonomy Resistance: Production facilities often operate with significant independence. Site managers and production teams may believe their processes are too unique to benefit from shared models. This perception can create resistance to collaborative AI approaches.
Change Management Requirements: Implementing federated learning requires coordination across facilities that may have different priorities, schedules, and success metrics. Building consensus on model objectives, performance criteria, and deployment timelines demands executive sponsorship and clear governance structures.
Skill Distribution: Data science expertise varies across facilities. Some sites may have strong technical teams, while others have limited AI capabilities. Federated learning implementations must account for these capability differences in training, support, and maintenance procedures.
Data Quality Variance: Successful federated learning requires consistent data quality across participating facilities. Common challenges include:
Organizations must establish minimum data quality standards before implementation. This doesn't require perfect data uniformity, but sufficient consistency for meaningful model training.
Infrastructure Prerequisites: Each participating facility needs:
Data Harmonization: While federated learning doesn't require centralized data storage, facilities must agree on:
Federated learning frameworks are actively developing, and implementation requires expertise in both machine learning and industrial systems. Organizations should expect:
Pilot with Supportive Sites: Begin with 2-3 facilities that have strong data foundations and supportive leadership. Use early success to build momentum for wider adoption.
Invest in Data Governance: Establish clear data quality standards, ownership models, and access policies before implementation. These foundations support both federated learning and future AI initiatives.
Provide Comprehensive Training: Develop training programs that help facility teams understand not just how to use the technology, but why federated learning benefits their operations.
Create Cross-Site Communities: Facilitate knowledge sharing between facilities implementing federated learning. Regular communication helps sites learn from each other's challenges and solutions.
Align Incentives: Ensure that facility performance metrics reward participation in collaborative AI initiatives, not just local optimization.
Organizations can follow a structured approach to implement federated learning successfully. This framework balances technical requirements with organizational readiness.
Identify applications where federated learning provides clear advantages:
Ideal Characteristics:
Common Starting Points:
Selection Criteria:
Establish the data infrastructure necessary for federated learning:
Assessment Activities:
Standardization Requirements:
Infrastructure Preparation:
Organizations don't need perfect data uniformity, but require sufficient consistency for effective model training across sites.
Begin with a limited deployment to validate the approach and refine processes:
Site Selection: Choose 2-3 facilities with:
Pilot Objectives:
Success Metrics:
Build the economic justification for broader deployment:
Cost Analysis:
Value Calculation:
Comparison to Alternatives:
Expand implementation across the organization:
Rollout Strategy:
Governance Structure:
Knowledge Transfer:
Maintain and enhance the federated learning system:
Performance Monitoring:
Model Updates:
Capability Building:
Federated learning represents a practical solution to fundamental challenges in scaling industrial AI across distributed manufacturing operations. By enabling collaborative model training without centralized data storage, this approach addresses critical barriers related to data access, compliance, and cost efficiency.
Technical Advantages: Federated learning allows organizations to develop more robust AI models by learning from distributed data sources while maintaining data locality and security.
Economic Benefits: The approach reduces redundant development costs, minimizes data transfer expenses, and enables more efficient scaling compared to site-specific or fully centralized approaches.
Organizational Enablement: Successfully implemented, federated learning supports collaboration across facilities and even supply chain partners while respecting competitive boundaries and proprietary information.
Strategic Positioning: Organizations that develop federated learning capabilities now build expertise and infrastructure that will support future AI initiatives across their operations.
The technology has moved beyond theoretical research to production implementation. Major industrial companies and technology providers have adopted federated learning as a core capability. The inclusion of federated learning in government AI strategies and enterprise technology roadmaps indicates sustained development and adoption.
Organizations evaluating federated learning should focus on specific, high-value use cases where the approach's unique advantages—distributed data access with maintained privacy—directly address existing limitations in their AI scaling efforts.
For data and analytics leaders considering federated learning:
Federated learning is not appropriate for every AI application, but for organizations with distributed operations facing data access constraints, it offers a proven approach to scaling industrial AI effectively.
The manufacturers successfully implementing federated learning share common characteristics: they identified clear use cases, invested in data foundations, managed organizational change thoughtfully, and committed to learning through iterative implementation. Organizations willing to follow this disciplined approach can achieve similar results.
Kudzai Manditereza is an Industry4.0 technology evangelist and creator of Industry40.tv, an independent media and education platform focused on industrial data and AI for smart manufacturing. He specializes in Industrial AI, IIoT, Unified Namespace, Digital Twins, and Industrial DataOps, helping digital manufacturing leaders implement and scale AI initiatives.
Kudzai hosts the AI in Manufacturing podcast and writes the Smart Factory Playbook newsletter, where he shares practical guidance on building the data backbone that makes industrial AI work in real-world manufacturing environments. He currently serves as Senior Industry Solutions Advocate at HiveMQ.