Bringing Intelligence to Steel: How SHS is Reshaping a Traditional Industry with AI by SHS Group

where the right approach to a problem is rarely obvious from the outset.

Can you give a concrete example of a recent R&D project?

A strong example is an EU-funded project to digitalise and partially automate the scrap handling process. We developed a new approach to building a dataset for steel scrap, then trained an AI model to classify different scrap varieties. Building on that, we trained a second model to predict the chemical composition of liquid steel based on the specific scrap inputs used.

Both models were genuinely experimental and success was not guaranteed going in. The results, however, were very promising, which made it a particularly satisfying project to work on.

How does the R&D team collaborate with the Specialised AI and GenAI teams?

The collaboration is close and deliberately flexible. There are no rigid boundaries between the teams. My team leads on research projects and carries the bulk of that work, but we work in close partnership with Anna’s team throughout.

As a model matures and moves from active development into ongoing supervision, it naturally transitions across. Equally, when our team is between funding cycles, we’ll contribute to projects in the other teams. I think that fluidity is essential. A purely research-focused team that operates in isolation risks losing touch with the realities of the plant floor. Maintaining close contact across all three departments keeps the work grounded and relevant.

What do you see as the most significant unsolved problems in industrial AI?

The hardest recurring challenge is optimising processes where the data you need simply doesn’t exist. Sometimes a subtle defect is causing problems somewhere in the process, but it’s so difficult to measure directly that you wouldn’t even know where to place sensors to capture it. It may be faintly visible beneath the noise in other sensor readings, but finding and isolating it is genuinely difficult.

The question we return to repeatedly is: how do you model a process when you lack the right data, and how do you find proxies that can take you closer to an answer? That challenge is compounded by the environment itself. A steel plant is inherently noisy and imperfect; nothing behaves the way it would in a controlled laboratory setting.

Your team works on experimental problems with uncertain outcomes. How do you persuade domain experts to invest time in projects that may never deliver a clear commercial result?

The honest answer is that if the commitment feels too open-ended to the people on the shop floor, the project simply won’t get off the ground and we respect that. What makes it work in practice is that the most experimental problems rarely need to be sold in. They tend to come to us.

An engineer has been sitting with an unsolved problem for years, sometimes decades. Typically this is something that resisted previous attempts because the techniques to tackle it didn’t yet exist. When they see an opportunity to finally address it, the motivation is already there. That intrinsic drive from the domain expert side is what makes these projects viable. Without it, the dynamic wouldn’t work.

We’ve worked hard to demonstrate that we are genuine partners and not just a technical team parachuting in with solutions.

That means sharing credit when things work, and being honest when they won’t, rather than letting people invest time in a dead end. The credibility we have today with the harder, more experimental problems is built entirely on the foundation of how we handled the easier, earlier ones. Without that track record, this kind of collaboration would be very difficult to sustain.

The future of heavy industry is often perceived to be physical. Do you see the steel industry heading in that direction?

Physical AI opens up genuinely compelling possibilities, particularly around safety. If autonomous systems can operate in environments that are hazardous for humans, the case for deploying them becomes very strong. It’s not just an efficiency argument, it’s a human one.

That said, I think widespread practical application in steel is still some years away. It’s a development we’re watching closely rather than actively building towards right now.

It’s relatively uncommon to have a dedicated research function within a team of 20 people. What motivated that decision?

Staying at the cutting edge requires active investment in research and it doesn’t happen passively. To remain competitive, you have to engage with the state of the art, not just apply what’s already well established.

Having a research capability gives us direct access to new ideas, and it enables us to connect with other steel plants, researchers, and data scientists across Europe. That exchange of methods, perspectives, and approaches to shared problems is a significant advantage. You learn things from those conversations that you simply wouldn’t encounter by staying within your own environment.

TOBIAS BETTINGER

GENAI AT SHS

Tobias, what role does generative AI play at SHS, and where is it currently being applied?

While GenAI dominates the public conversation about AI, it’s worth noting that it represents just one strand of a much broader AI programme at SHS. Currently, our GenAI work is focused on administrative functions.

One significant area is the technical feasibility analysis of customer requirements, a process that currently takes domain experts up to two weeks to complete manually. AI can accelerate that considerably. A second area is the automation of customer enquiry handling, where we receive large volumes of unstructured information per email that staff currently process by hand. Both represent strong candidates for AI-assisted workflows.

GenAI is associated with risks such as hallucinations and safety concerns. How do you manage those, and are there areas where you’ve chosen not to deploy it?

Those risks are real, and they directly shape our deployment decisions. In a steel plant environment, where safety is critical, the consequences of a hallucinated output could be serious. For that reason, we are not deploying fully agentic GenAI systems in production processes at this stage.

All current use cases retain a human in the loop as a deliberate safeguard. As the technology matures and our understanding of the risk profile deepens, production applications may become viable, but that is a decision for the future, not the present.

Are you working with multi-agent systems, or primarily single-agent approaches?

Mostly single-agent systems for now, though there are specific use cases where a multi-agent architecture is the more appropriate solution. Our current priority is to map and formalise the workflows that experienced human teams already follow and then translate that logic into agentic AI systems.

Getting that foundation right is the essential first step.

How do you capture the domain knowledge needed to build effective agentic systems?

The process mirrors what the wider AI department does across all its projects: close, structured collaboration with the people who hold the knowledge. We run workshops with domain experts to understand their daily work in detail: what decisions they make, what information they draw on, and what the logic of their workflows actually looks like.

That knowledge then forms the basis for designing the agent. Without it, you cannot build a system that genuinely replicates or supports the way people work. The workshops aren’t a preliminary step; they’re central to the whole process.

How do you establish guardrails in your GenAI systems to manage risk?

Our centralised GenAI platform is the cornerstone. It incorporates elicitation prompts at key points in workflows which are essentially human-in-the-loop checkpoints that intercept potential errors before they reach downstream processes.

Beyond that, we follow the same parallel testing approach used by the other teams: running the system alongside domain experts and process owners before any production sign-off. Governance is built in from the start, not added as an afterthought.

What are the biggest barriers to adoption?

Somewhat counterintuitively, it’s not resistance from people as our domain expert collaborators are generally engaged and motivated. The real friction is integration complexity.

Operating across a heterogeneous IT landscape with numerous proprietary systems makes accessing the right data structures and sources genuinely difficult. That is consistently where the most effort goes.

What is your GenAI model strategy: commercial foundation models or locally hosted alternatives?

Both, depending on the context. For the majority of use cases, we use OpenAI models deployed via Azure as they provide the capability and scale we need. Where sensitive data is involved, we switch to locally hosted models to ensure nothing leaves our environment.

Operating in a hybrid setup gives us the flexibility to make that distinction cleanly and consistently.

What has been your standout GenAI success so far?

Our internal GenAI platform. It gives employees across the business a single, governed interface for getting answers to everyday questions – replacing the fragmented process of searching across multiple systems. The fact that it also handles security and governance centrally makes it doubly valuable. It’s the foundation on which everything else is being built.

How does your team use GenAI in your own daily work?

Coding assistance is the most widely adopted application across our team and the broader IT function. The productivity gains are tangible. The important caveat, though, is that you need strong underlying expertise to use it well. Knowing how to frame instructions clearly and being able to evaluate the output critically requires genuine technical understanding. GenAI amplifies capability; it doesn’t replace the judgment needed to direct it effectively.

What emerging technologies or model developments are you most excited about?

The pace of development in GenAI is itself remarkable. Release cycles are extremely short, and keeping up is a genuine challenge. Rather than a specific model, what excites me most is the trajectory: each new release expands what’s possible, and that momentum shows no sign of slowing.

The shift I find most significant is the move from very large, general-purpose language models towards smaller, more specialised ones. The practical implication is striking. Within weeks rather than months, it will be possible to run powerful multimodal models on standard hardware. That dramatically lowers the barrier to deployment and opens up new possibilities for how and where we apply these models within our environment.

What is the strategic vision for AI at SHS going forward?

The goal is straightforward: to make the entire SHS group AI-powered. Not AI for its own sake, but AI applied wherever it delivers genuine, measurable value.

The pieces are coming together, and the direction is clear. Over the next few years, we expect AI to be operating across significantly more of our processes, optimising, automating, and further strengthening our competitiveness.

Bringing Intelligence to Steel: How SHS is Reshaping a Traditional Industry with AI by SHS Group

THE EVOLUTION OF AI AT SHS GROUP

For those unfamiliar with the steel industry, what does a modern steel plant look like today?

Can you describe the mission of your AI team within SHS?

How has the AI team grown, and what does it look like today?

How did the AI function evolve from those early days to where it is now?

Can you describe what each of the three sub-departments focuses on?

How would you describe the operational complexity of your data environment?

Why does SHS operate its own data centres rather than relying on the cloud?

ANNA VOCKE

SPECIALISED AI IN STEEL MANUFACTURING

What has actually changed in steel manufacturing as a result of AI?

How is the Specialised AI team structured?

What does ‘specialised AI’ actually mean in your context?

What methods and techniques does the team draw on?

What types of operational problems is the Specialised AI team solving on a daily basis?

What does poor data quality look like in practice within a steel plant?

How reliable is sensor data compared to other data sources in the plant?

Where does poor data quality cause the most damage in practice?

Can modelling from first principles bridge these gaps?

What does a robust data pipeline look like in a steel plant environment?

Which AI use cases have delivered the greatest value at SHS?

Beyond model building, what are the biggest technical challenges you face?

How do you validate models before they go live in production?

How do you detect model degradation or failure in a live environment?

How do domain experts, who are not data specialists, develop confidence in working with AI models?

ULRIKE FALTINGS

AI FOR R&D

What does your R&D team actually do, and how does it differ from the other AI teams?

What backgrounds does the R&D team bring together?