0% found this document useful (0 votes)
46 views10 pages

2023 - Schleimer - Architecture DesignOptions For Federated Data Spaces

Uploaded by

anglingreuses.0c
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views10 pages

2023 - Schleimer - Architecture DesignOptions For Federated Data Spaces

Uploaded by

anglingreuses.0c
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Proceedings of the 56th Hawaii International Conference on System Sciences | 2023

Architecture Design Options for Federated Data Spaces

Anna Maria Schleimer Nils Jahnke Boris Otto


Fraunhofer ISST Fraunhofer ISST TU Dortmund University
TU Dortmund University [email protected] Fraunhofer ISST
[email protected] [email protected]

Abstract benefits like natural resource replenishment, as well as


The massive growth of data and the increasing carbon footprint tracking, and social responsibility
potential of data analytics in industrial production fuel actions can be realized (Khan & Abonyi, 2022).
the emergence of data spaces and corresponding However, industrial data have distinct properties that
platforms that realize data ecosystems and enable challenge data sharing: the data is typically
data-driven sustainability applications. To leverage characterized by massive volume, heterogeneous data
their benefits of demand-driven and scalable data types, real-time existence, and being sensitive to
integration, the stakeholders of emerging data space delays, as well as having considerable value potential
initiatives must make informed decisions about their (Dai et al., 2020). These characteristics create barriers
data space support platforms (DSSPs). This study to data sharing for applications focused on industry
proposes a conceptual framework based on federated sustainability and resilience, including lack of data
architectures and by considering existing endeavors of interoperability, trust, and privacy concerns (Z. Chen
data infrastructures. Based on existing literature & Huang, 2021; Walden et al., 2021). The need to
about data ecosystem resources and an explorative solve these issues and foster data sharing by
single case study of an industrial data space with implementing new data management, sharing, and
sustainability-focused applications, we elaborate on integration capabilities has led to the increasing
the key design options of data, services, and emergence of data spaces and data space support
computing infrastructures. The resulting conceptual platforms (DSSPs) (Franklin et al., 2005; Otto &
framework guides design decisions for DSSPs. The Jarke, 2019). For instance, the European Commission
framework captures not only the resources involved (2020) proposes a data space for traceability and
but also the operational concepts of federated services innovative services to improve social, environmental
and shared services to introduce governance and economic issues related to batteries. Hence, data
mechanisms and sustainability policies. ecosystems emerging on top of data spaces are
beneficial not only to individual companies but also to
Keywords: data spaces, federated architectures, data entire economies and societies (Capiello et al., 2020).
space support platform, data sharing, sustainable To enable ecosystems and new, data-driven
manufacturing applications, the concept of “data infrastructure as
platform” emerged as key enabler (Castro et al., 2021,
1. Introduction para. 5). In contrast to private and often central
platforms, data infrastructure platforms can also be
realized as open and public solutions (Beverungen et
The industrial sector currently faces a “data tsunami”
al., 2022). The underlying platform technologies can
from huge and complex data sets; which pose a
be considered a form of digital infrastructure due to
challenge to traditional processing and database
their enabling role for any applications built on top of
management tools (Zhong et al., 2016, p. 572) but also
them (Constantinides et al., 2018) and, thus, not only
hold immense potential for data analytics (Dai et al.,
enable single data spaces but also the purposeful,
2020). These opportunities include the realization and
organized federation of multiple data spaces. In
improvement of sustainable supply chain management
hierarchical systems such as a federated DSSP,
or circular economy applications that rely on
stakeholders must understand the architecture and the
technologies such as big data analytics, simulation, or
dependencies to come towards a well-defined set of
digital twins for industrial applications, and depend on
requirements from “informed deliberations among
inter-organizational shared data (Z. Chen & Huang,
stakeholders with shared as well as competing
2021; Mageto, 2021). With the resulting information
interests” (Whalen et al., 2012, p. 55). Using an
transparency facilitated by shared data, environmental

URI: https://hdl.handle.net/10125/103078
978-0-9981331-6-4 Page 3643
(CC BY-NC-ND 4.0)
architecture-driven approach, the following research involved in data ecosystems as modeled in Oliveira et
question arises: al.’s (2018) data ecosystem metamodel. Resources
What are the architecture design options contain data sets, systems infrastructure for storage
for federated industrial data spaces? and computing, and data-based software solutions.
These solutions can include reusable assets such as
In response to this question, this study proposes a components and services but also applications to
framework for structured design decisions according produce, provide, or consume data by different actors.
to the systems’ architectural decomposition (Whalen In addition to the business applications that process or
et al., 2012). The design options outline opportunities pre-process data, a set of resources is also required to
to realize sustainability and sovereignty-oriented enable data exchange and related communication
business applications and policies. The framework is between data space participants, which is generally
based on an explorative single case study of an realized via an additional abstraction layer. Figure 1
emerging large-scale data space initiative in the presents a four-layered model (Curry et al., 2019;
industrial sector that tackles sustainability-related use Curry, 2020) that illustrates the position of data
cases. Following Ridder (2017), the study is meant to services in the technology stack. Several not-for-profit
fill gaps in existing theory and therefore relies on a associations have suggested key concepts and roles for
research framework derived from the literature. In this this abstraction layer to support the domain-
paper we outline the design options on the basis of the independent standardization of DSSPs (Nagel &
resources defined in the data ecosystem metamodel Lycklama, 2021). In addition to such a standardized
(Oliveira et al., 2018) and the different layers defined set of services, the specific characteristics of industrial
in the federated architecture literature (Busse et al., data require a particular and flexible design of services
2000; Heimbigner & McLeod, 1985). The initiative and industrial DSSPs. First, the volume and velocity
represents an extreme case, since it (a) consists of of data flows are considerable and presuppose a highly
more than 100 participants from areas such as the scalable data management and integration concept that
automotive, manufacturing, and IT sections, including also considers the implications that different operating
small- and medium-size enterprises (SMEs); (b) systems such as cloud-edge combinations will bring.
enables data-driven use cases to foster sustainable The data is also private and highly protected, in
manufacturing and supply chain resilience; and (c) contrast to, e.g., information available as open data.
commits to leveraging the developments of multiple Manufacturing processes and supply chain networks
data infrastructure initiatives at the same time. also have their own hierarchies that demand easily
adjustable governance capabilities to modify the
2. Background framing conditions of a data sharing collaboration on
a case-by-case basis.
2.1. Data Spaces
Intelligent Applications
Data spaces enable demand-driven and flexible
data integration within and across domains (Curry, Data
2020). In the industrial sector, the term is often Middleware
commonly used to describe an alliance of
organizations that collaborate for data sharing Communication and Sensing
purposes. From a technical viewpoint, the term Figure 1. Framework to enable data ecosystems
describes a particular data integration concept (Curry et al., 2019; Curry, 2020, p. 8)
(Franklin et al., 2005; Halevy et al., 2006) enabled via
a set of enabling services that allow for scalability and
to integrate governance mechanisms (Curry, 2020). 2.2. Federated Architectures
The key characteristics of data spaces are integration
via semantic integration and vocabularies according to DSSPs create a balance between the autonomy of
Linked Data principles, remaining a decentral data the various participants while placing considerable
holding, and enabling nesting and overlaps of data demands on the ability to communicate and negotiate
(Franklin et al., 2005; Halevy et al., 2006). Data spaces between them, at both the technological and
are an enabler for data ecosystems, a term that organizational levels. These fundamental challenges
describes (analogously to biological ecosystems) a set are addressed in DSSPs’ architectural structure, which
of loosely coupled actors that jointly create value from present a federated architecture that connects
data and compete with data and service offerings decentralized databases to a joint data exchange group
(Jacobides et al., 2018). Different resources are

Page 3644
(Heimbigner & McLeod, 1985). The key concerns of (Kramberg & Heinzl, 2021) that can be private (Castro
federated architectures are autonomy and self- et al., 2021) or public (Beverungen et al., 2022). Some
organization of the involved entities while creating a federated services are realized as shared services,
“‘game field’ with the necessary rules and which are a common management concept for sharing
infrastructure supporting functions so that all of the costs among a collaboration network (Borman &
‘players’ are able to find the data they need” (Duan, Ulbrich, 2011; van Fenema et al., 2014). The key
2009, p. 166). Considering the basic types of network characteristics of federated services, however, are
models, the federated approach represents a their distributed nature, their ability to encompass
hierarchical model in which entities are organized into governance mechanisms, and their hierarchical
multiple layers, as shown in Figure 2. Although the network character, all of which allow for the
other models also offer advantages for distinct use inscription of properties such as standards or policies
cases, the hierarchical model holds benefits as it in a top-down manner. These characteristics mean that
reflects common corporate structures and offers a federated service can also be decentrally realized at
advantages by allowing conglomerates of different the autonomous entities’ location and still be governed
entities to have their own policies and processes; the in a top-down way without being a shared service.
hierarchical model also has a hierarchical control,

Presentation
discovery, and governance structure (Duan, 2009). Global Data Availability

Global

Layer
These demands and structures are also required for
production networks.
Dataspace

Federation
Dataspace
Dataspace

Layer
Dataspace
Dataspace
Foundation
Layer

Data Data
Local

central distributed hierarchical Data


Resource Resource Resource
Company A Company B Company C
Figure 2. Basic network model types
(Duan, 2009, p. 170) Figure 3. Data spaces as federated systems

From a conceptual perspective, federated


architectures may be distinguished into three different 3. Conceptual Framework
layers: the global presentation layer, the federation
layer, and the local layer; the local layer also includes 3.1. Research Approach
a wrapper layer (Busse et al., 2000). These layers can
also be described as overlay network layer, service This study follows the paradigm of design-
provider layers and peer-to-peer overlay network layer oriented research (Hevner et al., 2004) and realizes the
(G. Chen et al., 2008). A mapping of the traditional benefits of conceptual modeling and of single case
federated information systems layers to the data space studies. Conceptual models are generally abstractions
concept is displayed in Figure 3. In order to realize a that require simplification of the real system or the part
federated architecture, Steinke and Hommel (2018, of the real world they represent (Robinson, 2010).
p. 1) note that not only technologies but also the Case studies consist of the analysis of real-world
“management needs to become federated to support phenomena (Baskerville et al., 2018; Yin, 1981) and
the collaboration between multiple organizations”. A are characterized, among other aspects, by being
federated service thus enables organization and highly complex and focusing on one particular
execution among multiple autonomous entities and research question. Because data spaces and their
allows for the following of different hierarchies of technologies are still an emerging field and this study
governance and their policies. Such policies might be has an explorative character, a single case study from
related to interoperability or security. Within a DSSP, an established conceptual lens is a suitable approach
such capabilities for governance and technology to extend the body of knowledge (Yin, 2010). First,
policies are conceptually located in the federation the key concepts and structures are derived from
layer. Different alliances establish to design and literature and form a conceptual research framework
analyze different aspects of DSSPs (Otto & Jarke, (section 3.2). The study’s conceptual framework
2019). In addition, a DSSP is a form of data platform draws on federated architecture concepts (Busse et al.,

Page 3645
2000; Heimbigner & McLeod, 1985), data space services. While private business applications refer to
characteristics (Franklin et al., 2005; Halevy et al., privately owned applications for analytic purposes,
2006) and their belonging data ecosystem resources federated services refer to those that form a federation
(Oliveira et al., 2018). Second, the single-case study is and connect different autonomous participants.
analyzed with a strong emphasis on the federation Federated services may be shared services, as
layer and service design (section 3.3). Third, the visualized in Figure 4. They can also be realized
results are generalized (3.4). Subsequently, section 4 decentrally or via single intermediary business
continues with the application of the conceptual partners instead of as a collaborative network. The aim
framework in a circular economy use case requiring of the management concept of shared services is to
data sharing. consolidate services to reduce costs. If this approach is
applied across organizational boundaries, then
3.2. Research Framework organizations form a shared service network and can
collaborate to gain mutual benefits (Borman &
The conceptual framework used to systematically Ulbrich, 2011). Inter-organizational shared services
analyze the case is derived from Oliveira et al.’s foster process and output innovation while involving
(2018) data ecosystem metamodel as well as the layers multiple organizations (van Fenema et al., 2014).
of federated architectures (Busse et al., 2000; The vertical infrastructure dimension refers to the
Heimbigner & McLeod, 1985), which were explained deployment and operation of different services that
in sections 2.1 and 2.2, respectively. Because the process data and thus describes design related to
framework focuses on the design of a DSSP, different storage and computing options. This dimension also
data characteristics, service categories, and captures the quantity dimensions, which describe how
infrastructural options for hosting and computing are often a certain solution is instantiated.
of interest. The framework is displayed in Table 1 and
includes nine fields numbered from I to IX describing Shared Federated
the realization of data ecosystem resources on the
vertical dimension and the architectural dimension on
the horizontal dimension. Each field presents a
resource in a certain dimension, e.g., V the services on Federated
the federation layer. As DSSPs focus on the creation Shared
of a federation layer, this architectural layer is strongly
emphasized.

Table 1. Conceptual research framework


Figure 4. Federated and shared services
Dimension Data Service Infrastructure
Global I II III
3.3. Case Analysis
Federation IV V VI
Local VII VIII IX The following section describes the service
landscape of the industrial data space initiative under
The vertical data dimension refers to analytical the conceptual lens. The local level (VII, VIII, IX)
data as well as operational data and metadata for the includes data-driven business applications (VIII) that
purpose of data sharing. While analytical data is handle the productive data of individual data space
generally characterized by its applications in machine participants (VII). All decisions regarding the data,
learning and is often the object of interest in service, and infrastructure design and operation of
distributed data platforms and data markets, these business applications belong to the participants
operational data is involved in ongoing internal (IX), who consume data via the DSSP and may
operations and can be used in a DSSP as well provide their results (again via the DSSP) to other
(Dehghani, 2022; Inmon et al., 2019). Industrial data participants. The business applications realized via the
can have characteristics that lead to specific data- DSSP are characterized by being strongly reliant on
space-enabling service demands, such as volume or data from multiple sources and diverse participants,
policies. including SMEs and large industrial organizations. For
The vertical service dimension describes private example, to enable a carbon footprint calculation
business applications, federated services, and shared along the supply chain, a large set of data is required
that will demand different design options on the

Page 3646
federation layer. The DSSP provides cloud-agnostic include policies regarding ecological and social
endpoints in the form of interfaces and system properties. The case examined in this study has
adapters that can be adapted to any participant’s different subcategories of federated services, such as
systems and provide data and metadata, such as federated integration services, federated data
provenance information or policies for further services, and federated business services, as
processing. summarized in Figure 6. The federated services are
Dimensions IV–VI refer to the federation level. also categorized into being mandatory, optional, or
Services at this level (V) may be divided according to recommended. For example, the use of a specific
their operating model into those that exist only as one identity and certification service is mandatory for all
common instance provided by the alliance and those data exchange partners, as is the use of a registry for
that can exist in multiple instances in a decentralized digital twins. To incorporate these services,
manner among each participant (VI). The first option participants can leverage an open-source reference
of having only one instance of a service is referred to implementation. Further, it is envisioned that
as a central shared service. The services include commercial services with similar functionality will be
intermediary services that only handle metadata and available in the future.
self-information, such as the cataloging or logging of
data exchanges, as well as business services, which go Federated Service
beyond the intermediary function and are used to
process production data (IV). An example of the latter one or multiple instance at multiple instances shared
are services that anonymize a vehicle identification instances shared participants’ by autonomously defined
number to ensure data protection. The different by the alliance side subgroups of participants
options are described in Figure 5. One benefit in
particular of the central shared service is that one intermediary functions business functions
trusted instance is used that every participant has (metadata only) (metadata and data)
agreed on. On the other hand, central services also
place single-point-of-failures and bottlenecks of distributed central
technology performance, but also power structure. operation and hosting operation and hosting
This service can also represent a central trusted data Figure 6. Design options for federated services
source that contains information such as master data
and trust-relevant member information. Despite their When realized at the participant level, the
advantages, the aim in this case is to use as few central participants can decide to have multiple instances or
services as possible to avoid bottlenecks and increase shared versions in a subgroup (VI). Due to
scalability. interoperability concerns, most shared services are
also federated services, but this is not a necessity. In
Shared Service particular, atomically small and encapsulated services
on the bottom of the technology stack are not subject
one instance multiple instances to the federation agreements
shared by the alliance shared by the alliance The federation layer also comprises the data
infrastructure services. The interoperability of these
intermediary functions business functions services is driven by data infrastructure initiatives,
(metadata only) (metadata and data) which place a common layer between all data spaces
and have the goal of creating a global data space
distributed central (Dataspace Business Alliance, 2021). Such services
have a cloud-agnostic design (VI) and follow a
operation and hosting operation and hosting
predefined architecture and specifications (V), which
Figure 5. Design options for shared services allows for multiple implementation options and
multiple instantiations.
The services (V) can be additionally characterized The remaining global layer (I-III) describes the
as federated services. They can be shared services, but resulting data availability across data spaces. Enabled
they don't necessarily have to be, as depicted in Figure by the federation layer, the data on this global layer
4. Federated services enable a distributed nature and consists of shared data or data processing results (I).
autonomous usage while incorporating common The service (II) and infrastructure options (III) of the
agreements, which mainly consist of interoperability data are unrestricted and not only capture the data
and security aspects, such as policies for how to space participants but also their end users, who benefit
formulate outputs. They also provide the option to from data or service products based on the data space.

Page 3647
3.4. Resulting Conceptual Framework actor or holder of decision rights to determine data
usage. The services involved have different design
The case analysis of a particular industrial data options for the authority and design of governance,
space and its DSSP as an extreme case, based on a specifications, and implementation and whether the
domain-neutral conceptual framework that relates usage is mandatory. The infrastructural design options
architecture to resources, can be abstracted to a general address the decisions and implications about one
framework for industrial DSSPs that allows with versus multiple instantiations of services. Different
connectivity and governance design options to address operational and deployment options should also be
sustainability-relevant properties in Table 2. The considered to enable performance in the targeted
horizontal rows present the architectural layer and the industrial environment. When considering the
vertical columns the different resource design options. horizontal axis, the focus lays on the federation layer,
On the vertical axis are the data, services, and which is the key layer for balancing different
infrastructure options. For the data involved in a autonomy and communication purposes. At the local
DSSP, the design options exist to decide whether level, the business application category implies
productive data or only metadata will be processed. complete autonomy for participants in terms of the
The holder is also defined as being the responsible design and operation of their applications.

Table 2. Conceptual Framework


Data Service Infrastructure
Type and Holder

Operation and
Implementation
Specification
Governance

Deployment
Occurrence
Mandatory
Service

Usage
Category

Operation and deployment


Global

Global Data or services as business application result available for


* are up to the end-users and
Dataspace end-user provided by data space participants.
participants.
Defined The alliance defines how
Central The alliance defines, specifies, and implements
by 0…1 they are operated and
Service the central service.
alliance. where they are deployed.
Considering the
federation
Participants’ The alliance
Federated governance, Defined The alliance defines how
data to enable defines
Business multiple by * they are operated and
federated governance
Service specifications and alliance. where they are deployed.
Federation Layer

processing. rules.
implementations
exist.
Participants’ The alliance decides whether
Federated Defined The alliance defines how
metadata to additional federated intermediary
Intermediary by * they are operated and
enable data services are required and how they
Service alliance. where they are deployed.
sharing. are designed and realized.
Participants’ The
Different The alliance defines how
metadata and governance
Federated implementation Defined the services are operated
alliance’s self- and
Data Infra- can be used, by data and where they are
information if specifications 1…*
structure among them a infra- deployed, the data
required by are defined by
Service reference structure. infrastructures may also
data infra- the data
implementation. operate some of them.
structure. infrastructure.
Participants’
The complete design and governance of Operation and deployment
Local

Business data obtained


business application services belongs to the * are each participant’s
Application via the data
participant. decision.
space system.

Page 3648
The global level represents the presentation level 4. Enabling Sustainability and Resilience
of the resulting data and service availability, drawn
from the aggregation of several resources. On the In the following, the framework is applied to an
federation layer, industrial enterprises have various exemplary circular economy use case in pump
options for shared and federated services that can be manufacturing. Pumps used in industrial application
designed according to their needs. Such needs can be scenarios (e.g., as part of a chemical plant) consist of
manifested as interoperability, demands for policies components provided by different suppliers. They
and data sovereignty, and the security or performance include shaft, impeller, housing, bearing and motor,
demands that guide design decisions. One design amongst others. After the end-of-life of a pump,
option is the provisioning of a central service, which decisions must be made if the components can be
is only made available one time for the data space. repaired, refurbished, reconditioned, reprocessed, or
Federated services can occur multiple times, once or remanufactured. As main component, the motor is of
not at all, depending on their type. If federated services special interest as it can often be easily separated from
are handling metadata only, they are referred to as the other parts and be potentially reused or
federated intermediary services. If they are handling remanufactured for other applications. Further, the
actual data, they are labeled as federated business motor contains valuable elements such as rare earths
services. Data infrastructure services also represent that are being used for permanent magnets due to their
federated services where certain design decisions are high efficiency and high energy density (Li et al.,
made by the data infrastructure initiatives. 2019). In order to make sound reuse decisions, such as
Table 3. Conceptual Framework applied to Circular Economy Use Case
Category Example and Explanation Benefits to foster data sharing
The data availability creates
Global In sum the information about product lifecycle relevant for
information about different
Dataspace recycling decisions.
products during their lifecycle.
One commonly used frontend and a functionality that gives an A central portal allows for a
Central overview about data transactions by analyzing metadata, and single point of contact for end-
Service one service that analyzes the payload data to estimate the CO2 users to execute data sharing
savings reached via the product reuse decisions. via the data space activities.
Specialized services are required to detect toxic materials during By offering the distinct services
the product lifecycle and issues alerts. One service is performed that issue alerts, different
Federated as hyperscaler-based cloud solution, another version on a analysis methods can be used
Business European-hosted solution in case data is not allowed to leave and different hosting options
Service Europe, and another cloud service exists that demands extensive allow for compliance
high computing power due to distinct artificial intelligence conformity and to fulfill
algorithms to detect certain implications of complex materials. computing demands.
To realize circular economy
One service is a distinct logging service for audit reasons that
applications and data
includes a history of data exchange partners. Further services are
Federated integration via data space
a distinct search and query functions and a corresponding
Intermediary principles, different
catalog that is tailored to the circular economy needs.
Service interoperable metadata-
Additionally, a suitable data model is needed that fits the
processing services are
sustainability demands.
required.
Federated An identity management approach is selected, and necessary A standardized approach for
Data components and support systems are provided. For instance, the identity management allows to
Infrastructure eligible identity certificate providers are defined and how the easily connect to other data
Service identities are proved. spaces.
Raw data as basis of circular economy use case is collected on
Data is only shared on need-to-
participant level such as in PLM, ERP or MES systems. In-house
Business know basis and remains at
data and data obtained via the data space can be processed in
Application participant until data sharing
own applications to gain information that determines the
agreement is reached.
potential use and specific constraints of components.

Page 3649
opting for a remanufacturing of the motor versus sustainable system itself and prevents it from large re-
recycling of distinct parts and materials, data along the build demands.
whole lifecycle of the product is needed. For instance,
motor curve data measured during service may
indicate the wear of the device, environmental data 5. Discussion
gives hints on the contact of certain parts with toxic
substances that impact reusability from environmental The design of industrial DSSPs must consider
and safety perspectives, and demand data about parts different services as well as their processed data and
or materials enable to assess the economic benefits of operational options simultaneously. Service categories
different reuse options. Assuming the pump motor is distinguish between (a) shared and federated services,
given to a recycling service provider after its end-of- (b) the use of highly sensitive business data and
life, information about the scenarios mentioned above metadata, and (c) services that support the data space
is commonly not available as the data streams are defined by domain-neutral data infrastructure. The
interrupted between different stages of the product different categories also follow different business
lifecycle across stakeholders and systems (Wang & models. Consequently, the conceptual framework
Wang, 2019). To share and prepare the required data displays the nature of data infrastructures and
throughout the lifecycle, different data spaces support highlights their infrastructural characteristics (Hanseth
services are required that consider technological & Monteiro, 1998). Besides defining the services for
constraints, but also trust and governance aspects of each category, data space alliances must also decide
the stakeholders involved. what is mandatory to be used and what is not. This
In the light of the mentioned circular economy use decision covers whole service instances but also
case Table 3 illustrates how certain design choices dedicated governance rules, specifications, or
foster data sharing to achieve higher transparency for infrastructural options. The case study examined in the
sustainability actions. Providing and applying generic present study further distinguishes between optional
federated data infrastructure services enables the easy and recommended services. Notably, the demand of
integration of a broad range of participants and their some services may imply dependencies to other
data into different data spaces. Such multiple data services that become implicitly mandatory or can pose
space integration fosters the sharing of data across lock-in effects. The abstraction level of the conceptual
domains that may be crucial for some information framework (Table 2) allows for a unifying view on
chains. For example, sharing the carbon footprint of DSSPs of different natures and their comparison. The
manufacturing enterprises with banks may allow for focus on operational environments allows for
sustainable financing (Xu & Li, 2020). Supply chains comparing and composing different options. Different
may also cross different jurisdictions that require to operation options can be selected depending on the use
rely on common, fundamental agreements. Further, case’s specific threats and targets. For example, as
disruptive scenarios with dynamic changes of supply Adhikari and Winslett (2019, p. 974) note that “supply
chains due to interruptions (such as environmental chain data and its threat model are a good match for
disasters) or business interruptions due to new blockchains […] other fine-grained data from a factory
business models that require different data products floor can be valuable for manufacturing analytics, but
require flexibility in data spaces and participants. Next is a poor match for blockchains, due to its volume
to enabling uniformity and standardization with data [and] velocity”. This characteristic highlights the
infrastructures, at the same time the flexible design of necessity for different design options especially for the
added federated intermediary and business services infrastructural and operational aspects.
allows for purposeful tailoring to the demands of
single data spaces and staying flexible. This way, also
the adjustment and lowering of their energy and cloud
resource consumption is possible, as well as the ability
6. Conclusion, Limitations, and Outlook
to define own governance rules and machine-
interpretable information including ecological or This study has elaborated on the foundational
social fairness information besides data protection and concepts of a data space support platform (DSSP) and
interoperability ones. These information enables has proposed a conceptual framework for industrial,
informed decisions to grant or deny access to the federated data spaces aimed at creating information
whole data space or certain resources. transparency. The use of this model can ease the
Next to ecological or environmental aspects of design of DSSPs at an emerging development stage
sustainability, the system design also allows for long- and enables sustainable applications as well as design
term use, reliability and stability that makes it a decisions in manufacturing that are reliant on the data

Page 3650
shared across organizations. The following limitations Busse, S., Kutsche, R.‑D., & Leser, U. (2000). Strategies
must be considered, however. First, the case for the Conceptual Design of Federated
considered in this study is a single case and thus does Information Systems. In EFIS. Symposium
not allow for comparison between different cases. The conducted at the meeting of Citeseer.
Capiello, C., Gal, A., Jarke, M., & Rehof, J. (2020). Data
case is also a data space endeavor in the ramp-up stage Ecosystems: Sovereign Data Exchange among
and is not yet fully operationalized. The conceptual Organizations (Dagstuhl Seminar 19391).
analysis shows only a snapshot, and the concepts and Advance online publication.
services of the case have yet to be completely defined https://doi.org/10.4230/DagRep.9.9.66.
and may still change. Future research opportunities Castro, A., Machado, J., Roggendorf, M., & Soll, H.
could include a detailed analysis the remaining (2021). How to build a data architecture to drive
properties of data ecosystem resources of quality, innovation—today and tomorrow. McKinsey
standards, and license constraints (Oliveira et al., Digital. https://www.mckinsey.de/business-
2018). Doing so would allow for further locating functions/mckinsey-digital/our-insights/how-to-
build-a-data-architecture-to-drive-innovation-
production-specific standards and constraints in a today-and-tomorrow
more fine-grained manner. Additionally, key Chen, G., Low, C. P., & Yang, Z. (2008). Coordinated
components and sustainability-specific concepts could Services Provision in Peer-to-Peer Environments.
be added and refined as additional governance layer. IEEE Transactions on Parallel and Distributed
Closely related are also the implications of centralized Systems, 19(4), 433–446.
or decentralized service design and operation, https://doi.org/10.1109/TPDS.2007.70745
including the costs or any legal implications that arise. Chen, Z., & Huang, L. (2021). Digital twins for
information-sharing in remanufacturing supply
chain: A review. Energy, 220, 119712.
7. Acknowledgements https://doi.org/10.1016/j.energy.2020.119712
Constantinides, P., Henfridsson, O., & Parker, G. G.
This work has been supported by the German (2018). Introduction—Platforms and
Federal Ministry for Economic Affairs and Climate Infrastructures in the Digital Age. Information
Action in context of the GAIA-X4KI project (no. Systems Research, 29(2), 381–400.
19A21011E). https://doi.org/10.1287/isre.2018.0794
Curry, E. (2020). Real-time Linked Dataspaces : Enabling
Data Ecosystems for Intelligent Systems. Springer
8. References Nature. https://doi.org/10.1007/978-3-030-29665-
0
Adhikari, A., & Winslett, M. (2019). A hybrid architecture Curry, E., Derguech, W., Hasan, S., Kouroupetroglou, C.,
for secure management of manufacturing data in & ul Hassan, U. (2019). A Real-time Linked
industry 4.0. In 2019 IEEE International Dataspace for the Internet of Things: Enabling
Conference on Pervasive Computing and “Pay-As-You-Go” Data Management in Smart
Communications Workshops (PerCom Environments. Future Generation Computer
Workshops). Symposium conducted at the Systems, 90, 405–422.
meeting of IEEE. https://doi.org/10.1016/j.future.2018.07.019
Baskerville, R., Baiyere, A., Gregor, S., Hevner, A., & Dai, H.‑N., Wang, H., Xu, G., Wan, J., & Imran, M.
Rossi, M. (2018). Design Science Research (2020). Big data analytics for manufacturing
Contributions: Finding a Balance between internet of things: opportunities, challenges and
Artifact and Theory. Journal of the Association enabling technologies. Enterprise Information
for Information Systems, 19(5), 358–376. Systems, 14(9-10), 1279–1303.
https://doi.org/10.17705/1jais.00495 Dataspace Business Alliance. (2021). Unleashing the
Beverungen, D., Hess, T., Köster, A., & Lehrer, C. (2022). European Data Economy. https://data-spaces-
From private digital platforms to public data business-alliance.eu/
spaces: implications for the digital Dehghani, Z. (2022). Data mesh: Delivering data-driven
transformation. Electronic Markets. Advance value at scale. O'Reilly.
online publication. https://doi.org/10.1007/s Duan, N. (2009). Design Principles of a Federated Service-
12525-022-00553-z oriented Architecture Model for Net-centric Data
Borman, M., & Ulbrich, F. (2011). Managing Sharing. The Journal of Defense Modeling and
Dependencies in Inter-Organizational Simulation: Applications, Methodology,
Collaboration: The Case of Shared Services for Technology, 6(4), 165–176.
Application Hosting Collaboration in Australia. https://doi.org/10.1177/1548512909352790
In I. Staff (Ed.), 2011 44th Hawaii International European Commission. (2020, December 10). Green Deal:
Conference on System Sciences (pp. 1–10). I E E Sustainable batteries for a circular and climate
E. https://doi.org/10.1109/HICSS.2011.295 neutral economy [Press release].

Page 3651
https://ec.europa.eu/commission/presscorner/ Age - dgo '18 (pp. 1–10). ACM Press.
detail/en/ip_20_2312 https://doi.org/10.1145/3209281.3209333
Franklin, M., Halevy, A., & Maier, D. (2005). From Otto, B., & Jarke, M. (2019). Designing a multi-sided data
databases to dataspaces. ACM Sigmod Record, platform: findings from the International Data
34(4), 27–33. https://doi.org/10.1145/1107499. Spaces case. Electronic Markets, 29(4), 561–580.
1107502 https://doi.org/10.1007/s12525-019-00362-x
Halevy, A., Franklin, M., & Maier, D. (2006). Principles of Ridder, H.‑G. (2017). The theory contribution of case study
dataspace systems: 2006 ACM SIGMOD research designs. Business Research, 10(2), 281–
International Conference on Management of 305. https://doi.org/10.1007/s40685-017-0045-z
Data. Robinson, S. (2010). Conceptual modelling: Who needs it.
Hanseth, O., & Monteiro, E. (1998). Understanding SCS M&S Magazine, 2(7).
information infrastructure. Unpublished Steinke, M., & Hommel, W. (2018). A data model for
Manuscript, retrieved from http://heim. ifi. uio. federated network and security management
no/~ oleha/Publications/bok. pdf information exchange in inter-organizational IT
Heimbigner, D., & McLeod, D. (1985). A federated service infrastructures. In NOMS 2018 - 2018
architecture for information management. ACM IEEE/IFIP Network Operations and Management
Transactions on Information Systems (TOIS), Symposium (pp. 1–2). IEEE.
3(3), 253–278. https://doi.org/10.1109/NOMS.2018.8406162
Hevner, A. R., March, S. T., Park, J., & Ram, S. (2004). van Fenema, P. C., Keers, B., & Zijm, H. (2014).
Design Science in Information Systems Research. Interorganizational Shared Services: Creating
MIS Quarterly: Management Information Value across Organizational Boundaries. In T.
Systems, 28(1), 75–105. Bondarouk (Ed.), Advanced Series in
https://doi.org/10.2307/25148625 Management: Vol. 13. Shared services as a new
Inmon, W. H., Linstedt, D., & Levins, M. (2019). Data organizational form (Vol. 13, pp. 175–217).
architecture: A primer for the data scientist Emerald Group Publishing.
(Second Edition). Academic Press. https://doi.org/10.1108/S1877-
Jacobides, M. G., Cennamo, C., & Gawer, A. (2018). 636120140000013009
Towards a theory of ecosystems. Strategic Walden, J., Steinbrecher, A., & Marinkovic, M. (2021).
Management Journal, 39(8), 2255–2276. Digital Product Passports as Enabler of the
https://doi.org/10.1002/smj.2904 Circular Economy. Chemie Ingenieur Technik,
Khan, A. A., & Abonyi, J. (2022). Information sharing in 93(11), 1717–1727.
supply chains – Interoperability in an era of https://doi.org/10.1002/cite.202100121
circular economy. Cleaner Logistics and Supply Wang, X. V., & Wang, L. (2019). Digital twin-based
Chain, 5, 100074. WEEE recycling, recovery and remanufacturing
https://doi.org/10.1016/j.clscn.2022.100074 in the background of Industry 4.0. International
Kramberg, P., & Heinzl, A. (2021). Journal of Production Research, 57(12), 3892–
Datenplattformökosysteme [Data Platform 3902. https://doi.org/10.1080/00207543.2018.
Ecosystems]. HMD Praxis Der 1497819
Wirtschaftsinformatik, 58(3), 477–493. Whalen, M. W., Gacek, A., Cofer, D., Murugesan, A.,
https://doi.org/10.1365/s40702-021-00716-0 Heimdahl, M. P. E., & Rayadurgam, S. (2012).
Li, Z., Kedous-Lebouc, A., Dubus, J.‑M., Garbuio, L., & Your" what" is my" how": Iteration and hierarchy
Personnaz, S. (2019). Direct reuse strategies of in system design. IEEE Software, 30(2), 54–60.
rare earth permanent magnets for PM electrical Xu, X., & Li, J. (2020). Asymmetric impacts of the policy
machines – an overview study. The European and development of green credit on the debt
Physical Journal Applied Physics, 86(2), 20901. financing cost and maturity of different types of
https://doi.org/10.1051/epjap/2019180289 enterprises in China. Journal of Cleaner
Mageto, J. (2021). Big Data Analytics in Sustainable Production, 264, 121574.
Supply Chain Management: A Focus on https://doi.org/10.1016/j.jclepro.2020.121574
Manufacturing Supply Chains. Sustainability, Yin, R. K. (1981). The Case Study as a Serious Research
13(13), 7101. https://doi.org/10.3390/su13137101 Strategy. Knowledge, 3(1), 97–114.
Nagel, L., & Lycklama, D. (2021). Design Principles for https://doi.org/10.1177/107554708100300106
Data Spaces - Position Paper. https://design- Yin, R. K. (2010). Case study research: Design and
principles-for-data-spaces.org/ methods (4. ed.,). Applied social research
https://doi.org/10.5281/zenodo.5105744 methods series: Vol. 5. Sage.
Oliveira, M. I. S., Oliveira, L. E. R. A., Batista, M. G. R., Zhong, R. Y., Newman, S. T., Huang, G. Q., & Lan, S.
& Lóscio, B. F. (2018). Towards a meta-model (2016). Big Data for supply chain management in
for data ecosystems. In M. Janssen, S. A. Chun, the service and manufacturing sectors:
& V. Weerakkody (Eds.), Proceedings of the Challenges, opportunities, and future
19th Annual International Conference on Digital perspectives. Computers & Industrial
Government Research Governance in the Data Engineering, 101, 572–591.
https://doi.org/10.1016/j.cie.2016.07.013

Page 3652

You might also like