Sustainable Supply Chain Design with GAE
Sustainable Supply Chain Design with GAE
To cite this article: Yuhan Guo, Runsheng Chen, Youssef Boulaksil & Hamid Allaoui (05
Aug 2025): Modelling and solving sustainable supply chain network design based on graph
autoencoder clustering algorithm, International Journal of Production Research, DOI:
10.1080/00207543.2025.2542506
Article views: 89
Modelling and solving sustainable supply chain network design based on graph
autoencoder clustering algorithm
Yuhan Guo a , Runsheng Chen b , Youssef Boulaksil c and Hamid Allaoui d
a School of Artificial Intelligence and Information Engineering, Zhejiang University of Science and Technology, Hangzhou, People’s Republic of
China; b School of Software, Liaoning Technical University, Huludao, People’s Republic of China; c College of Business and Economics, UAE
University, Al Ain, United Arab Emirates; d Laboratoire de Génie Informatique et d’Automatique de l’Artois (LGI2A), Univ. Artois, UR 3926,
Béthune, France
CONTACT Yuhan Guo [Link]@[Link] School of Artificial Intelligence and Information Engineering, Zhejiang University of Science and
Technology, Hangzhou 310023, People’s Republic of China
This article has been corrected with minor changes. These changes do not impact the academic content of the article.
design with low responsiveness may be impractical and issues (Chattopadhyay, Sengupta, and Sahay 2016). How-
difficult to implement in real-world scenarios. ever, traditional clustering algorithms lack a learning
Supply chain responsiveness is conventionally mea- mechanism, which limits their ability to maintain solu-
sured through production time and transportation time, tion quality and stability, especially when solving large-
two operational metrics that capture temporal efficiency scale instances (Ran et al. 2024). These limitations arise
across manufacturing and distribution processes (Moh’d because traditional algorithms do not adapt to changes
Anwer 2022). This temporal dimension of responsive- over time or adjust based on new data inputs, leading to
ness has been widely recognised as a critical precur- decreased effectiveness in dynamic environments.
sor to organisational resilience (Bronzo et al. 2024). Graph autoencoder (GAE) offers a powerful solution
While comprehensive resilience frameworks incorpo- to the challenges faced in sustainable supply chain net-
rate multi-phase capabilities from disruption antici- work design through the use of graph neural networks.
pation to system recovery (Emami, Lorenzoni, and Unlike traditional clustering algorithms, GAE provides
Turchetti 2024), optimising temporal responsiveness several key advantages when addressing large-scale sus-
establishes essential infrastructure for achieving these tainable supply chain challenges. First, GAE captures the
capabilities. Empirical evidence further confirms that intricate interconnections among nodes in the supply
accelerated production-transportation cycles enhance chain network, enabling a more accurate representation
supply chain agility and structural robustness under of the system’s topology Dong et al. (2023). Second, GAE
uncertainty (Ha and Lee 2024). The established correla- can handle both structured and unstructured data, mak-
tion between responsiveness and resilience thus justifies ing them versatile tools for dealing with the diverse types
using these time-based metrics as practical operational of information encountered in supply chain manage-
proxies in resilience optimisation. ment. Third, GAE is capable of learning latent represen-
Traditional supply chain design is a well-known tations that preserve important features while reducing
NP-hard problem. Incorporating sustainability criteria dimensionality, thus enhancing the interpretability and
requires the inclusion of numerous additional variables, efficiency of the clustering process (Ren et al. 2024). Most
constraints, and objective functions, which significantly importantly, GAE leverage the high-level information
increase the model’s complexity. Madani, Saihi, and embedded in the data of sustainable supply chains and
Abdelfatah (2024) reviewed 102 papers on sustainable fully exploit the non-Euclidean structure, which cannot
supply chain network design and found that most stud- be achieved with traditional clustering methods such as
ies rely on conventional solvers to address the generated k-means. Moreover, some studies have shown that neural
models. However, the size and scope of variables and con- networks exhibit significant potential in solving mixed-
straints in real-world sustainable supply chain models integer programming problems (Nair et al. 2020) and
make it infeasible to obtain solutions within a reasonable that graph clustering algorithms outperform other meth-
timeframe. While heuristic and metaheuristic methods ods in terms of stability when applied to general data
can significantly reduce computational complexity, they objects (Li, Zhang, and Zhang 2021), indicating their
often result in solutions of lower quality and stability. suitability for optimisation tasks in complex supply chain
Therefore, there is an urgent need for efficient and robust scenarios. Despite these advantages, however, there has
solution methods to address large-scale, complex sustain- been limited substantial research applying GAE specifi-
able supply chain problems, which is the focus of this cally to the problem of sustainable supply chain network
study. design. By integrating GAE into sustainable supply chain
Given that the nodes within a sustainable supply chain design, we address a critical gap in the literature, pro-
network are interconnected, clustering algorithms can viding a novel perspective that enhances both academic
leverage these relationships to reduce the problem’s size research and practical applications.
by breaking down large supply chain challenges into Based on the aforementioned motivations, we propose
smaller, more manageable subproblems. As a result, they a novel clustering mapping algorithm based on graph
have been adopted to address complex issues across var- autoencoder in this study. We integrate GAE with cluster-
ious levels of sustainable supply chain problems. For ing algorithms to capture key features and relationships
instance, researchers have used clustering algorithms within supply chain networks, thereby constructing a
to evaluate the sustainable performance of suppliers more flexible and efficient supply chain network model
and regions (Jabbarzadeh, Fahimnia, and Sabouhi 2018; capable of addressing complex sustainability issues. The
Sabouhi, Jabalameli, and Jabbarzadeh 2021), identify proposed algorithm tackles the challenges of high com-
spatial locations for national carbon-to-fuel infrastruc- plexity and low solution efficiency in large-scale models.
ture (Kim, Do, and Kim 2024), form production route Experimental results demonstrate that the algorithm not
sets (Geiger 2024), and tackle other complex supply chain only provides high-quality solutions close to optimal in
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 3
large-scale instances but also exhibits fast runtime and Wang et al. 2023). The application domains of these mod-
robust stability. Notably, this study represents the first els show a shift in focus over time, reflecting emerging
attempt to model and solve sustainable supply chain net- global challenges and trends, with particular attention
work design problems using graph clustering algorithms. to topics such as mining, food security and pandemics.
The key contributions of this study are summarised as However, although these models can help decision-
follows: makers optimise their sustainable supply chains in vari-
ous fields, few models provide a universal framework that
• We model the supply chain network structure based can be applied across different sectors.
on intercity distances, extending the traditional point- Most studies have focussed on economic objectives
to-point problem and making the model more suitable within sustainability. For example, Sabogal-De La Pava
for clustering representation. et al. (2021) used market value maximisation as the mea-
• We incorporate the responsiveness indicator into the sure of economic performance. Some studies have also
sustainable supply chain model, thereby filling the gap integrated environmental sustainability into economic
in relevant research. goals, balancing profitability and pollution reduction. Cai
• We propose a novel method that abstracts potential and Jiang (2023) introduced decision models of pricing
participants’ attributes to generate a feature matrix, and carbon emission reduction for low-carbon supply
uses a generative model to create sparse graphs, and chain under cap-and-trade regulation. Elhedhli, Gzara,
employs GAE to learn and classify the graph informa- and Waltho (2021) developed a profit-maximisation
tion, leading to efficient problem-solving. model that examined how carbon emissions at differ-
• We introduce an accurate mapping mechanism to ent stages of the supply chain affect consumer demand.
translate abstract solutions into real-world decisions, Similarly, Valderrama et al. (2020) proposed a multi-
providing timely decision support for enterprises. level, multi-period, multi-product mining supply chain
network design model aimed at reducing greenhouse
The remainder of this paper is organised as fol- gas emissions while minimising costs. It is evident
lows. In Section 2, we review relevant models and algo- that the models for both economic and environmen-
rithms from the literature. In Section 3, we present the tal dimensions have innovated in response to chang-
mathematical model for sustainable supply chains. The ing global mechanisms, such as the proposal of market
GAE-based clustering method and its three-step solu- value maximisation in economics and the integration
tion process are described in Section 4. In Section 5, of cap-and-trade in environmental regulation. However,
we discuss experimental results and managerial insights; the key challenge remains in developing a measure-
Section 6 provides conclusions and suggestions for future ment framework for the social dimension and achiev-
research. ing a balanced approach across all three sustainability
dimensions.
A few studies have sought to balance all three sus-
2. Literature review
tainability dimensions, offering more comprehensive
This section reviews the mathematical models, solving approaches. For instance, Varsei and Polyakovskiy (2017)
and clustering algorithms in sustainable supply chain introduced a general model for sustainable wine supply
network design. chain network design, incorporating economic, environ-
mental, and social objectives. Their findings showed that
the model helped decision-makers balance competing
2.1. Mathematical models
sustainability goals more effectively. Naeeni and Sab-
Numerous studies have proposed mathematical mod- baghi (2022) proposed a multi-objective mixed-integer
els that incorporate sustainability dimensions across dif- programming model for sustainable supply chain net-
ferent contexts, such as wine (Varsei and Polyakovskiy work design, addressing economic, environmental, and
2017), mining (Valderrama et al. 2020), perishable prod- social objectives. Their analysis of experimental results
ucts (Biuki, Kazemi, and Alinezhad 2020), biofuels suggested a more sustainable and balanced network
(Zarrinpoor and Khani 2021), food (Reusken, Cruijssen, design that outperformed existing configurations in all
and Fleuren 2023; Yadav et al. 2021), natural gas (Baz- three sustainability dimensions. The ability to success-
yar, Zarrinpoor, and Safavian 2021), municipal solid fully balance these three dimensions requires not only
waste (Eghbali, Arkat, and Tavakkoli-Moghaddam 2022), advanced modelling techniques but also an understand-
epidemic (Hosseini and Ivanov 2022; Katsoras and ing of the interactions and trade-offs between them.
Georgiadis 2022), maritime transport (Kong, Liu, and For example, while reducing environmental impact may
Chen 2023), and personal protective equipment (Bill lead to short-term economic costs, it may simultaneously
4 Y. GUO ET AL.
offer long-term social benefits, such as improving public et al. (2021) introduced a fresh food distribution network
health or enhancing community well-being. This balance model, one of the few that considered delivery time,
is essential for creating truly sustainable supply chains using it as a social sustainability metric. Moreover, trans-
that can meet the needs of both present and future gen- portation plays a critical role in supply chain network
erations. design and is one of the major contributors to green-
Regarding uncertainty, most research has focussed house gas emissions (Joshi 2022). While some stud-
on uncertainties in raw material supply, facility capac- ies have explored how selecting appropriate transporta-
ity, and product demand. For example, Das, Shaw, and tion modes can reduce economic and environmental
Irfan (2020) introduced a multi-product, multi-echelon impacts (Brahami et al. 2022; Sherafati et al. 2020), few
supply chain network design model addressing sustain- have considered transportation time as a key factor in
ability goals under uncertain demand and capacity con- their models.
ditions. Lahri, Shaw, and Ishizaka (2021) proposed a Integrating responsiveness into sustainable supply
two-stage, multi-objective possibilistic integer linear pro- chain design can produce multiple benefits, promoting a
gramming model aimed at maximising social sustain- more resilient and sustainable operational model (Seti-
ability while minimising economic and environmental awan, Tarigan, and Siagian 2023). Specifically, our
objectives. This model tackled uncertainties arising from emphasis on responsiveness distinguishes our model
fluctuations in demand, capacity, and emissions. Jiyao from others.
Gao and You (2017) developed a stochastic mixed-integer
linear fractional programming model to handle various
2.2. Solving and clustering algorithms
uncertainties related to raw material supply and product
demand. The Sustainable Supply Chain Network Design (SSCND)
Additionally, many researchers have addressed trans- problem is typically a combinatorial optimisation prob-
portation issues related to sustainable supply chain. For lem, where the objective is to find the optimal solu-
example, Sherafati et al. (2020) identified transportation tion from a finite set of feasible options. For small-
as a major source of greenhouse gas emissions and pro- scale instances, exact algorithms have been proposed
posed sustainable supply chain network designs to miti- to solve these problems, such as Benders Decom-
gate these issues, along with appropriate carbon emission position Shaw et al. (2016) and Lagrangian Relax-
regulatory mechanisms. Vafaei et al. (2020) introduced a ation (Moheb-Alizadeh, Handfield, and Warsing 2021).
mixed-integer programming model for a sustainable dis- For example, Jiyao Gao and You (2017) developed a
tribution network, considering multiple products, multi- parametric decomposition multi-cut L-shaped method,
ple echelons, and various transportation modes, particu- and Guo et al. (2019) proposed a decomposition-based
larly third-party logistics. Brahami et al. (2022) proposed distributed approximation method. While these exact
a multi-objective model aimed at minimising both trans- algorithms are effective for small-scale problems, they
portation costs and environmental impact by consider- face significant challenges in large-scale instances due
ing the environmental implications of different trans- to the exponential growth in computational complexity.
portation modes. Chadha, Ülkü, and Venkatadri (2022) Their reliance on precise formulations and exhaustive
proposed a physical internet supply chain model with searches limits their practicality in real-world sustain-
peddling and shipment consolidation to optimise freight able supply chain networks, which involve a vast num-
delivery. Considering the growing importance of trans- ber of variables and constraints. Therefore, more and
portation in sustainable supply chains, we fully con- more studies have focussed on improving heuristic or
sider the transportation aspect, separately analyse the metaheuristic methods.
key transportation-related indicators and provide valu- Heuristic and metaheuristic methods have been
able managerial insights into optimising transportation widely applied to solve large-scale supply chain net-
in sustainable supply chain networks. work problems. For instance, Zahiri, Zhuang, and
An overview of the model from the literature is Mohammadi (2017) proposed a Pareto-based meta-
presented in Table 1. According to the latest review heuristic algorithm, and Sabogal-De La Pava et al. (2021)
by Madani, Saihi, and Abdelfatah (2024), only a few designed a primal heuristic approach based on optimisa-
studies have incorporated responsiveness into sustain- tion. Habibi, Abbasi, and Chakrabortty (2023) developed
able supply chain models. For example, Moheb-Alizadeh, a population-based simulated annealing metaheuristic
Handfield, and Warsing (2021) was one of the few studies algorithm and improved the Grey Wolf Optimiser. Guo
focussed on balancing efficiency and sustainability, but it et al. (2023) proposed a heuristic algorithm that inte-
only focussed on improving facility resource efficiency in grates a Markov decision-making process and informa-
closed-loop supply chain networks. Furthermore, Yadav tion accumulation mechanism with the exploration of
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 5
the solution space. Benmamoun et al. (2024) introduced leading to potential premature convergence or subop-
a novel bio-inspired metaheuristic algorithm called the timal solutions. Additionally, GA-based methods can
Wombat Optimiser, specifically designed for supply be computationally intensive for large-scale problems,
chain optimisation problems. Kumar and Kumar (2024) and their performance heavily depends on parameter
proposed a greedy heuristic method tailored for solv- tuning, which can be time-consuming and problem-
ing large-scale decision problems. While these meth- specific. Furthermore, these approaches may lack adapt-
ods provide computational efficiency and flexibility, they ability to dynamic and uncertain environments, which
often suffer from several limitations, such as premature are common in real-world sustainable supply chain
convergence, lack of global optimality guarantees, and scenarios.
sensitivity to parameter settings. Moreover, many studies have integrated multiple algo-
Among various heuristic and metaheuristic appro- rithms to improve performance. Quddus et al. (2017)
aches, genetic algorithms (GA) are the most widely combined the Sample Average Approximation method
applied and refined. Shen (2020) proposed a hybrid with an enhanced progressive hedging hybrid decom-
GA based on variable-length chromosome encoding, position algorithm, employing rolling horizon heuris-
while Brahami et al. (2022) developed an improved tics and variable fixing techniques to enhance the PH
algorithm based on Non-dominated Sorting Genetic algorithm. Rabbani et al. (2018) used a combination of
Algorithm II. Yao Gao et al. (2024) proposed a two- NSGA and multi-objective particle swarm optimisation
stage adaptive GA to address the challenges of scale and to solve multi-objective problems. Biuki, Kazemi, and
complexity in supply chain networks. Sun et al. (2024) Alinezhad (2020) solved supply chain problems through
designed a model combining a minimum spanning tree hybrid metaheuristic algorithms, showing that parallel
with a two-layer adaptive GA, and Zhen Wang, Ye, and hybrid algorithms outperform serial ones. Mohammed,
Guo (2024) tackled multi-objective optimisation prob- Al-Shaibani, and Duffuaa (2023) proposed a hybrid
lems using an enhanced multi-objective GA. The exten- metaheuristic method combining simulated annealing,
sive focus on GA-based methods demonstrates their tabu search, and variable neighbourhood search. Gang
adaptability and effectiveness. However, such improve- Wang (2023) developed a hybrid algorithm combining
ments based on GA still have certain limitations. For branch-and-price with a primal heuristic approach. Mar-
instance, these enhancements often struggle with balanc- tins and Pato (2024) introduced three decomposition
ing exploration and exploitation in the solution space, and heuristic methods to successfully address large-scale
6 Y. GUO ET AL.
real-world problem instances. Although these hybrid Sengupta, and Sahay (2016) proposed an algorithm based
algorithms have demonstrated notable improvements in on hierarchical self-organising maps that used visual
solving efficiency and solution quality, they primarily clustering diagrams to decompose large-scale problems
focus on integrating and enhancing existing methods into independent sub-problems. An overview of the algo-
rather than achieving significant breakthroughs in algo- rithms from the literature is presented in Table 2.
rithmic innovation. Although these clustering algorithms generate promi-
Research has shown that the complexity of variables sing results in various applications of sustainable sup-
and constraints in sustainable supply chain network ply chain management, they exhibit instability when
models makes finding feasible solutions within a reason- solving large-scale instances due to their lack of learn-
able timeframe a significant challenge (Madani, Saihi, ing capabilities (Ran et al. 2024). Graph autoencoder
and Abdelfatah 2024). As mentioned before, clustering (GAE) can learn data features through the training of
algorithms can leverage the relationships between nodes graph neural networks and demonstrate good stability
in a supply chain network to classify similar connec- on general data, which motivate us to adopt them in this
tions, reducing the problem size and decomposing large- study.
scale problems into smaller, solvable sub-problems. Many
studies have investigated the application of clustering
algorithms in various areas of sustainable supply chain
problems, such as assessing sustainability performance, 3. Problem description and model
facility location, and production route construction. For establishment
instance, Jabbarzadeh, Fahimnia, and Sabouhi (2018) 3.1. Problem description
developed a stochastic bi-objective optimisation model
that used the fuzzy c-means clustering method to quan- This paper examines a four-tier sustainable supply chain
tify and assess the sustainability performance of suppli- network comprising suppliers, producers, distributors,
ers. Sabouhi, Jabalameli, and Jabbarzadeh (2021) pro- and retailers. Raw materials procured from suppliers are
posed a multi-period multi-objective optimisation model transported to the respective producers. After being pro-
that applied k-means clustering to evaluate regional sus- cessed by producers, the products are then transported
tainability performance and inform procurement and to the corresponding distributors for temporary storage.
network design decisions. Kim, Do, and Kim (2024) Subsequently, distributors fulfil orders by transporting
applied the Expectation-Maximisation algorithm based goods to retailers. Each transportation selects suitable
on Gaussian Mixture Models to cluster carbon emission transporters to complete various stages. In this paper, the
sources and determine supply regions. For production model is refined based on three sustainable indicators:
routing problems, Geiger (2024) utilised several cluster- cost, carbon emissions and lead time. These indicators
ing methods to construct route sets, and Chattopadhyay, are further subdivided into the following specific metrics:
• Due to differences in geographical location, scale, pur- • Capacity: Production capacity is measured based on
chasing capacity, and cooperative relationships among the workload of each producer’s workers. Storage
various suppliers, the procurement costs of suppliers capacity is determined by the number of pallets (or
are different. shelves). Transportation capacity is calculated based
• Some raw materials do not require processing; there- on the number of lorries in the fleet and the number
fore, it is assumed that suppliers do not bear raw of containers each lorry can accommodate. Addition-
material processing costs. Additionally, for certain raw ally, each producer, distributor, and transporter has a
materials, protective measures during transportation carbon emission quota that must not be exceeded.
are unnecessary, leading to the additional assumption • Location: To enhance the model’s compatibility with
that no containers are required when transporting raw the graph autoencoder clustering algorithm, we mod-
materials. ify the distance attribute used in traditional supply
• Producers have different production capacities, types chains and substitute it with the location attribute.
of energy, and processing technologies, resulting in This modification is explained in detail in the subse-
differences in costs, carbon emissions and processing quent section.
times for producing products. • Economic: Production costs encompass all expenses
• Distributors possess different storage capacities and related to product manufacturing, including labour
storage methods, resulting in differences in costs and costs, machinery costs, and energy consumption costs.
carbon emissions of storing products. Storage costs cover all expenses associated with stor-
• Transporters have different transportation capacities, ing products, such as handling costs, packaging costs,
methods, efficiencies, and used fuels. These differ- and energy consumption costs. Transportation costs
ences lead to variations in transportation costs, carbon include all expenses related to the transportation, such
emissions, and times in transporters. as rental costs for transport vehicles, container costs,
and energy consumption costs.
To maintain the tractability and solvability of the pro- • Environmental: For clustering purposes, we focus on
posed model, we adopt several simplifications commonly the carbon emissions from each stage in the model,
used in the sustainable supply chain network design lit- which include emissions from processing, storing, and
erature. Specifically, the model assumes deterministic transporting.
demand, fixed facility capacities, and linear cost struc- • Lead time: We select two components most closely
tures. This controlled modelling design follows estab- associated with lead time–production time and trans-
lished practices in computational logistics research where portation time–to ensure the responsiveness of the
model simplification serves to evaluate novel algorithmic sustainable supply chain network. Production time
architectures (Shoaib, Mustafee, and Ramamohan 2025). is influenced by factors such as facility size, worker
Specifically, maintaining linear program formula- speed, and machinery production efficiency. Trans-
tions and static parameters isolates the graph clus- portation time is affected by factors including work-
tering method’s topological decomposition capability ers’ packaging speed, transportation methods, and
from confounding optimisation effects – a critical step weather conditions.
when benchmarking computational efficiency in large-
scale SSCND problems. While acknowledging that real-
world implementations require stochastic programming
3.4. Objective functions
extensions, the current abstraction enables: polynomial-
time complexity analysis of cluster generation algo- Based on the problem description, assumptions, and
rithms, unambiguous attribution of solution quality symbol definitions provided, the objective functions of
improvements to network partitioning mechanisms, the sustainable supply chain network model established
and establishment of baseline performance metrics for are as follows:
future comparative studies with robust optimisation
variants. • Economic objective function comprises two compo-
nents, functional cost and transportation cost. Func-
tional cost encompasses the expenses of procuring
3.3. Symbol description
raw materials from suppliers, processing raw materi-
This section provides a detailed explanation of relevant als into products by producers and storing products
symbols. The sets, constants, and decision variables asso- by distributors. Transportation cost contains logistical
ciated with the model are summarised in Table 3. expenses between different levels in the supply chain
8 Y. GUO ET AL.
network. + qdk rl SCDk
k∈K l∈L
⎛
+ qdk tm rl NCTm CCTm dk,l
Objeco = min ⎝ qsi pj CSi m∈M k∈K l∈L
i∈I j∈J (1)
+ qsi tm pj CTm di,j • Environmental objective function encompasses two
m∈M i∈I j∈J
components, functional carbon emissions and trans-
+ qpj dk CPj portation carbon emissions. Functional carbon emis-
j∈J k∈K sions entail the emissions of processing raw materials
into products by producers and storing products by
+ qpj tm dk NCTm CCTm dj,k distributors. Transportation carbon emissions denote
m∈M j∈J k∈K
the emissions of conveying raw materials or products
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 9
by transporters. qsi tm pj + qpj tm dk + qdk tm rl
⎛ i∈I j∈J k∈K
Objenv = min ⎝ qsi tm pj CETm di,j ≤ CapTm (m ∈ M) (9)
m∈M i∈I j∈J
Equation (5) represents the constraint on the maxi-
+ qpj dk CEPj mum supply capacity of suppliers; Equation (6) rep-
j∈J k∈K resents the constraint on the maximum production
capacity of producers; Equation (7) represents the
+ qpj tm dk NCTm CECTm dj,k
constraint on the maximum storage capacity of dis-
m∈M j∈J k∈K
tributors; Equation (8) represents the demand con-
+ qdk rl SCEDk straint of retailers; Equation (9) represents the con-
k∈K l∈L straint on transportation capacity of transporters.
• Carbon emission constraints
+ qdk tm rl NCTm CECTm dk,l
qpj dk CEPj ≤ CQPj (j ∈ J) (10)
m∈M k∈K l∈L
k∈K
(2)
qdk rl SCEDk ≤ CQDk (k ∈ K) (11)
• Responsiveness objective function comprises two ele- k∈K
ments, the production time and transportation time.
qsi tm pj CETm di,j
⎛
i∈I j∈J k∈K
Objeff = min ⎝ qpj dk TPj + qpj tm dk NCTm CECTm dj,k
j∈J k∈K
+ qdk tm rl NCTm CECTm dk,l ≤ CQTm (m ∈ M)
+ qsi tm pj TTm di,j (12)
m∈M i∈I j∈J
Equation (10) represents the carbon emission con-
+ qpj tm dk TTm dj,k straint when producers produce products; Equa-
m∈M j∈J k∈K tion (11) represents the carbon emission constraint
when distributors store products; Equation (12) rep-
+ qdk tm rl TTm dk,l (3) resents the carbon emission constraint when trans-
m∈M k∈K l∈L porters transport raw materials or products.
• Consistency constraints
Utilising weighted aggregation, the multi-objective
optimisation model is converted into a single-objective qsi pj = qpj dk (13)
optimisation model, as illustrated in Equation (4). Here, i∈I k∈K
α, β and γ denote the weights assigned to each objective.
qpj dk = qdk rl (14)
Obj = αObjeco + βObjenv + γ Objeff (4) j∈J l∈L
qsi tm pj = qsi pj (i ∈ I) (15)
3.5. Constraints m∈M j∈J j∈J
The constraints of the model in this paper are as follows: qpj tm dk = qpj dk (j ∈ J) (16)
m∈M k∈K k∈K
• Capacity constraints qdk tm rl = qdk rl (k ∈ K) (17)
m∈M l∈L l∈L
qsi pj ≤ CapSi (i ∈ I) (5)
j∈J Equations (13) and (14) represent constraints on the
quantity of products produced by producers; Equa-
qpj dk ≤ CapPj (j ∈ J) (6) tions (15)–(17) represent the constraints on quantity
k∈K of raw materials or products transported by trans-
qdk rl ≤ CapDk (k ∈ K) (7) porters.
l∈L
• Non-negativity constraints
qdk rl = DRl (l ∈ L) (8) qsi pj , qpj dk , qdk rl ≥ 0 (18)
k∈K
10 Y. GUO ET AL.
qsi tm pj , qpj tm dk , qdk tm rl ≥ 0 (19) at layer l, W (l) are trainable weight matrices, and σ is an
di,j , dj,k , dk,l ≥ 0 (20) activation function. The decoder in the GAE reconstructs
the adjacency matrix or predicts other graph properties
Equations (18)–(20) represent integer constraints. from the latent embeddings:
A = σ (ZZT ), (22)
4. Solution method
where A is the reconstructed adjacency matrix. In the
In traditional supply chain architectures, intricate net-
context of SCND, the problem involves multiple objec-
works of internodal links are prevalent, which poses a
tives such as minimising cost, carbon emissions, and lead
significant complexity. Empirical observation indicates
time. These objectives are formulated as functions of the
that these connections can be systematically categorised
latent embeddings Z:
and merged, thereby significantly improving efficiency.
This paper adopts the AdaGAE clustering method of Li, min L = Lrecon + λLregularisation , (23)
Zhang, and Zhang (2021) and applies it to supply chain
network optimisation. The approach reconstructs con- where Lrecon represents the reconstruction loss that eval-
nections between participants by probabilistically clus- uates the fidelity of the graph structure, Lregularisation
tering similar data points into classes, thereby effectively ensures smoothness and prevents overfitting in the
addressing the scalability problem. embeddings, and λ controls the balance between these
To prepare for clustering, we modify the traditional two terms. Based on these theoretical foundations, we
model attributes. More specifically, the distance attribute propose a modified AdaGAE algorithm.
is replaced with the location attribute to reduce complex- The adapted AdaGAE clustering algorithm (Li, Zhang,
ity and redundancy. This is because distance constants are and Zhang 2021) integrates the GAE into a graph neu-
readily obtained from the locations of upper and lower- ral network for clustering tasks, expanding its application
level roles. Additionally, a category attribute is added beyond graph-structured data to more diverse types. This
to each role to facilitate the recording of clustering and approach adaptively constructs graphs from a genera-
operation results. tive perspective, leveraging non-Euclidean structures to
uncover high-level information for clustering.
The algorithm applies the following general steps:
4.1. AdaGAE clustering algorithm
The Sustainable Supply Chain Network Design (SCND) • Feature extraction: Initially, feature matrices are
problem involves various participants (e.g. suppliers, pro- extracted from node attributes. A sparse graph is then
ducers, distributors, retailers) and their relationships (e.g. formed by employing a generative model under con-
similarity of participants’ attributes). This structure can trolled sparsity levels.
be represented as a graph G = (V, E), where V denotes • GAE encoding-decoding: The graph undergoes enco-
the set of nodes (participants) and E denotes the set of ding and decoding phases within the GAE frame-
edges (relationships). Each node i ∈ V is associated with work, converting the weighted undirected graph into
features such as cost, carbon emissions, and lead time, embedded node representations.
which are captured in a feature matrix X ∈ RN×F . Here, • Graph updating: Following GAE training, the algor-
N represents the number of nodes, and F denotes the ithm adjusts the graph at a higher sparsity level based
number of features per node. The graph structure is fur- on node embeddings.
ther characterised by an adjacency matrix A ∈ RN×N , • Iterative training: GAE training persists iteratively
encoding the connections between nodes. until the clustering results converge.
The traditional Graph Autoencoder (GAE) employs
a Graph Convolutional Network (GCN) to encode node The execution flowchart of the AdaGAE algorithm is
features and graph structure into latent embeddings Z ∈ depicted in Figure 1.
RN×D , where D represents the dimensionality of the Prior to clustering process, we filter out pertinent
latent space. The encoding process is mathematically role attributes from identical locations, including capac-
expressed as: ity, carbon quota, cost, carbon emissions, and lead
1 time. These attributes are stored in node vector xi . The
D− 2
1
H (l+1) = σ A D− 2 H (l) W (l) , (21) resulting attribute dataset constitutes the feature matrix
X = [x1 , x2 , . . . , xn ]T ∈ Rn×d , where n denotes the node
where A = A + I is the adjacency matrix with self-loops, count, and d represents the attribute dimensionality. Usu-
D is the degree matrix of
A, H (l) is the node embeddings ally,a smaller distance xi − xj 22 should be assigned a
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 11
larger probability pij ; thus, it provides a natural method of graph convolutional neural network layers; its goal
to determine the probabilities by solving the following is to transform the original features into latent features
problem: using the constructed graph structure. We define the
latent features generated by m layers as:
n
min xi − xj 22 pij + γi p2ij , (24)
Z = ϕm (Âϕm−1 (· · · ϕ1 (ÂXW1 ) · · · )Wm ). (27)
i,j=1
where γi controls the sparsity of each node. It can be • Decoder: Our goal is to recover the connectivity distri-
proven that the solution has precisely k nonzero entries bution based on Euclidean distance. First, the distance
(i.e. it is k-sparse) if of the latent features, Z, is calculated by d̂ij = z i −
⎛ ⎞ z j 22 . Then, the connectivity distribution is recon-
1⎝ x
k structed by a normalisation step:
γi = kdi,k+1 − dijx ⎠ , (25)
2
j=1 exp(−d̂ij )
qji = n
. (28)
where dijx= xi − xj 22 . We denote dix ∈ Rn×1 as a vec- j=1 exp(−d̂ij )
step size t and sparsity upper bound km as follows: where A ∈ Rm×n , and n ≥ m. Generally, we have
x
km − k0 x= B , (34)
t= , (30) xN
T
n
km = , (31) where B ∈ Rm×m , N ∈ Rm×(n−m) , xB ∈ Rm represents
c basic variables and xN ∈ R(n−m) represents non-basic
variables. Rewrite Equations (32) and (33) in the follow-
where T is the number of iterations to update the weight ing form
adjacency, n represents the original data size and c is the
number of clusters. min cBT xB + cN
T
xN , (35)
x∈Rn
The clustering outcomes derived from the GAE are
stored in the category attribute. We extract role attributes s.t. BxB + NxN = b, xB ≥ 0, xN ≥ 0. (36)
belonging to the same category at the same location, com-
Hence, from the Equation (36), we obtain
pute their averages, and generate new data for subsequent
solving tasks. xB = B−1 b − B−1 NxN . (37)
Algorithm 1 Revised Simplex Method. in six bins, labelled sequentially as Qsi pj , Qpj dk , Qdk rl ,
Require: B, N, cB , cN , xB , xN , b. Qsi tm pj , Qpj tm dk , and Qdk tm rl . Although all sets share a
Find a basic feasible solution(bfs) x with A = [B | N]. similar structure, we focus on describing Qsi pj for illus-
while True do trative purposes.
Compute rq by Equation (42). To facilitate the restoration process, adjustments are
if All rq ≥ 0 then made to the id attribute of the abstracted objects. Rather
return The current bfs x is optimal solution. than sequentially numbering them in the conventional
break; manner, their id values comprise two parts represented
else by a two-digit number. The first digit, idl , denotes the
Choose rq by Equation (43). location of the abstracted object, and the second digit,
end if idk , signifies the category to which the abstracted object
Compute dq by Equation (40). belongs. Before commencing the restoration process, the
if All dq ≥ 0 then number of abstracted objects, m, is calculated as follows:
return The LP problem is unbounded. n
break; m= , (45)
r
else
Compute λ by Equation (44), where n denotes the original data size and r is the scaling
Update x ← x + λdq , B and N for the next itera- factor. The size of Qsi pj is m2 . To traverse the set of suppli-
tion. ers, the range of the traversal number i is set to [0, m − 1].
end if Consequently, we have:
end while
i
Ensure: Optimal solution. idl = , (46)
c
idk =i mod c, (47)
dividing by dj represents normalisation. We determine
the step length using the minimum ratio rule id =10idl + idk , (48)
where c represents the number of clusters. While travers-
xi q
λ = min q | − di < 0 , (44) ing the Qsi pj set, converting from the traversal number, i,
i∈B −di to the id of the abstract object is completed. Based on the
structure of the id attribute, we then have
where xi is the ith component of the current search point,
q
and di is the ith component of dq . id
After updating of x ← x + λdq , we adjust the basis l= , (49)
10
matrix and iterate through the calculation process until
all rq ≥ 0. Once achieved, we output the optimal solu- k =id mod 10, (50)
tion. The detailed algorithmic steps are outlined in
where l signifies the location of the real solution, and k
Algorithm 1.
denotes the category to which the real solution belongs.
At this juncture, we attain the optimal solution for the
An example of the restoration process is depicted in
abstracted entities after clustering, which might not cor-
Figure 2.
respond to the optimal solution for the original problem.
At this stage, we map the abstract solution to the spe-
Therefore, we proceed by filtering the optimal solution
cific solution. Subsequently, by utilising the location and
derived by the solver, which lays the groundwork for the
category of the real solution as filtering criteria, we iden-
subsequent restoration of the true solution.
tify the set of real solutions corresponding to the abstract
solution. From this set, we select the solution with the
4.3. Mapping from abstract solutions to real-world best weighted attributes as the true solution.
solutions
Mapping abstract solutions back to real-world solutions 5. Experiments and insights
involves transforming abstract representations into their
5.1. Experimental environment, test cases and
corresponding real-world counterparts. Initially, we fil-
parameters
ter nonzero decision variable values from the clustering
phase based on their type. As this process involves six We implement our algorithm using Python 3.9 on a com-
quantity decision variables, the filtered results are stored puter equipped with an 11th Gen Intel(R) Core(TM)
14 Y. GUO ET AL.
Table 4. The vale of experimental parameters. Table 5. The constant unit and rage of test instances.
Symbol Description Value Symbol Unit Range
α the weight of economic side 0.39 DPRl ten thousand [1000, 3500]
β the weight of environment side 0.2806 LRl , LSi , LPj , LDk – [1, 10]
γ the weight of lead time side 0.3294 CapMSi ten thousand [8000, 10,000]
k0 initial sparsity 1 CSi dollar [3000, 7000]
ti number of iterations to update GAE 50 CapPj ten thousand [4000, 8000]
n CQPj ton [50,000, 60,000]
km upper bound of sparsity
c CPj dollar [5000, 10,000]
T number of iterations to update weight adajacency 10 CEPj ton [0.15, 0.35]
r scaling factor 3 TPj hour [1, 24]
CapDk ten thousand [2000, 4000]
CQDk ton [10,000, 20,000]
SCDk dollar [300, 500]
SCEDk ton [0.01, 0.05]
i7-11800H processor running at 2.30 GHz, with 16 GB CapTm ten thousand [10,000, 12,000]
CQTm ton [500, 1000]
RAM and a Windows 11 operating system. The algorithm CTm dollar [0.1, 0.5]
parameters are summarised in Table 4. CETm ton [0.01, 0.02]
NCTm unit [1, 10]
To illustrate the effectiveness of the proposed algor- CCTm dollar [5, 10]
ithm, three large-scale instances are generated based on CECTm ton [0.01, 0.05]
a food processing industry scenario. These instances are TTm hour [0.01, 0.1]
configured with supplier/producer/distributor/retailer/
transporter quantities set at 90/90/90/90/90 (90), 120/120
Figure 3 visualises the producers after AdaGAE clus-
/120/120/120 (120), and 150/150/150/150/150 (150),
tering. The clustering results of each producer are anal-
respectively. The instance data are obtained from the
ysed based on three key dimensions: cost, carbon emis-
collaborative research platform developed by the Pub-
sions, and production time. A three-dimensional surface
lic Environmental Research Center and the Natu-
plot is used to visualise the clustering results without clus-
ral Resources Conservation Association for sustainable
ter labels. To facilitate a clearer observation of the cluster-
green supply chains. Further details are provided in
ing results, a three-dimensional bar chart is employed to
Table 5.
visualise the clustering results with cluster labels.
Table 8 provides the chosen virtual producers by sim-
plex method to potential real producers through map-
5.2. Case study
ping algorithm. Table 9 displays the real-world solution
In this section, we present an illustrative example to results, which includes all facility locations and material
explain the studied instances. Specifically, we consider an flows.
instance with a size of 90, where S1–S90 represent sup-
pliers, P1–P90 denote producers, D1–D90 correspond to
5.3. Experimental results and analysis
distributors, and R1–R90 are retailers. Due to the large
scale of the instance, we focus on providing detailed 5.3.1. Solution quality analysis
information about the producers, while the details of We execute the k-means (Sabouhi, Jabalameli, and
other participants are omitted for brevity. Jabbarzadeh 2021), k-medians, k-medoids, Fuzzy C-
Table 6 presents the attribute information of producers means (Jabbarzadeh, Fahimnia, and Sabouhi 2018),
along with their cluster labels after AdaGAE clustering. Gaussian mixture (Kim, Do, and Kim 2024), and Ada-
Table 7 provides the attribute information and cluster GAE algorithms five times each and calculate the mean of
labels of the virtual producers generated by the AdaGAE the outputs, with the final results presented in Table 10.
clustering algorithm. The comparisons of solution quality and differences for
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 15
Table 8. The chosen virtual producers to potential real producers instances (90), the maximum deviation using the Ada-
through mapping algorithm. GAE algorithm is 13.02%, with a minimum of 8.33% and
Chosen virtual producer Potential real producer an average of 10.76%. Notably, in some smaller instances,
P1 P2, P7, P8 the K-means algorithm yields superior solution quality.
P4 P10, P12, P14 In medium scale instances (120), the maximum deviation
P5 P13, P15, P17
P7 P21, P22, P25, P26 with AdaGAE is 12.89%, the minimum is 4.95%, and the
P8 P19, P20, P27 average is 8.84%. The experimental results suggest that
P10 P29, P32, P33
P13 P37, P39, P40 the solution quality achieved by the AdaGAE algorithm
P16 P46, P47, P48, P54 outperforms all other methods. For larger scale instances
P19 P56, P60, P61, P62
P20 P55, P57, P59 (150), the maximum deviation using AdaGAE is 12.10%,
P22 P64, P66, P68, P70 the minimum is 8.60%, and the average is 10.42%, indi-
P25 P73, P75, P79
P28 P89, P90
cating that AdaGAE maintains optimal solution quality
P29 P83, P85, P86, P87, P88 in most cases.
Figure 4 demonstrates that, except for a few relatively
small and large scale instances where the solution qual-
ity of the AdaGAE algorithm is inferior to that of K-
each instance across various algorithms are illustrated in means, the solution quality advantage of AdaGAE is sig-
Figures 4 and 5. nificantly more pronounced in medium scale and most
As shown in Table 10, for the same instance scale, other instances compared to other clustering algorithms.
the solution quality of most datasets using the AdaGAE Figure 5 illustrates that for medium and larger scale
algorithm is closer to the optimal solution. The max- instances, the solution gap of the AdaGAE algorithm
imum deviation from the optimal solution is 13.02%, gradually decreases with different data training, a char-
while the minimum is 4.95%. For relatively smaller scale acteristic not exhibited by other clustering algorithms.
16 Y. GUO ET AL.
17
18 Y. GUO ET AL.
Figure 4. The optimal solution is compared with the results obtained by K-means, K-medians, K-medoids, Fuzzy C-means, Gaussian
mixture, and AdaGAE algorithms when the number of examples is 90, 120, 150.
Figure 5. The solution gap is compared with the results obtained by K-means, K-medians, K-medoids, Fuzzy C-means, Gaussian mixture,
and AdaGAE algorithms when the number of examples is 90, 120, 150.
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 19
−12.89%
−8.68%
−8.33%
−13.02%
−12.10%
−4.95%
−8.60%
−10.94%
−10.57%
Gap
handling large-scale sustainable supply chain network
problems effectively.
AdaGAE
5.3253
5.1503
3.7072
4.2332
4.9328
6.4102
7.1865
4.4220
6.6004
5.3.4. Sensitivity analysis
Sensitivity analysis is a crucial step in evaluating the
−10.21%
−19.71%
−8.24%
−24.40%
−14.96%
−12.88%
−9.01%
−9.44%
−9.77%
robustness of clustering algorithms, as it provides insights
Gap
5.4881
5.5924
6.5069
7.2425
3.7564
4.4833
5.4357
6.4553
in Figure 8.
As illustrated in Figure 8, the Gaussian Mixture Clus-
tering algorithm stands out as the most stable one among
the evaluated methods. This stability is evident in the
−14.77%
−5.25%
−16.83%
−12.73%
−8.48%
−9.13%
−8.65%
−17.02%
−13.80%
7.2216
3.7178
4.3827
6.5075
6.4756
−13.29%
−10.30%
−9.64%
Gap
4.2497
5.1490
5.2992
5.5556
6.5842
7.2556
3.8537
4.2457
6.4785
−12.71%
−12.44%
−9.91%
−9.66%
−9.63%
−16.05%
−17.74%
−13.31%
Gap
5.3440
5.7053
6.5610
7.2570
3.7518
4.3465
5.1445
6.4797
−8.00%
−9.21%
−8.33%
−9.85%
−10.10%
−17.48%
−15.50%
−14.27%
6.4468
5.0462
5.1758
5.4967
7.2695
3.7679
4.4000
6.5348
5.96930
6.61764
3.42221
3.74531
3.98587
4.36919
5.71852
150-2
150-3
90-1
90-2
90-3
Figure 6. The solution results of transportation cost, carbon emission and time under different algorithms are compared when the
number of examples is 120.
Figure 8. Comparison the sensitivity of solutions under different algorithms across various problem sizes.
non-linear data, which is pivotal for modelling the intri- 5.5.2. Policy implications
cate structures and relationships inherent in sustainable This study underscores the critical role of innovative
supply chains. This establishes GNNs as a promising the- technologies in enhancing the speed and effectiveness
oretical framework for addressing the challenges posed of decision-making in sustainable supply chain manage-
by sustainability objectives, such as balancing economic, ment. By adopting our approach, corporate managers
environmental, and social dimensions. can rapidly generate actionable solutions that balance
The study also highlights the need for theoretical economic, environmental, and operational goals. The
advancements in integrating AI-driven methodologies approach’s ability to provide near-optimal results within
into sustainable supply chain design and optimisation. significantly reduced timeframes highlights its suitability
By leveraging the capabilities of graph neural networks, for addressing the dynamic and urgent demands of sus-
researchers can better capture the relationships among tainability initiatives. This is particularly relevant in the
supply chain entities, enabling the development of more current context, where corporations are under increasing
adaptive and efficient models. Furthermore, this explo- pressure to respond swiftly to environmental regulations,
ration encourages a shift in academic discourse toward market demands, and stakeholder expectations.
a deeper examination of how AI techniques, like GNNs, Policymakers can draw from these insights to advo-
can be systematically incorporated into sustainability cate for and support the integration of advanced AI-
paradigms. based tools in corporate practices. Encouraging the adop-
The theoretical contributions of this research extend tion of such technologies through incentives, training
beyond immediate practical applications, offering a foun- programmes, or partnerships can enhance the overall
dation for future studies aimed at integrating AI tech- responsiveness and adaptability of industries to sustain-
nologies into broader sustainability frameworks, ulti- ability challenges. Furthermore, our approach’s demon-
mately contributing to societal and national advance- strated superiority over traditional clustering algorithms
ments. This work paves the way for further exploration of in terms of solution speed and fitting effectiveness sug-
how AI can drive sustainable innovation and foster theo- gests that investing in research and development for sim-
retical growth at the intersection of artificial intelligence ilar AI-driven methods could yield significant societal
and supply chain management. benefits.
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 23
The research also calls for a paradigm shift in corpo- with increasing problem scales but also demonstrate
rate and policy frameworks, emphasising the need for superior predictive accuracy in solution quality com-
infrastructure and regulatory environments that facili- pared to conventional clustering methods, particularly
tate the widespread use of AI technologies. By priori- on large-scale datasets. In the comparative analysis of dif-
tising policies that enable faster adoption of advanced ferent objective functions, the function values derived
computational techniques, governments can foster an from the AdaGAE clustering algorithm show signifi-
ecosystem where businesses are better equipped to align cant advantages across various dimensions, although its
with sustainability objectives. This alignment not only performance in optimising transportation costs and car-
supports corporate competitiveness but also accelerates bon emissions remains suboptimal. Overall, with the
progress toward broader national and global sustainabil- advancement of artificial intelligence, clustering meth-
ity targets. ods present a novel approach to addressing supply chain
challenges, paving the way for enhanced efficiency and
5.5.3. Managerial implications quality in solutions.
The study emphasises the critical role of transporta- The operationalisation of social sustainability in this
tion network design in achieving sustainable supply work focuses on supply chain responsiveness. While this
chains. Corporate managers must prioritise the search selection captures time-sensitive social impacts, compre-
for advanced algorithms capable of simultaneously hensive social sustainability assessment requires integra-
addressing multiple transportation objectives, includ- tion of complementary indicators. Subsequent research
ing cost minimisation and carbon footprint reduc- will focus on utilising clustering algorithms to tackle
tion. A robust transportation network not only sup- more social parameters, enabling simultaneous optimi-
ports sustainability objectives but also enhances opera- sation of workforce, community, and consumer welfare
tional resilience. Managers should integrate algorithm- dimensions. Moreover, the current deterministic mod-
driven insights into their strategic planning to drive elling paradigm employs linear programming formula-
more efficient and environmentally friendly logistics tions with static parameters to establish computational
operations. baselines. This foundational work paves the way for crit-
Moreover, the findings of this study highlight the ical extensions including stochastic demand modelling,
superior solution quality and stability of our approach time-dependent capacity constraints, and non-convex
when applied to large-scale sustainable supply chain chal- cost behaviour integration. Future research will syn-
lenges. For managers, this underscores the importance of thesise these elements and transform the current static
leveraging artificial intelligence to address the inherent framework into an adaptive decision support system
complexity of sustainable supply chain design. Effective capable of handling real-world volatility.
decision-making relies on tools that provide consistent,
high-quality results across diverse scenarios. Managers
should not only adopt GAE but also foster an organi-
Acknowledgments
sational culture that values continuous experimentation The authors thank the anonymous reviewers for their valuable
and iteration to unlock the full potential of such advanced contribution to improve the quality and clarity of this work.
Chat GPT 4.o was used for language improvement.
technologies.
Disclosure statement
6. Conclusion
No potential conflict of interest was reported by the author(s).
In this study, we employ an adaptive Graph Autoen-
coder (GAE) clustering algorithm for the modelling and
solution of sustainable supply chain network problem. Funding
We introduce a sustainable supply chain model based This work is funded by the Natural Science Foundation
on inter-city clustering and conduct a comprehensive of Zhejiang Province (LMS25G010003) and the Scientific
modelling of transporters. To enhance clustering perfor- Research Project of Zhejiang Provincial Department of Edu-
cation (Y202454572). This work is also partially supported by
mance, we modify the attributes traditionally used in
the Research and Innovation on Ecological Responsibility and
supply chain models. Furthermore, we provide methods Autonomous Transport and Mobility (RITMEA) project. The
for reverting to true solutions. The experimental results RITMEA project is co-financed by the European Union with
indicate that this clustering approach not only funda- the European Regional Development Fund, the French state
mentally addresses the high time complexity associated and the Hauts de France Region Council.
24 Y. GUO ET AL.
An Applied Optimisation Model with Peddling and Ship- the Production Routing Problem.” Computers & Industrial
ment Consolidation.” International Journal of Production Engineering 196:110455. [Link]
Research 60 (16): 4995–5011. [Link] 110455.
543.2021.1946613. Granillo-Macías, Rafael. 2021. “Logistics Optimization through
Chattopadhyay, Manojit, Sourav Sengupta, and B. S. Sahay. A Social Approach for Food Distribution.” Socio-Economic
2016. “Visual Hierarchical Clustering of Supply Chain Using Planning Sciences 76:100972. [Link]
Growing Hierarchical Self-organising Map Algorithm.” 2020.100972.
International Journal of Production Research 54 (9): Guo, Yuhan, Fangxia Hu, Hamid Allaoui, and Youssef Boulak-
2552–2571. [Link] sil. 2019. “A Distributed Approximation Approach for Solv-
Chen, Jingfang, Ling Wang, Yile Liang, Yang Yu, Jie Feng, Jiuxia ing the Sustainable Supply Chain Network Design Prob-
Zhao, and Xuetao Ding. 2024. “Order Dispatching via GNN- lem.” International Journal of Production Research 57 (11):
Based Optimization Algorithm for on-Demand Food Deliv- 3695–3718. [Link]
ery.” IEEE Transactions on Intelligent Transportation Systems 6412.
25 (10): 13147–13162. [Link] Guo, Yuhan, Junyu Yu, Youssef Boulaksil, Hamid Allaoui, and
3389090. Fangxia Hu. 2021. “Solving the Sustainable Supply Chain
Das, Rakhi, Krishnendu Shaw, and Mohd Irfan. 2020. “Sup- Network Design Problem by the Multi-neighborhoods
ply Chain Network Design considering Carbon Foot- Descent Traversal Algorithm.” Computers & Industrial Engi-
print, Water Footprint, Supplier’s Social Risk, Solid Waste, neering 154:107098. [Link]
and Service Level under the Uncertain Condition.” Clean 7098.
Technologies and Environmental Policy 22(2): 337–370. Guo, Yuhan, Tao Chen, Youssef Boulaksil, Linfan Xiao, and
[Link] Hamid Allaoui. 2023. “Collaborative Planning of Multi-
Dong, Guimin, Mingyue Tang, Zhiyuan Wang, Jiechao Gao, tier Sustainable Supply Chains: A Reinforcement Learn-
Sikun Guo, Lihua Cai, Robert Gutierrez, Bradford Camp- ing Enhanced Heuristic Approach.” Computers & Industrial
bel, Laura E. Barnes, and Mehdi Boukhechba. 2023. “Graph Engineering 185:109669. [Link]
Neural Networks in LoT: A Survey.” ACM Transactions 109669.
on Sensor Networks 19 (2): 1–50. [Link] Ha, Young-Kyou, and Changjoon Lee. 2024. “Supply Chain
3565973. Agility: Achieving Robustness and Logistics Performance.”
Eghbali, Hamed, Jamal Arkat, and Reza Tavakkoli-Mogh- Journal of Distribution Science 22 (9): 65–72. [Link]
addam. 2022. “Sustainable Supply Chain Network Design for 10.15722/JDS.22.09.202409.65.
Municipal Solid Waste Management: A Case Study.” Journal Habibi, Farhad, Alireza Abbasi, and Ripon Kumar Chakra-
of Cleaner Production381:135211. [Link] bortty. 2023. “Designing an Efficient Vaccine Supply Chain
jclepro.2022.135211. Network Using A Two-Phase Optimization Approach: A
Ekren, Banu Yetkin, Sachin Kumar Mangla, Ecem Eroglu Case Study of COVID-19 Vaccine.” International Journal
Turhanlar, Yigit Kazancoglu, and Guo Li. 2021. “Lat- of Systems Science: Operations & Logistics 10 (1): 2121623.
eral Inventory Share-Based Models for LoT-enabled E- [Link]
commerce Sustainable Food Supply Networks.” Computers Hegab, Hussien, Ibrahim Shaban, Muhammad Jamil, and
& Operations Research 130:105237. [Link] Navneet Khanna. 2023. “Toward Sustainable Future: Strate-
[Link].2021.105237. gies, Indicators, and Challenges for Implementing Sustain-
Elhedhli, Samir, Fatma Gzara, and Cynthia Waltho. 2021. able Production Systems.” Sustainable Materials and Tech-
“Green Supply Chain Design with Emission Sensitive nologies 36:e00617. [Link]
Demand: Second Order Cone Programming Formulation e00617.
and Case Study.” Optimization Letters 15(1): 231–247. Hosseini, Seyedmohsen, and Dmitry Ivanov. 2022. “A Multi-
[Link] layer Bayesian Network Method for Supply Chain Dis-
Emami, Seyedeh Gelareh, Valentina Lorenzoni, and Giuseppe ruption Modelling in the Wake of the COVID-19 Pan-
Turchetti. 2024. “Towards Resilient Healthcare Systems: A demic.” International Journal of Production Research 60 (17):
Framework for Crisis Management.” International Journal 5258–5276. [Link]
of Environmental Research and Public Health 21 (3): 286. 3180.
[Link] Jabbarzadeh, Armin, Behnam Fahimnia, and Fatemeh Sabouhi.
Gao, Yao, Shaojun Lu, Hao Cheng, and Xinbao Liu. 2024. 2018. “Resilient and Sustainable Supply Chain Design:
“Data-Driven Robust Optimization of Dual-Channel Sustainability Analysis under Disruption Risks.” Interna-
Closed-Loop Supply Chain Network Design considering tional Journal of Production Research 56 (17): 5945–5968.
Uncertain Demand and Carbon Cap-and-trade Policy.” [Link]
Computers & Industrial Engineering 187:109811. [Link] Joshi, Sidharath. 2022. “A Review on Sustainable Supply Chain
org/10.1016/[Link].2023.109811. Network Design: Dimensions, Paradigms, Concepts, Frame-
Gao, Jiyao, and Fengqi You. 2017. “Modeling Framework work and Future Directions.” Sustainable Operations and
and Computational Algorithm for Hedging against Uncer- Computers 3:136–148. [Link]
tainty in Sustainable Supply Chain Design Using Functional- 01.001.
Unit-based Life Cycle Optimization.” Computers & Chemi- Katsoras, Efthymios, and Patroklos Georgiadis. 2022. “An Inte-
cal Engineering 107:221–236. [Link] grated System Dynamics Model for Closed Loop Supply
chemeng.2017.05.021. Chains under Disaster Effects: The Case of COVID-19.”
Geiger, Andreas. 2024. “Enhancing Supply Chain Coordina- International Journal of Production Economics 253:108593.
tion: A Comparative Analysis of Clustering Techniques for [Link]
26 Y. GUO ET AL.
Kim, Changsu, Thai Ngan Do, and Jiyong Kim. 2024. “Spa- Quddus, Md Abdul, Niamat Ullah Ibne Hossain, Marufuz-
tially Explicit Supply Chain for Nationwide CO2-to-fuel zaman Mohammad, Raed M. Jaradat, and Mohammad S.
Infrastructure: Data-Driven Optimization with Gaussian Roni. 2017. “Sustainable Network Design for Multi-purpose
Mixture Model-Based Region Screening and Clustering.” Pellet Processing Depots under Biomass Supply Uncer-
Journal of Cleaner Production 471:143390. [Link] tainty.” Computers & Industrial Engineering 110:462–483.
10.1016/[Link].2024.143390. [Link]
Kong, Yudan, Jiaguo Liu, and Jihong Chen. 2023. “Exploring Rabbani, Masoud, Fatemeh Navazia, Hamed Farrokhi-Asl, and
the Carbon Abatement Measures in Maritime Supply Chain: Mohammad Hasan Balali. 2018. “A Sustainable Transpor-
A Scenario-Based System Dynamics Approach.” Interna- tation-Location-routing Problem with Soft Time Windows
tional Journal of Production Research 61 (18): 6131–6152. for Distribution Systems.” Uncertain Supply Chain Manage-
[Link] ment 6 (3): 229–254. [Link]
Kumar, Amit, and Kaushal Kumar. 2024. “An Uncertain 12.002.
Sustainable Supply Chain Network Design for Regulating Ran, Xiaojuan, Naret Suyaroj, Worawit Tepsan, Jianghong Ma,
Greenhouse Gas Emission and Supply Chain Cost.” Cleaner Xiangbing Zhou, and Wu Deng. 2024. “A Hybrid Genetic-
Logistics and Supply Chain10:100142. [Link] Fuzzy Ant Colony Optimization Algorithm for Automatic
16/[Link].2024.100142. K-means Clustering in Urban Global Positioning System.”
Lahri, Vijay, Krishnendu Shaw, and Alessio Ishizaka. 2021. Engineering Applications of Artificial Intelligence 137:109237.
“Sustainable Supply Chain Network Design Problem: Using [Link]
the Integrated BWM, TOPSIS, Possibilistic Programming, Ren, Yazhou, Jingyu Pu, Zhimeng Yang, Jie Xu, Guofeng
and -constrained Methods.” Expert Systems with Appli- Li, Xiaorong Pu, Philip S. Yu, and Lifang He. 2024.
cations 168:114373. [Link] “Deep Clustering: A Comprehensive Survey.” IEEE Trans-
4373. actions on Neural Networks and Learning Systems 36 (4):
Li, Xuelong, Hongyuan Zhang, and Rui Zhang. 2021. “Adap- 5858–5878. [Link]
tive Graph Auto-encoder for General Data Clustering.” IEEE Reusken, Meike, Frans Cruijssen, and Hein Fleuren. 2023. “A
Transactions on Pattern Analysis and Machine Intelligence 44 Food Bank Supply Chain Model: Optimizing Investments to
(12): 9725–9732. [Link] Maximize Food Assistance.” International Journal of Produc-
5687. tion Economics 261:108886. [Link]
Madani, Batool, Afef Saihi, and Akmal Abdelfatah. 2024. “A 23.108886.
Systematic Review of Sustainable Supply Chain Network Sabogal-De La Pava, Martha Lucía, Carlos Julio Vidal-Holguín,
Design: Optimization Approaches and Research Trends.” Diego Fernando Manotas-Duque, and Juan José Bravo-
Sustainability 16 (8): 3226. [Link] Bastidas. 2021. “Sustainable Supply Chain Design consider-
3226. ing Indicators of Value Creation.” Computers & Industrial
Martins, C. L., and M. V. Pato. 2024. “Decomposition Heuris- Engineering 157:107294. [Link]
tics for Multiobjective Problems. The Food Bank Network 107294.
Redesign Case.” International Journal of Production Eco- Sabouhi, Fatemeh, Mohammad Saeed Jabalameli, and Armin
nomics 268:109121. [Link] Jabbarzadeh. 2021. “An Optimization Approach for Sus-
9121. tainable and Resilient Supply Chain Design with Regional
Mohammed, Awsan, Maged S. Al-Shaibani, and Salih O. Duf- Considerations.” Computers & Industrial Engineering 159:
fuaa. 2023. “A Meta-heuristic-Based Algorithm for Design- 107510. [Link]
ing Multi-objective Multi-echelon Supply Chain Network.” Setiawan, Hendry Sugianto, Zeplin Jiwa Husada Tarigan, and
Applied Soft Computing 147:110774. [Link] Hotlan Siagian. 2023. “Digitalization and Green Supply
16/[Link].2023.110774. Chain Integration to Build Supply Chain Resilience toward
Moh’d Anwer, A. L.-Shboul. 2022. “An Investigation of Trans- Better Firm Competitive Advantage.” Uncertain Supply
portation Logistics Strategy on Manufacturing Supply Chain Chain Management 11 (2): 683–696. [Link]
Responsiveness in Developing Countries: The Mediating 67/[Link].2023.1.012.
Role of Delivery Reliability and Delivery Speed.” Heliyon 8 Shaw, Krishnendu, Mohd Irfan, Ravi Shankar, and Surendra
(11): e11283. [Link] S. Yadav. 2016. “Low Carbon Chance Constrained Supply
Moheb-Alizadeh, Hadi, Robert Handfield, and Donald Wars- Chain Network Design Problem: A Benders Decomposition
ing. 2021. “Efficient and Sustainable Closed-Loop Supply Based Approach.” Computers & Industrial
Chain Network Design: A Two-Stage Stochastic Formu- Engineering 98:483–497. [Link]
lation with A Hybrid Solution Methodology.” Journal of 06.011.
Cleaner Production 308:127323. [Link] Shen, Jiayu. 2020. “An Uncertain Sustainable Supply Chain
epro.2021.127323. Network.” Applied Mathematics and Computation 378:
Naeeni, Hannan Sadjady, and Navid Sabbaghi. 2022. “Sustain- 125213. [Link]
able Supply Chain Network Design: A Case of the Glass Sherafati, Mahtab, Mahdi Bashiri, Reza Tavakkoli-Moghaddam,
Manufacturer in Asia.” International Journal of Production and Mir Saman Pishvaee. 2020. “Achieving Sustainable
Economics 248:108483. [Link] Development of Supply Chain by Incorporating Various
108483. Carbon Regulatory Mechanisms.” Transportation Research
Nair, Vinod, Sergey Bartunov, Felix Gimeno, Ingrid von Glehn, Part D: Transport and Environment 81:102253. [Link]
Pawel Lichocki, Ivan Lobov, Brendan O’Donoghue, et al. org/10.1016/[Link].2020.102253.
2020. “Solving Mixed Integer Programs Using Neural Net- Shoaib, Mohd, Navonil Mustafee, and Varun Ramamohan.
works.” arXiv Preprint ArXiv :2012.13349. 2025. “A Framework for Predicting Runtime Savings from
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH 27
Discrete-Event Simulation Model Simplification Opera- ogy to Ensure Compliance with Sustainability Standards in
tions.” arXiv Preprint ArXiv: 2503.19264. the PPE Multi-tier Supply Chain.” International Journal of
Sorooshian, Shahryar. 2024. “The Sustainable Development Production Research 61 (14): 4934–4950. [Link]
Goals of the United Nations: A Comparative Midterm 10.1080/00207543.2022.2025944.
Research Review.” Journal of Cleaner Production 453:142272. Wang, Gang. 2023. “Designing A Stochastic Supply Chain
[Link] Network: An Error-Bound-based Heuristic.” Computers &
Sun, Junyan, Zefei Chen, Zhirui Chen, and Xiaopeng Li. Industrial Engineering 184:109570. [Link]
2024. “Robust Optimization of A Closed-Loop Supply Chain [Link].2023.109570.
Network Based on an Improved Genetic Algorithm in Wang, Zhen, Chunming Ye, and Jianquan Guo. 2024. “Robust
an Uncertain Environment.” Computers & Industrial Engi- Optimization of Multi-objective Multi-cycle Remanufactur-
neering 189:109997. [Link] ing Supply Chain Network considering Incentive Compat-
997. ibility Theory under Improved Algorithm.” Computers &
Vafaei, Arezoo, Saeed Yaghoubi, Javad Tajik, and Farnaz Barz- Industrial Engineering 187:109777. [Link]
inpour. 2020. “Designing A Sustainable Multi-channel Sup- [Link].2023.109777.
ply Chain Distribution Network: A Case Study.” Journal Yadav, Vinay Surendra, A. R. Singh, Rakesh D. Raut, and
of Cleaner Production251:119628. [Link] Naoufel Cheikhrouhou. 2021. “Design of Multi-objective
jclepro.2019.119628. Sustainable Food Distribution Network in the Indian Con-
Valderrama, Claudio Vergara, Ernesto Santibanez-González, text with Multiple Delivery Channels.” Computers & Indus-
Bruno Pimentel, Alfredo Candia-Véjar, and Linda Canales- trial Engineering 160:107549. [Link]
Bustos. 2020. “Designing an Environmental Supply Chain 2021.107549.
Network in the Mining Industry to Reduce Carbon Zahiri, Behzad, Jun Zhuang, and Mehrdad Mohammadi. 2017.
Emissions.” Journal of Cleaner Production 254:119688. “Toward an Integrated Sustainable-Resilient Supply Chain:
[Link] A Pharmaceutical Case Study.” Transportation Research
Varsei, Mohsen, and Sergey Polyakovskiy. 2017. “Sustainable Part E: Logistics and Transportation Review 103:109–142.
Supply Chain Network Design: A Case of the Wine Industry [Link]
in Australia.” Omega 66:236–247. [Link] Zarrinpoor, Naeme, and Aida Khani. 2021. “Designing A Sus-
omega.2015.11.009. tainable Biofuel Supply Chain by considering Carbon Poli-
Wang, Bill, Zhiyu Lin, Michael Wang, Fangyi Wang, Peng cies: A Case Study in Iran.” Energy, Sustainability and Society
Xiangli, and Zhi Li. 2023. “Applying Blockchain Technol- 11(1): 1–22. [Link]
GAEs promote sustainability in supply chain management by effectively optimizing networks for minimal transportation costs and emissions, thereby aligning with environmental objectives. Their ability to model complex, non-linear data enhances the management of sustainable supply chains, ensuring that environmental considerations do not compromise operational efficiency . The GAEs’ adaptability and learning capabilities are crucial in navigating the evolving demands of sustainability objectives .
The use of graph autoencoders introduces significant methodological advancements in supply chain design by enabling the handling of complex, interconnected, non-linear data structures essential for accurately mapping sustainable supply chains. This capability represents a major departure from traditional methodologies, allowing for more granular and adaptive analysis. The expected impact on future research includes a shift towards more AI-driven, adaptable solutions that can dynamically respond to new sustainability challenges and data inputs, potentially setting a new standard for computational approaches in the field .
GAEs improve clustering effectiveness by capturing intricate interconnections among nodes and handling both structured and unstructured data. They learn latent representations that reduce dimensionality while preserving important features, unlike traditional k-means which cannot exploit these non-Euclidean structures . Additionally, GAEs offer high-level information exploitation and dynamic adaptability, overcoming the limitations of static traditional models.
The GAE approach enhances practical applications by constructing a flexible and efficient supply chain network model that captures key features and relationships within the network. It introduces mechanisms to translate abstract solutions into actionable decisions, offering timely decision support for enterprises . The method's robust performance, validated via experimental results, shows potential for real-world applications where sustainability and efficiency are critical .
The novel GAE-based algorithm provides theoretical insights into using artificial intelligence, specifically graph neural networks, to transform sustainable supply chain management. It emphasizes integrating AI-driven methods with supply chain design to address the challenges of balancing economic, environmental, and social sustainability objectives effectively . Additionally, it lays a framework for modelling complex supply chain structures and relationships .
GAEs address the limitations of traditional clustering algorithms by incorporating a learning mechanism that adapts to changes over time, maintaining solution quality and stability in dynamic environments. This adaptability allows them to efficiently handle large-scale problems and varied data types, which traditional static models like k-means fail to achieve .
GAEs provide significant advantages in multi-objective optimization by achieving low deviations in transportation costs and emissions while reducing production time, demonstrating an ability to balance sustainability with operational efficiency. Their computational efficiency and scalability further enhance their suitability for solving large-scale supply chain challenges, as evidenced by their high-quality solutions and speed advantages compared to other methods .
The complexity of sustainable supply chain network design arises from the need to incorporate numerous variables, constraints, and objective functions related to sustainability into the model, significantly increasing its complexity beyond traditional supply chain models . Conventional solution methods struggle to process these models within reasonable timeframes due to their size and intricacy, hence the classification as an NP-hard problem .
AdaGAE demonstrates superior performance over K-means and other methods in terms of solution quality and stability. It exhibits the lowest average objective values with deviations below 11% and maintains lower run-to-run variance (0.5%-1.2%). These metrics emphasize AdaGAE's capability to deliver stable and high-quality solutions consistently, in contrast to K-means, which shows higher variance and less consistency across runs .
Incorporating a responsiveness indicator into sustainable supply chain models addresses the challenge of adapting to dynamic changes and improving the agility of decision-making processes. This extension makes the model more suitable for clustering representation by enabling a more accurate depiction of intercity distances and the dynamic elements of supply chains, which traditional models either omit or inadequately represent .