GNNs For Zero Forcing Sets
GNNs For Zero Forcing Sets
2, MARCH/APRIL 2024
Abstract—This article studies the problem of computing zero- such colorings have been considered with many applications
forcing sets (ZFS) in graphs and provides a machine-learning in network science and engineering, such as air traffic flow
solution. Zero-forcing is a vertex coloring process to color the management [1], nucleic acid sequence design in biochemical
entire vertex set from a small subset of initially colored vertices
constituting a ZFS. Such sets have several applications in network networks [2], channel assignment in wireless networks [3], and
science and networked control systems. However, computing a community detection in social networks [4]. In addition, graph
minimum ZFS is an NP-hard problem, and popular heuristics coloring also serves as an effective tool in solving other signifi-
encounter scalability issues. We investigate the greedy heuristic for cant graph theory problems, for instance, graph partitioning [5]
this problem and propose a combination of the random selection and clique computation [6].
and greedy algorithm called the random-greedy algorithm, which
offers an efficient solution to the ZFS problem. Moreover, we en- Zero forcing is a dynamic coloring of vertices, which are
hance this approach by incorporating a data-driven solution based initially colored either black or white. A black vertex can change
on graph convolutional networks (GCNs), leveraging a random the color of its white neighbor to black under some conditions
selection process. Our machine-learning architecture, designed to (i.e., the black vertex has exactly one white neighbor). The goal
imitate the greedy algorithm, achieves significant speed improve- is to select the minimum number of vertices in a graph, which,
ments, surpassing the computational efficiency of the greedy al-
gorithm by several orders of magnitude. We perform thorough if colored black initially, will render all vertices black at the end
numerical evaluations to demonstrate that the proposed approach of the coloring process. Such a subset of initial black vertices
is considerably efficient, scalable to graphs about ten times larger is called a zero forcing set (ZFS) in a graph (as explained in
than those used in training, and generalizable to several different Section II-A). Zero forcing has applications in modeling various
families of synthetic and real-world graphs with comparable and
physical phenomena, including logic circuits analysis, disease
sometimes better results in terms of the size of ZFS. We also curate
a comprehensive database comprising synthetic and real-world spread analysis, and information spread in social networks [7],
graph datasets, including approximate and optimal ZFS solutions. [8], [9], [10]. In particular, ZFS is an important notion in studying
This database serves as a benchmark for training machine-learning network controllability, which is a central phenomenon in the
models and provides valuable resources for further research and control of networked systems. Network controllability is ma-
evaluation in this problem domain. Our findings showcase the ef- nipulating a network of agents as desired by injecting external
fectiveness of the proposed machine-learning solution and advance
the state-of-the-art in solving the ZFS problem. control inputs through a subset of agents called leaders. The
crucial problem is determining the minimum leader agents to
Index Terms—Zero-forcing Set, graph convolutional network, make the network completely controllable. It turns out that ZFS
network controllability, leader selection problem.
in a network characterizes a leader set for the network’s complete
controllability [10], [11] (as discussed in Section II-B).
I. INTRODUCTION Unfortunately, finding a minimum ZFS for a given graph is a
YNAMIC coloring in graphs is a process of coloring combinatorial optimization problem shown to be NP-Hard [12].
D vertices iteratively according to some pre-defined rules
and conditions. Based on the coloring rules, several variants of
The algorithms to compute the optimal solutions take exponen-
tial time and are not scalable to huge networks. Algorithms
computing approximate solutions, such as greedy, generally
provide reasonable results; however, they also incur scalability
Manuscript received 15 June 2023; revised 27 October 2023; accepted 22
November 2023. Date of publication 4 December 2023; date of current version problems as the network sizes grow. Additionally, degenerate
23 February 2024. This work was supported by the National Science Foundation cases exist for which the greedy solution can be arbitrarily
under Grants 2325416 and 2325417. Recommended for acceptance by Prof. X. bad [13]. We discuss the exact and approximate solutions in
Li. (Corresponding author: Obaid Ullah Ahmad.)
Obaid Ullah Ahmad is with the Electrical Engineering Department, The Section III.
University of Texas at Dallas, Richardson, TX 75080 USA (e-mail: Obaidul- This work aims to find a computationally fast and accurate
[email protected]). solution for the ZFS problem using random selection, machine
Waseem Abbas is with the Systems Engineering Department, The
University of Texas at Dallas, Richardson, TX 75080 USA (e-mail: learning (ML) and data-driven approaches. Recently, ML-based
[email protected]). solutions have found applications in solving computationally
Mudassir Shabbir and Xenofon Koutsoukos are with the Computer Science hard problems. Gama et al. propose a GNN-based distributed
Department, Vanderbilt University, Nashville, TN 37235 USA (e-mail: mudas-
[email protected]; [email protected]). solution to solve the flocking and the multi-agent path plan-
Digital Object Identifier 10.1109/TNSE.2023.3337750 ning problems [14]. They propose a novel framework using
2327-4697 © 2023 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
AHMAD et al.: GRAPH MACHINE LEARNING FRAMEWORK TO COMPUTE ZERO FORCING SETS IN GRAPHS 2111
graph signal processing (GSP) to learn the controllers for these scalable. Furthermore, we train the GCN on smaller graphs
problems. Additionally, there has been an increasing inter- and evaluate it on graphs ten times the sizes of graphs
est in utilizing these data-driven, and graph machine learning in the training set and observe a remarkable difference in
(GML) approaches to solve hard combinatorial optimization computation time from the greedy algorithm.
problems [15], [16]. Cappart et al. provide a comprehensive ZFS computation is a hard problem, as explained in Sec-
review of recent works involving the use of machine learning to tion III. There are ways to find an approximate solution that are
solve combinatorial optimization problems [16]. By exploiting generally practicable but can have unrealistic time complexity
the known instances’ solution patterns and relating them to the for huge graphs. ML methods afford the great potential to
underlying network structure, we can train ML models that provide time-efficient solutions to challenging combinatorial
provide approximately optimal solutions to large networks in optimization problems. However, in literature, these solutions
a fraction of the time than the best-known heuristic algorithms. are not trivial. For instance, Joshi et al. show that for the
Further, when combined with heuristic components, the learning Travelling salesman problem (TSP), the trained model’s learning
models could compute solutions with much better approxi- is usually limited to a certain scale level, and the model’s perfor-
mations. Nevertheless, designing efficient and stable machine mance drops drastically when evaluated on larger networks [19].
learning architectures for combinatorial problems has inherent Researchers are trying to generalize machine learning solu-
challenges. Some significant challenges include the availability tions for the more extensive networks for the TSP [20], [21],
of datasets for learning, designing appropriate ML architectures, [22]. Bello et al. introduce a neural network-based framework
and scalability and transferability [16], [17], [18] (as discussed for solving combinatorial optimization, with a focus on the
in detail in Section III-C). traveling salesman problem (TSP). It trains a recurrent neural
Our approach relies on combining data-driven and algorith- network to predict city permutations, optimizing parameters
mic insights to find near-optimal ZFS. We study the structural using reinforcement learning and comparing training methods
aspects of the problem to design a graph convolutional network using negative tour length rewards [20]. Additionally, collecting
(GCN). We also generate a huge database using synthetic and optimal data for training is also a massive problem for hard
real-world graph datasets to train our GCN model. As a result, we combinatorial optimization problems [23]. We have devised a
achieve comparable results to the greedy algorithm in a fraction way to use machine learning without needing an optimal labeled
of the time for huge networks. Our main contributions are: dataset.
r We propose a GCN-based architecture using the insights The article is organized as follows: Section II introduces the
from the greedy algorithm. The key aspect is to design a notations and definitions; Section III discusses known ways to
GCN capable of learning to imitate the steps of the greedy compute ZFS and the potential and challenges of ML; Section IV
algorithm much more efficiently. By employing this do- describes the datasets; Section V discusses the structural aspect
main knowledge in the GCN designing process, we achieve of the problem. In the last three sections, we propose a GCN-
a scalable, generalizable, and time-efficient solution, as based architecture for the ZFS problem and evaluate several
explained in Section VI-D. We analyze the proposed GNN of its aspects including generalizability and scalability. Finally,
in detail and discuss various model parameters to obtain Section IX concludes the article.
superior solutions in terms of time complexity and ZFS
size. II. PRELIMINARIES
r We study the greedy algorithm for the ZFS problem in de-
An undirected graph G = (V, E) models a multiagent net-
tail and uncover some valuable insights into its underlying
work. Vertex set V and edge set E ⊂ V × V represent agents
structure. In particular, we develop a hypothesis pertaining
and interactions between them, respectively. The edge between
to the random selection of ZFS in the initial iterations of the
vertices u and v is denoted by an unordered pair (u, v). The
greedy algorithm and empirically validate it across several
neighborhood of u is the set Nu = {v ∈ V : (u, v) ∈ E} and
distinct datasets. To the best of our knowledge, existing
the degree of u is deg(u) = |Nu |. The distance between vertices
research has not explored the integration of random and
u and v, denoted by d(u, v), is the number of edges in the shortest
greedy vertex selection strategies as a means of designing
path between them. Next, we define the zero forcing process and
effective heuristics for combinatorial optimization prob-
the respective terms and then its applications.
lems, such as ZFS.
r We generate a huge database for the ZFS problem. We
compute optimal solutions for synthetic graphs and greedy A. Zero Forcing Process
solutions for three synthetic and four real-world graph Definition (Zero forcing (ZF) Process): Consider a graph
datasets. G = (V, E), such that each v ∈ V is colored either black or
r Using the dataset with the optimal solutions, we show that white initially. The ZF process is to iteratively change the colors
for the ZFS problem, the optimal data is not vital for the of vertices using the following rule until no further changes are
training of our proposed architecture to achieve satisfactory possible.
results. Instead, training can be done using approximate Color change rule: If v ∈ V is colored black and has exactly
solutions. See Section VII for further details. one white neighbor u, change color of u to black.
r We thoroughly evaluate the proposed solution on an exten- We say that v infected u if the color of white vertex u is
sive collection of graphs and show that our model is highly changed to black by any black vertex v.
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
2112 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 11, NO. 2, MARCH/APRIL 2024
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
AHMAD et al.: GRAPH MACHINE LEARNING FRAMEWORK TO COMPUTE ZERO FORCING SETS IN GRAPHS 2113
offer improvements, their time complexity remains exponential stochastic optimization methods to adapt to the given data sam-
in general. Hence, many researchers have proposed approximate ples while optimizing the learning model parameters. GNNs
algorithms to find the Z(G) in polynomial time. In the follow- exploit the patterns and invariances in the given data distri-
ing subsection, we discuss a few approaches to compute the bution, which help solve many problems on graph-structured
approximate solutions for the minimum ZFS problem. data like toxicity prediction in chemical compounds, community
detection in social networks [37], etc. GNNs have also been
applied to challenging combinatorial optimization problems on
B. Approximate ZFS Computation graphs with some success. Satisfiability, Travelling Salesman,
Various heuristics have been proposed to compute Z(G) in Knapsack, Minimum Vertex Cover, and Maximum Cut are a few
large graphs in feasible times. The most notable is the greedy of the problems that have been solved using GNNs [21], [22],
heuristic that typically computes a small-sized Z(G). The main [38]. This is an exciting avenue for complicated combinatorial
idea is to iteratively construct a Z(G) by adding a vertex to a problems that would otherwise either take an exceptional time
Z(G) solution that maximizes the size of the derived set in that even for an approximate solution or could not be computed with
iteration. Once a solution is obtained by this iterative process, limited computational resources. This work shows a GNN-based
redundant vertices are removed to obtain a minimal Z(G). The approach can be used to compute Z(G) with reliable accuracy
greedy heuristic generally performs well. Moreover, though the and efficiency. The sizes of Z(G) returned by our GNN solution
greedy heuristic is faster than optimal algorithms and it can be are comparable to or smaller than the existing techniques while
verified in linear time whether a given set is a Z(G) [36], it still taking only a fraction of runtime as we thoroughly compare and
takes significant time to compute a solution for huge graphs as evaluate in the later sections. However, we note that in general,
we demonstrate in our numerical evaluations (Section VI-D). several challenges need to be addressed to design an efficient
Utilizing the idea of potential games, [13] presents another GNN solver for a hard combinatorial optimization problem like
heuristic based on the log-linear learning (LLL) that returns the minimum Z(G).
comparable solutions (in terms of ζ(G)) to the greedy algorithm. 1) Limited Datasets for Training: Constructing a required
However, the quality of the solution is a function of the number size dataset suitable for training a machine learning model is
of iterations. Numerical evaluations show a fast convergence; usually the most critical and challenging step in designing a data-
however, the evaluation is confined to graphs of small sizes (50 driven solution for a combinatorial optimization problem. This
vertices). Some other heuristics are also discussed in [8], [34]. is also true for the minimum ZFS problem for which there is no
As mentioned above, the best-known heuristic to find the ap- known labeled dataset available for learning. Generating a large
proximate solution for the ZFS problem is the greedy algorithm. enough data set with optimal solutions for hard combinatorial
However, it is not scalable to large graphs and has a huge time problems, such as computing a minimum Z(G), requires due
complexity for large graphs. For instance, in one of our numeri- diligence along with sufficient computational resources [16].
cal evaluations, it takes the greedy algorithm about 156 minutes We generate a huge new labeled database consisting of several
to find Z(G) of a graph with 990 vertices. Similarly, [13], [34] synthetic and real-world graph datasets (Section IV).
shows that there exist graphs for which the greedy solution can 2) Scaling and Generlizability: A related issue of scaling
be arbitrarily bad; that is, the difference between the optimal also plagues many proposed solutions. Even when there are
and greedy solutions can be arbitrarily large. This limits the some datasets available for some graph problems, the instances
applicability of the greedy algorithm for real-world network in these datasets are of very small sizes. Consequently, the
problems. machine learning models that are trained on these instances can
only reliably be tested on similar-size problems. An important
problem that some of the previous works have tried to address is
C. Graph Machine Learning – Proposed Approach for ZFS generalizability, i.e., training a model on small instances that per-
Computation form well on large instances. Such works report limited success
Graph datasets present several new challenges for traditional so far [17]. Along with being scalable and generalizable to larger
machine learning like input size variations, non-regular neigh- sizes of problems, the learned models should also extrapolate
borhood structure, and vertex permutations. There are two main and transfer well to different families of graphs. Designing a
approaches for graph-structured data. Graph embeddings trans- machine learning solution that performs well outside the support
form the input graphs to a fixed-sized low-dimensional interme- of the training distribution is a crucial research challenge [18].
diate representation that is then used in a classifier. In contrast, In this article, we present an approach to find the minimum
Graph Neural Networks (GNNs) are trained in an end-to-end Z(G) that is a combination of both data-driven and heuristic
manner, and graph structural information is used to construct methods (Section VI). We perform several experiments where
the layout of the Neural Network. we are able to illustrate that our approach is capable of both
GNNs generalize deep neural networks (DNNs) for graph- scaling and generalizing since we exploit the intricate details of
structured data. The main objective is to achieve vector repre- the greedy algorithm in the design of our method. As the major
sentations of vertices/graphs in the low-dimensional space that part of the solution comes from the machine learning model, our
encodes structural relationships in graphs in the corresponding architecture is also much faster and computationally inexpensive
vector representations. A GNN is then trained in an end-to-end than the greedy heuristic without compromising on the quality
manner against some appropriately designed loss function using of the solution.
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
2114 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 11, NO. 2, MARCH/APRIL 2024
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
AHMAD et al.: GRAPH MACHINE LEARNING FRAMEWORK TO COMPUTE ZERO FORCING SETS IN GRAPHS 2115
from 0.2 to 0.7. Only for these small graphs, we also compute
Algorithm 1: Greedy Algorithm for ZFS.
optimal solutions using the wavefront algorithm [8]. In fact,
the graphs in this dataset are selected from a pool so that, Input: G = (V, E)
the greedy algorithm performs badly on them compared to Output: Zero forcing set Z
the optimal solution. The average difference in the size of the 1: Initialize: Z ← {}
optimal solutions ζop (G) and greedy solutions ζgr (G) is 6.1 2: while D(Z) = V do
vertices. 3: v ∗ ← argmax |D(Z ∪ {vi })|
vi ∈V \Z
-scale-free graphs: This dataset contains 500 sparse undirected 4: Z ← Z ∪ {v ∗ }
scale-free graphs synthesized using the same python module. 5: end while
The average degree of graphs in this dataset is 2.72 with the 6: for all vi ∈ Z do // removing redundancies
number of vertices ranging from 50 to 1000. 7: if D(Z \ {vi }) = V then
8: Z ← Z\ {vi }
B. Real-World Datasets 9: end if
We also include four standard real-world graph datasets in our 10: end for
database.
-IMDB-BINARY & IMDB-MULTI are both movie collaboration We define this process of the combination of random selection
datasets where the nodes are the actors/actresses and an edge along with greedy heuristics as a Random-Greedy algorithm.
is formed if the actors appear in the same movie [40]. We empirically validate Conjecture 1 utilizing the database
-REDDIT-BINARY is a social networks dataset consisting of presented in Section IV. We perform experiments on all four
graphs that represent the online Reddit discussions [41]. A real-world, large ER, and scale-free graph datasets. Only a subset
node is a user and an edge represents a reply to the comments. of large ER graphs dataset (300 graphs) is selected due to the
-COLLAB is a scientific collaboration dataset in which graphs computation overhead for these large graphs. The details of the
have researchers as nodes and their collaborations as experimental setup and the results are mentioned in the following
edges [42]. subsections.
Among these real-world datasets, REDDIT-BINARY is the
only one that contains graphs with more than 500 nodes. For this A. Experimental Setup
dataset, we only take the graphs that have less than 1200 nodes
since it becomes computationally very expensive to find the For all these synthetic and real-world datasets, we compute a
greedy solutions for larger graphs. The total time for computing greedy solution represented as Zgr (G) for a graph G = (V, E)
greedy solutions for REDDIT-BINARY is approximately 6 days. and the size of this set is represented as ζgr (G). The greedy al-
The graphs in the other datasets are prepared within a day. gorithm has two dominant parts; first to greedily populate Z(G)
Further statistics about these datasets are provided in Table I. until it is a complete solution, and then a greedy redundancy
These datasets are vital for the numerical analysis of the check to remove any extra nodes. The redundancy check is a
ZFS problem. In the next section, we provide some intriguing separate part, and it is applied in all our experiments where the
observations about the greedy algorithm using this database. greedy algorithm is used to complete the solution.
To corroborate our conjecture, we introduce a random se-
lection in the greedy algorithm. We provide an intermediate
V. INSIGHTS INTO GREEDY HEURISTICS FOR ZFS
solution computed by randomly selecting a vertex set which is
COMPUTATION
a fraction of ζgr (G) and then pass this partially formed solution
One of the approaches to compute approximate Z(G) is to the greedy algorithm to obtain a Z(G) primarily based on
the greedy approach in which we iteratively construct a Z(G) random selection. We represent the solution obtained by the
by adding a vertex to Z(G) that maximizes the span of the random-greedy algorithm as Zrdm (G) and its size as ζrdm (G).
intermediate solution as mentioned in Algorithm 1. However, For comparison, we already have the greedy solution and the
the selection of a vertex will be arbitrarily random if two or ζgr(G) for each graph, we only fix the randomness fraction ratio
more vertices have the same span. In the initial few iterations, τ to obtain the intermediate solution.
we observe that the greedy algorithm is likely to have multiple The performance of this approach is described by using the
potential vertices to add to the intermediate solution if the degree percentage deviation between the size of the Z(G) by the greedy
of all the vertices is large enough. During these iterations, solution ζgr (G), and the size of the Z(G) returned by this
the construction of the intermediate Z(G) solution by greedy partial random selection ζrdm (G). We report the average of the
algorithm might be as effective as the random selection. Based deviation over all the graph instances in the test sets of different
on this observation, we propose the following conjecture. datasets. Formally,
Conjecture 1: If half of the vertices of Zgr (G) are picked
randomly and the rest of the approximate solution is completed |K|
by the greedy solution, then it is fairly the same as obtaining a 1 ζrdm (Gi ) − ζgr (Gi )
Dev(ζrdm ) = × 100,
solution completely by the greedy algorithm. |K| i=0 ζgr (Gi )
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
2116 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 11, NO. 2, MARCH/APRIL 2024
Fig. 3. Sizes of the Z(G) computed by the greedy and the random selection
for the graphs in the REDDIT BINARY dataset with τ = 0.7.
Fig. 2. Results for synthetic and real-world datasets when a certain percentage
of vertices is selected randomly and the greedy algorithm is used to complete
the Z(G).
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
AHMAD et al.: GRAPH MACHINE LEARNING FRAMEWORK TO COMPUTE ZERO FORCING SETS IN GRAPHS 2117
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
2118 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 11, NO. 2, MARCH/APRIL 2024
Fig. 6. Algorithm overview. First, a regression model trained on ζ(G) provides an estimate ζ̂(G) of ζ(G). An intermediate solution Ŝ x is formed by randomly
selecting Trand × Ŝ. This intermediate solution is then completed iteratively using a multi-layered graph convolutional network f (G; θ) that provides a set of
probabilities for each vertex in input graph G until a ZFS Ŝ is formed. Lastly, a redundancy check is performed that removes any redundant vertices.
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
AHMAD et al.: GRAPH MACHINE LEARNING FRAMEWORK TO COMPUTE ZERO FORCING SETS IN GRAPHS 2119
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
2120 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 11, NO. 2, MARCH/APRIL 2024
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
AHMAD et al.: GRAPH MACHINE LEARNING FRAMEWORK TO COMPUTE ZERO FORCING SETS IN GRAPHS 2121
IX. CONCLUSION
Fig. 10. Comparison between random-greedy and the proposed GCN-based
solution. Plot (a) shows the deviation from the greedy solution with varying Minimum ZFS is a hard combinatorial optimization problem
randomness threshold τ , and plot (b) presents the respective time taken to with several applications across various domains. We presented
compute the solution on the whole dataset.
a novel graph convolutional network to compute a small-sized
ZFS. The proposed solution utilizes data-driven and algorithmic
insights to compute ZFS in large graphs effectively. Through
extensive experiments, we showed that the proposed approach
solution) deviation. The average size of the greedy solution is
is computationally efficient, scalable to much larger graphs,
449.3 vertices in the test set. This indicates that on average, there
generalizable to different graph families, and able to learn from
are about 3.5 more vertices in the Z(G) obtained by the GCN
sub-optimal datasets. Thus, our approach is not inhibited by
than in the greedy solution. However, the GCN takes a total time
the requirement to have optimal datasets consisting of large
of about 2.5 days (primarily the time for the redundancy check)
enough instances to train the graph learning models for com-
while the greedy algorithm takes about 48 days to compute the
binatorial optimization problems. We also contributed towards
Z(G) for the 1832 graphs in the test set. This essentially means
the future data-driven algorithms for the minimum ZFS problem
that the GCN architecture is less than 1% away from the solution
through several synthetic and real-world benchmark datasets
while being about 19 times faster on large graphs.
using greedy solutions as the ground truth. In addition, a small
We expect the time difference between GCN and greedy
graphs dataset containing hard instances is also annotated with
algorithm to grow exponentially for larger graphs. However, val-
optimal solutions. In the future, we aim to extend our approach
idating this hypothesis for significantly larger graphs becomes
to solving other combinatorial optimization problems, including
impractical, primarily due to the fact that the greedy algorithm’s
minimum dominating sets in graphs.
computational demands become prohibitively high on graphs
with more than 2500 vertices. As shown above, GCN algorithm
is computationally much faster than the greedy algorithm on REFERENCES
large graphs while preserving the solution quality. Even though [1] N. Barnier and P. Brisset, “Graph coloring for air traffic flow management,”
random selection and greedy heuristics can reduce time, GCN is Ann. Operations Res., vol. 130, no. 1, pp. 163–178, 2004.
[2] Y. Peng, B. Choi, B. He, S. Zhou, R. Xu, and X. Yu, “VColor: A practical
still significantly faster than the greedy algorithm, especially for vertex-cut based approach for coloring large graphs,” in Proc. IEEE 32nd
larger graphs. To substantiate this, we conduct an empirical study Int. Conf. Data Eng., 2016, pp. 97–108.
focusing on computational efficiency and solution quality. Our [3] B. Balasundaram and S. Butenko, “Graph domination, coloring and cliques
in telecommunications,” in Handbook of Optimization in Telecommunica-
experimentation involved varying the randomness threshold τ of tions. Berlin, Germany: Springer, 2006, pp. 865–890.
the initially selected vertices from 35% to 75%. To get a range of [4] F. Moradi, T. Olovsson, and P. Tsigas, “A local seed selection algorithm
sizes of the graphs, we select the REDDIT-BINARY graphs and for overlapping community detection,” in Proc. IEEE/ACM Int. Conf. Adv.
Social Netw. Anal. Mining, 2014, pp. 1–8.
a part of the Large ER graphs dataset to obtain graphs with the [5] N. Armenatzoglou, H. Pham, V. Ntranos, D. Papadias, and C. Shahabi,
number of vertices ranging between 6 − 1194. We compare the “Real-time multi-criteria social graph partitioning: A game theoretic ap-
quality of the solution using Dev(ζrdm ) and Dev(ζgcn ). Fig. 10 proach,” in Proc. ACM SIGMOD Int. Conf. Manage. Data, 2015, pp. 1617–
1628.
compares the random-greedy with the proposed GCN-based [6] P. R. Östergård, “A fast algorithm for the maximum clique problem,”
approach. Discrete Appl. Math., vol. 120, no. 1–3, pp. 197–207, 2002.
We observe that the random-greedy algorithm consumes sig- [7] D. Burgarth, D. D’Alessandro, L. Hogben, S. Severini, and M. Young,
“Zero forcing, linear and quantum controllability for systems evolving on
nificantly more time than our GCN-based approach, even when networks,” IEEE Trans. Autom. Control, vol. 58, no. 9, pp. 2349–2354,
allowing for up to 75% random selection. This underscores the Sep. 2013.
computational advantages of our method, especially for larger [8] B. Brimkov, C. C. Fast, and I. V. Hicks, “Computational approaches for
zero forcing and related problems,” Eur. J. Oper. Res., vol. 273, no. 3,
graphs. Fig. 10(a) demonstrates that increasing the randomness pp. 889–903, 2019.
threshold τ slightly deteriorates the solution quality for both [9] P. A. Dreyer Jr. and F. S. Roberts, “Irreversible k-threshold processes:
approaches. This trade-off is expected, but our approach con- Graph-theoretical threshold models of the spread of disease and of opin-
ion,” Discrete Appl. Math., vol. 157, no. 7, pp. 1615–1627, 2009.
sistently maintains a competitive solution quality. Additionally, [10] N. Monshizadeh, S. Zhang, and M. K. Camlibel, “Zero forcing sets and
the time taken by the random selection and the greedy heuristics controllability of dynamical systems defined on graphs,” IEEE Trans.
decreases with the increase of threshold τ as it takes fewer Autom. Control, vol. 59, no. 9, pp. 2562–2567, Sep. 2014.
[11] S. S. Mousavi, M. Haeri, and M. Mesbahi, “On the structural and strong
iterations of the greedy algorithm to complete the solution. structural controllability of undirected networks,” IEEE Trans. Autom.
Conversely, our GCN-based approach incurs a slight increase in Control, vol. 63, no. 7, pp. 2234–2241, Jul. 2018.
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
2122 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 11, NO. 2, MARCH/APRIL 2024
[12] A. Aazami, “Hardness results and approximation algorithms for some [37] Y. Jiang, Y. Rong, H. Cheng, X. Huang, K. Zhao, and J. Huang, “Query
problems on graphs,” Ph.D. dissertation, University of Waterloo, Waterloo, driven-graph neural networks for community search: From non-attributed,
ON, Canada, 2008. attributed, to interactive attributed,” Proc. VLDB Endowment, vol. 15,
[13] W. Abbas, M. Shabbir, Y. Yazıcıoğlu, and X. Koutsoukos, no. 6, pp. 1243–1255, 2022.
“Leader selection for strong structural controllability in networks [38] O. Vinyals, M. Fortunato, and N. Jaitly, “Pointer networks,” in Proc. 28th
using zero forcing sets,” in Proc. Amer. Control Conf., 2022, Int. Conf. Neural Inf. Process. Syst., vol. 2, 2015, pp. 2692–2700.
pp. 1444–1449. [39] Z. Li, Q. Chen, and V. Koltun, “Combinatorial optimization with graph
[14] F. Gama, E. Tolstaya, and A. Ribeiro, “Graph neural networks for de- convolutional networks and guided tree search,” in Proc. 32nd Int. Conf.
centralized controllers,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Neural Inf. Process. Syst., 2018, pp. 537–546.
Process,, 2021, pp. 5260–5264. [40] P. Yanardag and S. Vishwanathan, “Deep graph kernels,” in Proc.
[15] M. Boffa, Z. B. Houidi, J. Krolikowski, and D. Rossi, “Neural combinato- 21th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining, 2015,
rial optimization beyond the TSP: Existing architectures under-represent pp. 1365–1374.
graph structure,” in Proc. AAAI Workshop Graphs More Complex Struc- [41] W. L. Hamilton, Z. Ying, and J. Leskovec, “Inductive representation
tures Learn. Reasoning, 2022. learning on large graphs,” in Proc. 31st Int. Conf. Neural Inf. Process. Syst.,
[16] Q. Cappart, D. Chételat, E. Khalil, A. Lodi, C. Morris, and 2017, pp. 1025–1035.
P. Veličković, “Combinatorial optimization and reasoning with graph [42] C. Morris, N. M. Kriege, F. Bause, K. Kersting, P. Mutzel, and
neural networks,” J. Mach. Learn. Res., vol. 24, no. 130, pp. 1–61, M. Neumann, “TUDataset: A collection of benchmark datasets for learning
2023. with graphs,” 2020, arXiv:2007.08663.
[17] M. Böther, O. Kißig, M. Taraz, S. Cohen, K. Seidel, and [43] T. N. Kipf and M. Welling, “Semi-supervised classification with graph
T. Friedrich, “What’s wrong with deep learning in tree search for convolutional networks,” in Proc. Int. Conf. Learn. Representations, 2016.
combinatorial optimization,” in Proc. Int. Conf. Learn. Representations, [44] L. Breiman, “Random forests,” Mach. Learn., vol. 45, no. 1, pp. 5–32,
2022, pp. 1–25. 2001.
[18] K. Xu, M. Zhang, J. Li, S. S. Du, K. Kawarabayashi, and S. Jegelka, “How
neural networks extrapolate: From feedforward to graph neural networks,”
in Proc. Int. Conf. Learn. Representations, 2020, pp. 1–52.
[19] C. K. Joshi, T. Laurent, and X. Bresson, “An efficient graph convo-
lutional network technique for the travelling salesman problem,” 2019,
arXiv:1906.01227.
[20] I. Bello, H. Pham, Q. V. Le, M. Norouzi, and S. Bengio, “Neural combina-
torial optimization with reinforcement learning,” 2016, arXiv:1611.09940.
[21] Q. Cappart, D. Chételat, E. B. Khalil, A. Lodi, C. Morris, and P. Velickovic,
“Combinatorial optimization and reasoning with graph neural networks,”
J. Mach. Learn. Res., vol. 24, no. 130, pp. 1–61, 2023.
[22] W. Kool, H. van Hoof, and M. Welling, “Attention, learn to solve routing Obaid Ullah Ahmad received the B.S. degree in
problems!,” in Proc. Int. Conf. Learn. Representations, 2019, pp. 1–25. electrical engineering from the University of Engi-
[23] Q. Wang and C. Tang, “Deep reinforcement learning for transportation net- neering and Technology, Lahore, Pakistan, in 2019,
work combinatorial optimization: A survey,” Knowl.-Based Syst., vol. 233, and the M.S. degree in computer science from In-
2021, Art. no. 107526. formation Technology University, Lahore, in 2021.
[24] AIM Minimum Rank Special Graphs Work Group, “Zero forcing sets He is currently working toward the Ph.D. degree in
and the minimum rank of graphs,” Linear Algebra Appl., vol. 428, no. 7, electrical engineering from the University of Texas
pp. 1628–1648, 2008. at Dallas, Richardson, TX, USA. He is currently a
[25] M. Trefois and J.-C. Delvenne, “Zero forcing number, constrained match- Research Assistant with the University of Texas at
ings and strong structural controllability,” Linear Algebra Appl., vol. 484, Dallas’ Control, Intelligence, Resilience in Networks
pp. 199–218, 2015. and Systems Lab, Richardson. His current research
[26] A. Chapman and M. Mesbahi, “On strong structural controllability of interests include control-based approaches, for graph machine learning, network
networked systems: A constrained matching approach,” in Proc. Amer. optimization, and multi-robot systems.
Control Conf., 2013, pp. 6126–6131.
[27] S. Fallat, K. Meagher, and B. Yang, “On the complexity of the posi-
tive semidefinite zero forcing number,” Linear Algebra Appl., vol. 491,
pp. 101–122, 2016.
[28] I. V. Hicks et al., “Computational and theoretical challenges for com-
puting the minimum rank of a graph,” INFORMS J. Comput., vol. 34,
pp. 2868–2872, 2022.
[29] D. Ferrero et al., “Rigid linkages and partial zero forcing,” Elec. J.
Combinatorics, pp. P2–43, 2019.
[30] F. H. Kenter and J. C.-H. Lin, “On the error of a priori sampling:
Zero forcing sets and propagation time,” Linear Algebra Appl., vol. 576,
pp. 124–141, 2019.
[31] J. C.-H. Lin, P. Oblak, and H. Šmigoc, “The strong spectral property for
graphs,” Linear Algebra Appl., vol. 598, pp. 68–91, 2020. Mudassir Shabbir received the Ph.D. degree from
[32] D. Burgarth and V. Giovannetti, “Full control by locally induced relax- the Division of Computer Science, Rutgers Univer-
ation,” Phys. Rev. Lett., vol. 99, no. 10, 2007, Art. no. 100501. sity, New Brunswick, NJ, USA, in 2014. He is cur-
[33] S. Butler et al., “Minimum rank library. Sage programs for calculating rently an Associate Professor with the Department
bounds on the minimum rank of a graph, and for computing zero forcing of Computer Science, Information Technology Uni-
parameters,” 2014, Accessed: Mar. 24, 2022. [Online]. Available: https: versity, Lahore, Pakistan, and a Research Assistant
//github.com/jasongrout/minimum_rank Professor with Vanderbilt University, Nashville, TN,
[34] B. Brimkov, D. Mikesell, and I. V. Hicks, “Improved computational USA. He was with the Lahore University of Man-
approaches and heuristics for zero forcing,” INFORMS J. Comput., vol. 33, agement Sciences, Pakistan; Los Alamos National
pp. 1259–1684, 2021. Labs, NM, Bloomberg L.P. New York, NY, USA;
[35] A. Agra, J. O. Cerdeira, and C. Requejo, “A computational comparison of and with Rutgers University. He was Rutgers Honors
compact MILP formulations for the zero forcing number,” Discrete Appl. Fellow for 2011 to 2012. His main research interests include algorithmic and
Math., vol. 269, pp. 169–183, 2019. discrete geometry, and has developed new methods for the characterization and
[36] A. Weber, G. Reissig, and F. Svaricek, “A linear time algorithm to verify computation of succinct representations of large data sets with applications in
strong structural controllability,” in Proc. IEEE 53rd Conf. Decis. Control, non-parametric statistical analysis. He also works on graph machine learning
2014, pp. 5574–5580. and resilient network systems.
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.
AHMAD et al.: GRAPH MACHINE LEARNING FRAMEWORK TO COMPUTE ZERO FORCING SETS IN GRAPHS 2123
Waseem Abbas (Member, IEEE) received the M.Sc. Xenofon Koutsoukos (Fellow, IEEE) received the
and Ph.D. degrees in electrical and computer engi- Ph.D. degree in electrical engineering from the Uni-
neering from the Georgia Institute of Technology, versity of Notre Dame, Notre Dame, IN, USA, in
Atlanta, GA, USA, in 2010 and 2013, respectively. 2000. He is currently the Thomas R. Walters Pro-
He is currently an Assistant Professor with the Sys- fessor and Chair of the Department of Computer Sci-
tem Engineering Department, University of Texas at ence, School of Engineering, Vanderbilt University,
Dallas, Richardson, TX, USA. He was a Research Nashville, TN, USA. He is also a Senior Research
Assistant Professor with the Vanderbilt University, Scientist with the Institute for Software Integrated
Nashville, TN, USA. He was a Fulbright Scholar from Systems (ISIS) and holds a secondary appointment
2009 to 2013. His research interests include control with the Department of Electrical and Computer En-
of networked systems, resilience and robustness in gineering. He was a Member of Research Staff with
networks, distributed optimization, and graph-theoretic methods in complex the Xerox Palo Alto Research Center (PARC) during 2000–2002. He has
networks. coauthored more than 350 journal and conference papers, and he is co-inventor
of four U.S. patents. His research interests include cyber-physical systems with
emphasis on learning-enabled systems, security and resilience, diagnosis and
fault tolerance, distributed algorithms, formal methods, and adaptive resource
management. Dr. Koutsoukos was the recipient of the NSF Career Award in 2004,
Excellence in Teaching Award in 2009 from the Vanderbilt University School
of Engineering, and the 2011 NASA Aeronautics Research Mission Directorate
(ARMD) Associate Administrator (AA) Award in Technology and Innovation.
Authorized licensed use limited to: University of Dubrovnik. Downloaded on April 20,2024 at 05:46:04 UTC from IEEE Xplore. Restrictions apply.