
Frontiers in Science and Engineering Volume 4 Issue 4, 2024

ISSN: 2710-0588

Syntax and Relation Enhanced Query Generation for Text-to-SQL Parsing
Kun Han1,2, Xi Xiong1,2, *
1 School of Cybersecurity, Chengdu University of Information Technology, Chengdu 610225,
China
2 Advanced Cryptography System Security Key Laboratory of Sichuan Province, Chengdu
610225, China

Abstract
Text-to-SQL parsing is the task of converting natural language questions into executable
SQL queries, a significant branch of semantic parsing, which has gained increasing
attention in recent years. This technology lowers the barrier for people to access
databases, enhancing the convenience and availability of data. However, the primary
challenge for text-to-SQL parsing lies in domain adaptation, which concerns whether the
model can be applied to new databases and effectively align natural language questions
with the corresponding tables or columns within the database. To address these issues, this
study introduces SRSQL (Syntax and Relation Enhanced Query Generation), which incorporates
syntax information and predefined relationships into the model, effectively exploiting
syntactic dependencies and schema linking to improve performance. Using a
Transformer-based decoder, SRSQL generates SQL queries in the form of
Abstract Syntax Trees (AST), significantly boosting prediction accuracy. Experimental
results show that SRSQL outperforms comparative models, particularly on challenging
benchmarks like Spider and Spider-SYN.

Keywords
Text-to-SQL; Syntactic Dependencies; Transformer; Abstract Syntax Trees.

1. Introduction
As the era of information technology advances, vast amounts of data are generated daily from
individuals and businesses. To facilitate management and operations, these data are commonly
stored in relational databases. To retrieve the needed data from these databases, Structured
Query Language (SQL) or similar structured query languages are employed. Despite the
growing popularity of relational databases, non-expert users often face limitations in accessing
information due to the need to understand complex query languages, creating a certain level of
usability barrier. Consequently, Text-to-SQL [1] parsing has garnered significant attention,
aiming to directly convert natural language questions into corresponding SQL query statements.
This task alleviates the challenges faced by non-expert users when interacting with relational
databases, to a certain extent.
Developing large-scale annotated question datasets along with corresponding SQL queries has
propelled advancements in the field. In contrast to previous efforts on parsed datasets [2], the
new datasets such as WikiSQL [3] and Spider [4] heavily test models' ability to generalize to
unseen database schemas. Each query in these new tasks is based on a multi-table database
architecture, and database schemas do not repeat between training and test sets.
A key issue in achieving domain generalization [5] is the need for complex reasoning to generate
structurally rich SQL queries. Accurately contextualizing user queries against a specific
database involves both explicit relationships (such as those defined by the database schema for tables and
columns) and implicit relationships (such as determining whether phrases in the query
correspond to specific columns or tables).
However, this implies that models need to predict queries for database contexts unseen during
training and accurately express query intent through SQL logic. Therefore, cross-database
text-to-SQL parsers cannot rely solely on the SQL patterns observed during training; they must
also accurately model natural language questions, database structures, and the context shared
between them.
Current research typically adopts the following strategies to enhance the cross-domain
generalization ability of models. First, schema-based embedding functions are learned so that
questions and database schemas contextualize each other [6]. Second, pretrained language
models such as BERT [7] and RoBERTa [8] have been shown to improve prediction accuracy by
capturing semantic relationships across contexts and long-distance dependencies. Methods that
incorporate syntax-enhanced synthetic examples into the BERT pretraining framework, alongside
a basic semantic parser, have also shown promising results.
This study introduces SRSQL, which integrates joint encoding of questions and schemas into a
novel Transformer [9] variant for text-to-SQL parsing. By representing each natural language
question as a graph with multiple relations, including syntactic dependencies and part-of-
speech, and a database schema as a graph composed of tables, columns, and their relationships,
SRSQL employs a relation-aware Transformer to learn the connections between the schema
and the question. A Transformer-based tree decoder is then proposed to generate SQL queries
in Abstract Syntax Tree (AST) form. Experiments on the Spider benchmark show that SRSQL
attains a 74.5% Exact Match (EM) accuracy on the test set, outperforming the baseline model
significantly.

2. Related Work
In earlier research, a commonly used approach was sketch-based slot filling, which employs
different modules to predict various parts of the generated SQL query. This method
decomposes the SQL generation task into several independent sketches and utilizes different
classifiers to predict each part, such as SQLNet [10], TypeSQL [11], RYANSQL [12], X-SQL [13], among
others. Most of these methods only handle simple queries, making it challenging to apply them
to more complex scenarios.
There are various methods to address the challenges posed by complex SQL tasks. One common
approach is to use generic neural network-enhanced encoders for global reasoning over natural
language questions and database schemas. For instance, IRNet [14] employs LSTM and self-
attention mechanisms to encode questions and schemas separately, while BRIDGE [15] serializes
questions and schemas into token sequences and maximizes the utilization of BERT and
database content to capture the linking relationships between questions and schemas.
On the other hand, many studies utilize graph structures to represent a series of complex
relationships. For example, Global-GNN [16] employs graph neural networks (GNN) to output
queries selecting subsets of tables or columns, while ShadowGNN [17] introduces a graph
projection neural network to abstract representations of questions and schemas. Further
developments include SADGA [18], which separately encodes question graphs and schema
graphs based on dependency relationship structures and database schemas, and SDSQL [19],
which improves structured reasoning by modeling relationships between schemas and
questions.
Recent work has demonstrated the effectiveness of fine-tuning pretrained models. For instance,
Shaw et al. [20] showed that fine-tuning the pretrained T5-3B model can yield highly competitive
results. Building upon this, PICARD [21] was introduced, a technique that restricts the
autoregressive decoder by applying incremental parsing during inference. In real time, it filters
out syntactically incorrect sequences during beam search, significantly enhancing the quality
of generated SQL.
In summary, due to the complexity of semantic understanding and data dependency
relationships, generating complex query statements can lead to syntax errors, inappropriate
semantics, and database connection errors. The SRSQL model proposed in this paper effectively
alleviates these issues.

3. Problem Definition
The task aims to transform an input natural language question Q into the Abstract Syntax Tree
(AST) representation y of the corresponding SQL query, given a database schema S = (T, C).
Specifically, let Q = (q_1, q_2, …, q_{|Q|}) be a sequence of natural language tokens, where |Q|
is the length of the question. The database schema S includes tables T = (t_1, t_2, …, t_{|T|})
and columns C = (c_1, c_2, …, c_{|C|}), where |T| and |C| denote the number of tables and
columns in the database, respectively. Each table name t_i ∈ T can be represented as a sequence
of tokens t_i = (t_{i,1}, t_{i,2}, …, t_{i,|t_i|}), where |t_i| is the number of tokens in the
table name. Similarly, each column name c_i ∈ C is represented as c_i = (c_{i,1}, c_{i,2}, …,
c_{i,|c_i|}), where |c_i| is the number of tokens in the column name.
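As an illustrative, non-authoritative sketch of this formulation, the question and schema can be stored as plain token sequences; all names and the toy database below are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Column:
    name_tokens: List[str]      # c_i = (c_{i,1}, ..., c_{i,|c_i|})
    table_index: int            # index of the table this column belongs to

@dataclass
class Schema:
    tables: List[List[str]]     # each table name t_i as a token sequence
    columns: List[Column]

# A toy instance of (Q, S) for a question over an invented "singer" database.
question = ["Find", "the", "names", "of", "all", "singers"]   # Q, |Q| = 6
schema = Schema(
    tables=[["singer"], ["concert"]],                          # T, |T| = 2
    columns=[Column(["name"], 0), Column(["age"], 0),
             Column(["concert", "name"], 1)],                  # C, |C| = 3
)
print(len(question), len(schema.tables), len(schema.columns))
```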

4. Model
4.1. Model Framework

Figure 1. Model Framework Diagram

The overall structure of the SRSQL model is illustrated in Figure 1. The model inherits the end-to-
end capabilities of the Transformer and treats the text-to-SQL problem as a translation task. The
model consists of L layers of encoders and decoders, with the encoding of the question and the
database schema concatenated as the model's input. We utilize a relation-aware Transformer
[22] as the encoder (see Section 4.3), which employs relation-aware self-attention [23] to replace

the original self-attention mechanism. Additionally, we extend the Transformer decoder (see
Section 4.4), integrating node types and embeddings of previous actions to autoregressively
generate SQL queries. These queries are a series of actions derived according to SQL syntax.


4.2. Question Graph and Schema Graph Construction


In this section, we will present the process of creating the model's input. The input to the model
is a combination of a natural language question and a database schema. To represent these, we
employ a graph-based modeling approach, with the detailed methodology described below.
Question Graph Construction: The natural language question can be represented as a graph
G_Q = (Q, R), where the node set Q consists of the natural language tokens and
R = (r_1, r_2, …, r_{|R|}) represents the relationships between words. In this work, we
construct the graph from the syntactic dependency relationships between words in the question.
Figure 2 shows an example of a question relation graph with the part-of-speech tag and
dependency relations for each token. In this example, the token "names" has an object
dependency relation with the token "Find", and "Find" and "names" are tagged as a verb (VB)
and a noun (NN), respectively. We employ a Graph Attention Network (GAT) [24] to encode the
question graph, obtaining representations Z_i ∈ R^d for i ∈ {1, …, |Q|}, where d is the size of
the hidden layer.

Figure 2. Example of a Question Graph with Part-of-Speech Tagging and Dependency Relations
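The paper does not give code for constructing the question graph; the sketch below shows one plausible way to extract dependency edges and part-of-speech tags with stanza (the toolkit listed in Section 5.3), assuming the English models have already been downloaded with stanza.download("en"). The helper name question_graph is ours.

```python
import stanza

# Pipeline with POS tagging and dependency parsing.
# stanza.download("en") must have been run once to fetch the models.
nlp = stanza.Pipeline("en", processors="tokenize,pos,lemma,depparse")

def question_graph(question: str):
    """Return (tokens, pos_tags, edges), where edges are (head index, dependent index, relation)."""
    doc = nlp(question)
    tokens, pos_tags, edges = [], [], []
    offset = 0
    for sent in doc.sentences:
        for word in sent.words:
            tokens.append(word.text)
            pos_tags.append(word.xpos)              # Penn Treebank tags such as VB, NNS
            if int(word.head) > 0:                  # head == 0 marks the root
                edges.append((offset + int(word.head) - 1,
                              offset + int(word.id) - 1,
                              word.deprel))
        offset += len(sent.words)
    return tokens, pos_tags, edges

tokens, pos_tags, edges = question_graph("Find the names of all singers")
print(list(zip(tokens, pos_tags)))
print(edges)   # e.g. an 'obj' edge from "Find" to "names"
```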

Database Schema Graph Construction: The database schema is represented as a graph
G_S = (S, R), where the node set S = (T, C) consists of the tables T and columns C in the
database schema, and R = {r_1, …, r_{|R|}} represents the structural relationships between
entities and attributes in the schema. We employ classic database-specific relationships, such
as whether a column belongs to a table, whether it is a primary key of that table, and whether
it is a foreign key referencing another column. Figure 3 illustrates an example of a database
schema graph. We also encode the schema graph using GAT [24] and obtain vector
representations for each database schema through global average pooling.

Figure 3. Example of a Database Schema Graph
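As a hypothetical illustration of the schema graph described above, the snippet below enumerates the classic relations (column-belongs-to-table, primary key, foreign key) as typed edges; the schema contents and the helper name are invented for the example.

```python
# Hypothetical schema for illustration: tables, columns with key flags, and foreign keys.
schema = {
    "tables": ["singer", "concert"],
    "columns": [  # (column name, owning table, is primary key)
        ("singer_id", "singer", True),
        ("name", "singer", False),
        ("concert_id", "concert", True),
        ("singer_id", "concert", False),
    ],
    # foreign key: (referencing table, column) -> (referenced table, column)
    "foreign_keys": [(("concert", "singer_id"), ("singer", "singer_id"))],
}

def schema_graph_edges(schema):
    """Return typed edges (source node, relation, target node) of the schema graph G_S."""
    edges = []
    for col, table, is_pk in schema["columns"]:
        edges.append(((table, col), "BELONGS-TO", table))
        if is_pk:
            edges.append(((table, col), "PRIMARY-KEY-OF", table))
    for src, dst in schema["foreign_keys"]:
        edges.append((src, "FOREIGN-KEY-OF", dst))
    return edges

for edge in schema_graph_edges(schema):
    print(edge)
```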


4.3. Relation-Aware Transformer Encoder
The relation-aware Transformer replaces the conventional self-attention mechanism in the
Transformer with relation-aware self-attention. This is an embedding module designed for
semi-structured sequences, which jointly encodes both the inherent relationships between

elements within the input and predefined relations. We utilize it to encode the linkages between
questions and database schemas, which will be elaborated on in Section 4.4.
Suppose the input to each encoder layer in the original Transformer is a sequence of n word
vectors X = (x_1, …, x_n), with x_i ∈ R^{d_x}, and the output is a new sequence
Y = (y_1, …, y_n), with y_i ∈ R^{d_x}. The model consists of multiple stacked self-attention
layers, each containing H attention heads. Each output element y_i is calculated by first
linearly transforming the input elements x_j and then computing a weighted sum via
self-attention, following this process:

e_{ij}^{(h)} = \frac{x_i W_Q^{(h)} \big(x_j W_K^{(h)}\big)^{\top}}{\sqrt{d_x / H}}, \qquad
\alpha_{ij}^{(h)} = \operatorname{softmax}_{j}\big(e_{ij}^{(h)}\big)    (1)

z_i^{(h)} = \sum_{j=1}^{n} \alpha_{ij}^{(h)} \big(x_j W_V^{(h)}\big), \qquad
z_i = \operatorname{Concat}\big(z_i^{(1)}, \dots, z_i^{(H)}\big)    (2)

\tilde{y}_i = \operatorname{LayerNorm}(x_i + z_i)    (3)

y_i = \operatorname{LayerNorm}\big(\tilde{y}_i + \operatorname{FC}(\operatorname{ReLU}(\operatorname{FC}(\tilde{y}_i)))\big)    (4)

where W_Q^{(h)}, W_K^{(h)}, W_V^{(h)} ∈ R^{d_x × (d_x/H)} are learnable weight matrices for
head h (1 ≤ h ≤ H), FC refers to a fully connected layer, and LayerNorm denotes layer
normalization.
In a Transformer model, each attention head in every layer computes implicit relationships
between input elements, with the strength of these relationships encoded in the attention
weights α_{ij}^{(h)}. However, if we have prior knowledge about certain relationships between
input elements and wish to guide the model to learn them, we can employ a relation-aware
Transformer that incorporates explicit relationships into the attention module. This is done by
modifying equations (1) and (2) as follows:

e_{ij}^{(h)} = \frac{x_i W_Q^{(h)} \big(x_j W_K^{(h)} + r_{ij}^{K}\big)^{\top}}{\sqrt{d_x / H}}    (5)

z_i^{(h)} = \sum_{j=1}^{n} \alpha_{ij}^{(h)} \big(x_j W_V^{(h)} + r_{ij}^{V}\big)    (6)

where r_{ij}^{K} and r_{ij}^{V} are learned embeddings of the predefined relation between elements i and j.
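The following PyTorch sketch shows a minimal single-head version of the relation-aware self-attention in equations (5) and (6). It is an illustration only, not the paper's implementation; the class name, dimensions, and relation vocabulary size are assumptions.

```python
import math
import torch
import torch.nn as nn

class RelationAwareSelfAttention(nn.Module):
    """One head of relation-aware self-attention (Eqs. 5-6), simplified sketch."""

    def __init__(self, d_x: int, d_head: int, num_relations: int):
        super().__init__()
        self.w_q = nn.Linear(d_x, d_head, bias=False)
        self.w_k = nn.Linear(d_x, d_head, bias=False)
        self.w_v = nn.Linear(d_x, d_head, bias=False)
        # Learned embeddings r^K and r^V for each predefined relation type.
        self.rel_k = nn.Embedding(num_relations, d_head)
        self.rel_v = nn.Embedding(num_relations, d_head)

    def forward(self, x: torch.Tensor, relations: torch.Tensor) -> torch.Tensor:
        # x: (n, d_x) input sequence; relations: (n, n) relation ids between elements.
        q, k, v = self.w_q(x), self.w_k(x), self.w_v(x)          # each (n, d_head)
        r_k, r_v = self.rel_k(relations), self.rel_v(relations)  # each (n, n, d_head)
        # e_ij = q_i . (k_j + r_ij^K) / sqrt(d_head)  -- Eq. (5), with d_head = d_x / H
        scores = (q.unsqueeze(1) * (k.unsqueeze(0) + r_k)).sum(-1) / math.sqrt(q.size(-1))
        alpha = torch.softmax(scores, dim=-1)                    # (n, n)
        # z_i = sum_j alpha_ij (v_j + r_ij^V)         -- Eq. (6)
        z = (alpha.unsqueeze(-1) * (v.unsqueeze(0) + r_v)).sum(dim=1)
        return z                                                 # (n, d_head)

# Toy usage: 5 input elements, 8 relation types.
attn = RelationAwareSelfAttention(d_x=16, d_head=16, num_relations=8)
x = torch.randn(5, 16)
relations = torch.randint(0, 8, (5, 5))
print(attn(x, relations).shape)   # torch.Size([5, 16])
```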

4.4. Schema Linking


Schema linking refers to the operation of aligning tables or columns mentioned in a natural
language question with those in the database. We utilize relation-aware self-attention to
encode the schema linking relationships. To model them, we introduce an interaction graph
G_I = (V, β), whose structure is similar to that of the database schema graph. In this graph,
the node set V = Q ∪ T ∪ C encompasses question tokens, table names, and column names, and β
represents the schema linking relationships between question words and the database schema.
Schema linking generally occurs in two ways, name-based linking and content-based linking,
which we describe in detail next.


4.4.1. Name-based Linking


We follow the approach employed by Wang et al. [22], using n-gram matching to determine the
degree of match between items mentioned in the question and the corresponding items in the
schema, i.e., whether an n-gram of the question matches a table or column name exactly or
partially. Consequently, for each edge (i, j) in the interaction graph, we categorize the
relationship based on the types of x_i and x_j. The relation types we consider are
QUESTION-COLUMN-M and QUESTION-TABLE-M, where M is one of EXACTMATCH, PARTIALMATCH, and
NOMATCH. These relationships are all asymmetric.
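The sketch below illustrates one simple n-gram matching routine for name-based linking. The actual matching rules of Wang et al. [22] are more detailed, so this should be read as an assumption-laden simplification with an invented helper name.

```python
def name_based_link(question_tokens, schema_name):
    """Classify the link between a question and a table/column name as EXACTMATCH,
    PARTIALMATCH, or NOMATCH, using simple n-gram overlap."""
    name_tokens = schema_name.lower().split()
    q = [t.lower() for t in question_tokens]
    # Exact match: the full name appears as a contiguous n-gram in the question.
    n = len(name_tokens)
    for i in range(len(q) - n + 1):
        if q[i:i + n] == name_tokens:
            return "EXACTMATCH"
    # Partial match: at least one name token appears somewhere in the question.
    if any(tok in q for tok in name_tokens):
        return "PARTIALMATCH"
    return "NOMATCH"

print(name_based_link(["Find", "the", "singer", "names"], "singer name"))  # PARTIALMATCH
print(name_based_link(["Find", "the", "singer", "name"], "singer name"))   # EXACTMATCH
```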
4.4.2. Database Content Linking
If the question directly mentions the value in a column of the database, rather than mentioning
the table name or column name, database content linkage can be used. We follow the same
process as Lin et al. [15] to capture the mentioned database content. Firstly, fuzzy string
matching is performed between the question tokens and the values in each column of the
database. Then, the matched values are inserted into the input sequence after the
corresponding column name. This relationship is represented as VALUE-MATCH.
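A small sketch of content-based linking using Python's difflib, again a simplification of the procedure of Lin et al. [15]; the column contents, cutoff value, and helper name are invented.

```python
import difflib

def value_links(question_tokens, column_values, cutoff=0.85):
    """Return (question token, column, matched value) triples for tokens that fuzzily
    match a database value; such pairs receive the VALUE-MATCH relation."""
    links = []
    for tok in question_tokens:
        for column, values in column_values.items():
            close = difflib.get_close_matches(tok.lower(),
                                              [str(v).lower() for v in values],
                                              n=1, cutoff=cutoff)
            if close:
                links.append((tok, column, close[0]))
    return links

column_values = {"singer.country": ["France", "Netherlands", "United States"]}
print(value_links(["Show", "singers", "from", "france"], column_values))
# [('france', 'singer.country', 'france')]
```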
4.5. Autoregressive Tree Decoder
In the decoding phase, previous work primarily relied on LSTM-based decoders to generate
Abstract Syntax Tree (AST). Our approach, however, extends the original Transformer decoder
in an autoregressive manner for AST generation. This method offers an advantage over LSTM-
based decoders by better preserving the context of previously generated query parts for longer
sequences.
The decoder structure of SRSQL, as depicted in Figure 4, differs from the original Transformer
decoder. It does not employ masked self-attention but utilizes cross-attention instead, with H
heads. The input to the decoder consists of the hidden states from the encoder, the current node
type, and the action from the previous time step. The node type is incorporated as a residual
term within the multi-head attention mechanism. The decoder then goes through L layers,
ultimately outputting probabilities for generating the next action at each time step.

Figure 4. Decoder structure diagram


In the SRSQL decoder, each node possesses two attributes: a node type and the action from the
previous time step. We denote the vector representation of the current node type as m_t, and
the vector representation of the action from the previous time step as a_{t-1}.
Specifically, node types include SQL keywords, table names, and column names from the
database. Decoding actions are divided into two categories: (1) applying the generated rules to
the current syntax tree, which is the APPLYRULE action, and (2) selecting a table or column
from the database schema, which are the SELECTTABLE and SELECTCOLUMN actions. The
process of generating the Abstract Syntax Tree (AST) involves sequential application of these
actions, with a depth-first traversal order for constructing a SQL query y. As shown in Figure 5,
SQL statements based on the AST are generated using a context-free grammar.

Figure 5. Example of an Abstract Syntax Tree
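To make the action formalism concrete, the following toy example (our own illustration, using an invented mini-grammar rather than the paper's full SQL grammar) lists a depth-first action sequence that would build the AST of a simple query.

```python
# A hypothetical, heavily reduced grammar; the paper uses a full SQL grammar.
GRAMMAR = {
    "sql -> select": 0,
    "select -> SELECT col FROM tab": 1,
}

# Depth-first action sequence for "SELECT name FROM singer":
actions = [
    ("APPLYRULE", "sql -> select"),
    ("APPLYRULE", "select -> SELECT col FROM tab"),
    ("SELECTCOLUMN", "singer.name"),   # fills the `col` slot
    ("SELECTTABLE", "singer"),         # fills the `tab` slot
]

for step, (kind, arg) in enumerate(actions):
    print(f"t={step}: {kind}({arg})")
```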

When incorporating m_t as a bias term in the calculation of attention, the formulas take the
following form:

A = \frac{Q K^{\top}}{\sqrt{d}} + U    (7)

U = W_m\, m_t    (8)

Here, the query vector Q is derived from the action vector a_{t-1} of the previous time step,
while the key vector K and value vector V come from the hidden states of the encoder. The bias
U is derived from the node-type vector m_t, with W_m being a learnable weight matrix. In the
l-th layer, the residual update for the node vector n_t^{(l)} can be written as follows, where
‖ denotes concatenation and H is the number of attention heads:

n_t^{(l)} = n_t^{(l-1)} + O\Big[\big\Vert_{h=1}^{H} \operatorname{softmax}\big(A^{(h)}\big)\, V^{(h)}\Big]    (9)


Consequently, after passing through the L decoder layers, the decoder state n_t^{(L)} at the
current time step is fed into an action-output MLP (multi-layer perceptron), which computes the
probability distribution P(a_t) of the APPLYRULE action at that step:

P(a_t = \operatorname{APPLYRULE}[R] \mid a_{<t}, y) = \operatorname{softmax}\big(W_R\, g(n_t^{(L)})\big)    (10)

Here, g(·) represents an MLP with a tanh activation function, and a_{<t} denotes all actions up
to time step t. For the SELECTTABLE action, we compute:

P(a_t = \operatorname{SELECTTABLE}[i] \mid a_{<t}, y) = \operatorname{softmax}\big(W_T\, n_t^{(L)}\big)    (11)

The calculation for the SELECTCOLUMN action is analogous. The prediction of the AST form of the
final SQL query can thus be decomposed into a sequence of actions a = (a_1, …, a_{|a|}), and the
resulting training objective is:

L = -\sum_{t=1}^{|a|} \log P\big(a_t \mid a_{<t}, S, Q\big)    (12)
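A minimal sketch of the objective in Eq. (12): given the decoder's per-step scores over actions and the gold action indices under teacher forcing, the loss is the summed negative log-likelihood. Tensor shapes and the function name are assumptions.

```python
import torch
import torch.nn.functional as F

def action_sequence_loss(action_logits: torch.Tensor, gold_actions: torch.Tensor) -> torch.Tensor:
    """Eq. (12): L = -sum_t log P(a_t | a_<t, S, Q).

    action_logits: (|a|, num_actions) unnormalized scores at each decoding step.
    gold_actions:  (|a|,) index of the gold action at each step.
    """
    log_probs = F.log_softmax(action_logits, dim=-1)
    return -log_probs.gather(1, gold_actions.unsqueeze(1)).sum()

# Toy usage: 4 decoding steps, 10 possible actions.
logits = torch.randn(4, 10)
gold = torch.tensor([2, 5, 0, 7])
print(action_sequence_loss(logits, gold))
```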

5. Experiment
5.1. Dataset
Our experiments employed the Spider and Spider-SYN datasets [25]. The Spider dataset is a large,
complex, and cross-domain semantic parsing and text-to-SQL dataset, consisting of nine classic
datasets like Scholar [26], WikiSQL, GeoQuery [27], etc. It contains 8,659 training samples, 1,034
development samples, and 2,147 test samples, spanning 138 domains across 200 complex
databases. The Spider-SYN dataset, derived from the Spider benchmark, is a manually curated
dataset in which the natural language questions from Spider are modified by replacing
schema-related words with synonyms chosen to reflect real-world paraphrases. It consists of
7,000 training samples and 1,034 development samples.
5.2. Assessment Metrics
Following the metrics of Yu et al. [4], we compute the Exact Match (EM) accuracy for all examples,
grouped by difficulty. This is done by dividing the predicted SQL and the actual SQL into distinct
subsets based on keywords, and then checking if the predicted set matches the actual one. EM
assesses whether the predicted SQL query matches the ground truth exactly. Like previous
work on Spider, these metrics do not consider the model's performance in generating values
within the SQL.
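The official Spider evaluation performs a detailed component-wise comparison; the sketch below only illustrates the idea of splitting queries into keyword-delimited clauses and comparing them as sets, and should not be mistaken for the benchmark's evaluator.

```python
import re

CLAUSE_KEYWORDS = ["select", "from", "where", "group by", "having", "order by", "limit"]

def clause_sets(sql: str):
    """Split a SQL string into {keyword: set of tokens} for a rough exact-match check."""
    sql = sql.lower()
    pattern = "(" + "|".join(re.escape(k) for k in CLAUSE_KEYWORDS) + ")"
    parts = [p.strip() for p in re.split(pattern, sql) if p.strip()]
    clauses, current = {}, None
    for part in parts:
        if part in CLAUSE_KEYWORDS:
            current = part
            clauses[current] = set()
        elif current is not None:
            clauses[current] |= set(part.replace(",", " ").split())
    return clauses

def exact_match(predicted: str, gold: str) -> bool:
    return clause_sets(predicted) == clause_sets(gold)

print(exact_match("SELECT name FROM singer ORDER BY age",
                  "select name from singer order by age"))   # True
```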
5.3. Model Configuration
We utilized stanza [28] for tokenization, word segmentation, part-of-speech tagging, and
dependency parsing. For training, we set the maximum input length to 1024, the maximum
number of generated nodes in the AST to 200, the batch size to 32, and the maximum training
steps to 40,000. The encoder and decoder had 6 layers with a dimension of 512 and 8 attention
heads. The vector dimensions for tables and columns were set to 512, and the embeddings for
node types and actions were of size 512. We employed Adafactor as the optimizer, with a
learning rate of 1e-4 and a dropout rate of 0.1.
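For reference, the hyperparameters listed above can be grouped into a single configuration object; this is purely organizational and simply mirrors the values stated in the text, with invented field names.

```python
from dataclasses import dataclass

@dataclass
class SRSQLConfig:
    max_input_length: int = 1024    # maximum encoder input length
    max_ast_nodes: int = 200        # maximum number of generated AST nodes
    batch_size: int = 32
    max_train_steps: int = 40_000
    num_layers: int = 6             # encoder and decoder layers
    hidden_size: int = 512          # model dimension; also table/column/node/action embeddings
    num_heads: int = 8
    optimizer: str = "adafactor"
    learning_rate: float = 1e-4
    dropout: float = 0.1

print(SRSQLConfig())
```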


5.4. Comparative Experiment


The EM accuracy of the model on the Spider dataset is presented in Table 1, where we initially
compared it with other advanced models. In Table 1, our proposed model demonstrates
competitive performance. Our method outperforms T5-3B+ PICARD [21], which heavily fine-
tunes a language model with a large number of parameters, by 2.6% on the test set, indicating
that our approach can still exhibit strong effectiveness with fewer parameters. Compared to
S²SQL [29], our model achieves a 2.4% absolute improvement in EM.

Table 1. EM results on Spider's test set and development set.


Method Dev Test
RAT-SQL + BERT [22] 69.7 65.6
ShadowGNN + RoBERTa[17] 72.3 66.1
SADGA + GAP[18] 73.1 70.1
LGESQL + ELECTRA[30] 75.1 72.0
RASAT+PICARD[31] 75.3 70.9
T5-3B + PICARD [21] 75.5 71.9
S²SQL + ELECTRA [29] 76.4 72.1
ours 77.2 74.5

Furthermore, we conducted a fine-grained analysis of accuracy based on the query difficulty


levels defined by Yu et al. [4] (easy, medium, hard, and very hard). In Table 2, we compare the
EM accuracy of our method against the latest baselines for these four query difficulty subsets.
As expected, the model's performance significantly deteriorates with increasing query
complexity, with the accuracy dropping from 92.2% for simple queries to 50.6% for very hard
queries. In terms of the most complex query types, SRSQL outperforms RAT-SQL [22] by 7.2%
and 7.7% in hard and very hard queries, respectively. This demonstrates the ability of our
Transformer-based SQL decoder to capture longer sequence contexts. Moreover, SRSQL
consistently outperforms the baseline across all four subsets, providing evidence for the
effectiveness of our approach.
Our model was also tested for EM accuracy on the Spider-SYN dataset, as shown in Table 3. The
performance of SRSQL is superior to all baseline models, indicating that our model maintains
good robustness when facing more flexible and complex problems.

Table 2. EM accuracy of Spider queries across different difficulty levels.


Method Easy Medium Hard Extra
RAT-SQL+BERT[22] 86.4 73.6 62.1 42.9
SADGA[18] 90.3 72.4 63.8 49.4
LGESQL[30] 91.5 76.7 66.7 48.8
GRAPHIX-T5-3B [32] 91.9 81.6 61.5 50.0
ours 92.2 82.5 69.3 50.6

Table 3. EM results on the development set of Spider-SYN.


Method EM accuracy (%)
IRNet[14] 28.4
RAT-SQL+BERT[22] 48.2
LGESQL+ELECTRA[30] 64.6
GRAPHIX-T5-3B[32] 66.9
Ours 67.5


5.5. Ablation Experiment


To better validate the effectiveness of each component in our model, we conducted a series of
ablation experiments on the development set of the Spider dataset. As shown in Table 4, we
tested the impact of four design changes: removing syntactic dependencies from the input,
removing part-of-speech tagging, removing the schema linking relations in the encoder, and
replacing the Transformer-based decoder with an LSTM-based decoder.

Table 4. Ablation study of EM accuracy on the development set for SRSQL (±95% confidence
interval)
Method EM accuracy (%)
SRSQL 77.2 ± 0.76
SRSQL w/o syntactic dependency 76.1 ± 0.73
SRSQL w/o Part-of-Speech tagging 76.3 ± 0.56
SRSQL w/o schema linking relations 73.8 ± 0.80
SRSQL encoder + LSTM-based decoder 72.9 ± 0.38

As shown in Table 4, the most significant impact among these designs came from the choice of
decoder. After switching to an LSTM-based decoder, the EM accuracy dropped from 77.2% to
72.9%, resulting in a 4.3% decrease in performance, highlighting the superiority of a
Transformer-based decoder. Second, the removal of schema linking relations had a considerable effect
on the model, with EM accuracy dropping by 3.4%. This is because the task of matching
questions to database schemas became more challenging, and previous research has already
confirmed the importance of this component for text-to-SQL parsing. Lastly, removing syntax
dependencies and part-of-speech tagging had a smaller effect on performance, with decreases
of 1.1% and 0.9% respectively.

6. Conclusion
In this paper, we present SRSQL, a syntax and relation-enhanced text-to-SQL parser that stands
out with its autoregressive SQL query prediction based on Transformers. By incorporating
relation-aware self-attention, SRSQL integrates schema linking relationships into its encoder.
The Transformer-based tree decoder, grounded in joint encoding, integrates node types and
prior actions to generate SQL queries during the learning process. Notably, SRSQL
demonstrates state-of-the-art performance on the Spider and Spider-SYN datasets. However,
our model still has some limitations, with this study primarily focusing on the grammatical
aspects of text-to-SQL conversion. Future work could explore incorporating large pre-trained
language models or leveraging techniques from large-scale prompt-based models.

References
[1] John M. Zelle and Raymond J. Mooney: Learning to parse database queries using inductive logic
programming. Proceedings of the thirteenth national conference on Artificial intelligence (Portland,
Oregon, 1996). Vol.2, p1050–1055.
[2] Catherine Finegan-Dollak, Jonathan K. Kummerfeld, Li Zhang, Karthik Ramanathan, Sesh Sadasivam,
Rui Zhang, and Dragomir Radev: Improving Text-to-SQL Evaluation Methodology. Proceedings of
the 56th Annual Meeting of the Association for Computational Linguistics (Melbourne, Australia,
July ,2018). Vol.1, p351–360.
[3] Victor Zhong, Caiming Xiong, and Richard Socher: Seq2SQL: Generating Structured Queries from
Natural Language using Reinforcement Learning.arXiv:1709.00103, 2017.


[4] Tao Yu, Rui Zhang, Kai Yang, Michihiro Yasunaga, Dongxu Wang, Zifan Li, James Ma, Irene Li,
Qingning Yao, Shanelle Roman, Zilin Zhang, and Dragomir Radev: Spider: A Large-Scale Human-
Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
(Brussels, Belgium, October-November ,2018), p3911–3921.
[5] Binyuan Hui, Ruiying Geng, Lihan Wang, Bowen Qin, Bowen Li, Jian Sun, and Yongbin Li: S²SQL:
Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers.
arXiv:2203.06958, 2022.
[6] Wonseok Hwang, Jinyeong Yim, Seunghyun Park, and Minjoon Seo: A Comprehensive Exploration
on WikiSQL with Table-Aware Word Contextualization. arXiv:1902.01069, 2019.
[7] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova: BERT: Pre-training of Deep
Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the
North American Chapter of the Association for Computational Linguistics: Human Language
Technologies (Minneapolis, Minnesota, June,2019).
[8] Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis,
Luke Zettlemoyer, and Veselin Stoyanov: RoBERTa: A Robustly Optimized BERT Pretraining
Approach. arXiv:1907.11692, 2019.
[9] Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz
Kaiser, and Illia Polosukhin: Attention Is All You Need. arXiv:1706.03762, 2017.
[10] Xiaojun Xu, Chang Liu, and Dawn Song: SQLNet: Generating Structured Queries From Natural
Language Without Reinforcement Learning.arXiv:1711.04436, 2017.
[11] Tao Yu, Zifan Li, Zilin Zhang, Rui Zhang, and Dragomir Radev: TypeSQL: Knowledge-based Type-
Aware Neural Text-to-SQL Generation.ArXiv:1804.09769, 2018.
[12] DongHyun Choi, Myeong Cheol Shin, EungGyun Kim, and Dong Ryeol Shin: RYANSQL: Recursively
Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases.ArXiv:
2004.03125,2020.
[13] Pengcheng He, Yi Mao, Kaushik Chakrabarti, and Weizhu Chen: X-SQL: reinforce schema
representation with context. ArXiv:1908.08113, 2019.
[14] Jiaqi Guo, Zecheng Zhan, Yan Gao, Yan Xiao,Jian-Guang Lou, Ting Liu, and Dongmei Zhang: Towards
Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation. ArXiv:1905.
08205, 2019.
[15] Xi Victoria Lin, Richard Socher, and Caiming Xiong: Bridging textual and tabular data for
crossdomain text-to-SQL semantic parsing. Findings of the Association for Computational
Linguistics: EMNLP 2020(Online, November,2020), p4870–4888.
[16] Ben Bogin, Matt Gardner, and Jonathan Berant: Global reasoning over database structures for text-
to-SQL parsing. Proceedings of the 2019 Conference on Empirical Methods in Natural Language
Processing and the 9th International Joint Conference on Natural Language Processing (Hong Kong,
China, November,2019), p3657–3662.
[17] Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu, and Kai Yu: Shadowgnn: Graph
projection neural network for text-to-sql parser. Proceedings of the 2021 Conference of the North
American Chapter of the Association for Computational Linguistics: Human Language
Technologies(Online, June,2021), p 5567–5577.
[18] Ruichu Cai, Jinjie Yuan, Boyan Xu, and Zhifeng Hao: SADGA: Structure-aware dual graph aggregation
network for text-to-sql. Advances in Neural Information Processing Systems (2021). Vol.34, p7664–
7676.
[19] Binyuan Hui, Xiang Shi, Ruiying Geng, Binhua Li, Yongbin Li, Jian Sun, Xiaodan Zhu: Improving Text-
to-SQL with Schema Dependency Learning. arXiv:2103.04399,2021.
[20] Peter Shaw, Ming-Wei Chang, Panupong Pasupat, and Kristina Toutanova: Compositional
generalization and natural language variation: Can a semantic parsing approach handle both? In
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the

11th International Joint Conference on Natural Language Processing (Online, August,2021). Vol.1,
p922–938, Online.
[21] Torsten Scholak, Nathan Schucher, and Dzmitry Bahdanau: PICARD: Parsing incrementally for
constrained auto-regressive decoding from language models. Proceedings of the 2021 Conference
on Empirical Methods in Natural Language Processing (Online and Punta Cana, Dominican
Republic.November,2021), p9895–9901.
[22] Bailin Wang, Richard Shin, Xiaodong Liu, Oleksandr Polozov, and Matthew Richardson: RAT-SQL:
Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers. Proceedings of the 58th
Annual Meeting of the Association for Computational Linguistics (Online, July,2020), p7567-7578.
[23] Peter Shaw, Jakob Uszkoreit, and Ashish Vaswani: Self-Attention with Relative Position
Representations. Proceedings of the 2018 Conference of the North American Chapter of the
Association for Computational Linguistics: Human Language Technologies (New Orleans, Louisiana,
June,2018). Vol.2, p464–468.
[24] Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua
Bengio: Graph Attention Networks. arXiv:1710.10903, 2018.
[25] Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver, John R. Woodward, Jinxia Xie, and
Pengsheng Huang: Towards robustness of text-to-SQL models against synonym substitution.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the
11th International Joint Conference on Natural Language Processing (Online, August,2021).
Vol.1,p2505-2515.
[26] Srinivasan Iyer, Ioannis Konstas, Alvin Cheung, Jayant Krishnamurthy, and Luke Zettlemoyer:
Learning a neural semantic parser from user feedback. Proceedings of the 55th Annual Meeting of
the Association for Computational Linguistics (Vancouver, Canada, July,2017). Vol .1, p963–973.
[27] John M. Zelle and Raymond J. Mooney: Learning to parse database queries using inductive logic
programming. Proceedings of the Thirteenth National Conference on Artificial Intelligence (1996).
Vol. 2, p1050–1055.
[28] Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason Bolton, and Christopher D. Manning: Stanza: A Python
Natural Language Processing Toolkit for Many Human Languages. Proceedings of the 58th Annual
Meeting of the Association for Computational Linguistics: System Demonstrations (Online,
July,2020), p101–108.
[29] Binyuan Hui, Ruiying Geng, Lihan Wang, Bowen Qin, Bowen Li, Jian Sun, and Yongbin Li: S²SQL:
Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers.
arXiv:2203.06958, 2022.
[30] Ruisheng Cao, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu, and Kai Yu.: LGESQL: Line Graph Enhanced
Text-to-SQL Model with Mixed Local and Non-Local Relations. ArXiv:2106.01093,2021.
[31] Jiexing Qi, Jingyao Tang, Ziwei He, Xiangpeng Wan, Yu Cheng, Chenghu Zhou, Xinbing Wang, Quanshi
Zhang, and Zhouhan Lin: RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model
for Text-to-SQL. arXiv:2205.06983,2022.
[32] Jinyang Li, Binyuan Hui, Reynold Cheng, Bowen Qin, Chenhao Ma, Nan Huo, Fei Huang, Wenyu Du,
Luo Si, and Yongbin Li: Graphix-t5: Mixing pretrained transformers with graph-aware layers for
text-to-sql parsing. arXiv:2301.07507,2023.
