
Journal of Management Research and Analysis 2023;10(1):18–20

Content available at: https://www.ipinnovative.com/open-access-journals

Journal of Management Research and Analysis

Journal homepage: https://www.jmra.in/

Review Article
Exploring the capabilities and limitations of GPT and Chat GPT in natural language processing
Nimit Jagdishbhai1, Krishna Yatin Thakkar1,*
1 Dept. of Management, Christ Institute of Management, Rajkot, Gujarat, India

ARTICLE INFO

Article history:
Received 15-02-2023
Accepted 13-03-2023
Available online 12-04-2023

Keywords:
Natural Language Processing
Generative Pretrained Transformer
ChatGPT
Architecture
Training processes
Evaluation metrics
Solutions

ABSTRACT

Natural Language Processing (NLP) has seen tremendous advancements with the development of Generative Pretrained Transformer (GPT) models and their conversational variant, ChatGPT. These language models have been shown to generate contextually appropriate and coherent responses to natural language prompts, making them highly useful for various NLP applications. However, there are still limitations to their performance, and understanding these limitations is crucial for their effective utilization. This paper presents a comprehensive analysis of the capabilities and limitations of GPT and ChatGPT, covering their architecture, training processes, and evaluation metrics. The study also evaluates the performance of these models on various NLP tasks, including language translation, question-answering, and text summarization. The results reveal that while these models excel in certain tasks, they still face challenges in understanding context, generating diverse responses, and handling rare or out-of-domain inputs. The study concludes by discussing potential solutions and future research directions for improving the performance of GPT and ChatGPT in NLP applications.

This is an Open Access (OA) journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.

For reprints contact: [email protected]

*Corresponding author. E-mail address: [email protected] (K. Y. Thakkar).

https://doi.org/10.18231/j.jmra.2023.004
2394-2762/© 2023 Innovative Publication, All rights reserved.

1. Introduction

GPT (Generative Pre-trained Transformer) and ChatGPT (a variant of GPT designed for chatbot applications) are large language models developed by OpenAI. Here are some details about their architecture, training processes, and evaluation metrics:

1. Architecture: GPT and ChatGPT use a transformer architecture, which is a type of neural network that is particularly good at processing sequential data, such as text. The transformer architecture consists of a series of transformer blocks, each of which includes a self-attention mechanism and feedforward neural network layers. This allows the model to effectively learn the relationships between words in a sentence and to generate coherent, natural-sounding text.
2. Training Processes: GPT and ChatGPT are trained using unsupervised learning on a large corpus of text data. During training, the model is presented with sequences of text and is trained to predict the next word in the sequence. This process is known as language modeling. The model is also fine-tuned on specific tasks such as question-answering or language translation using supervised learning techniques.
3. Evaluation Metrics: The performance of GPT and ChatGPT is typically evaluated using several metrics. One important metric is perplexity, which measures how well the model is able to predict the next word in a sequence. A lower perplexity score indicates better performance (a short worked example follows this list). Additionally, human evaluations are often used to evaluate the quality of the text generated by the model. These evaluations may involve asking humans to rate the coherence, fluency, and overall quality of the generated text. 1–6
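To make the perplexity metric described above concrete, the short sketch below computes it from a model's predicted probabilities for the observed next tokens. This is a minimal illustration with made-up probabilities rather than output from an actual GPT model; in practice the probabilities would come from the model's softmax over its vocabulary.

```python
import math

def perplexity(token_probs):
    """Perplexity is the exponential of the average negative
    log-likelihood the model assigns to each observed next token."""
    nll = [-math.log(p) for p in token_probs]   # per-token "surprise"
    return math.exp(sum(nll) / len(nll))

# Hypothetical probabilities a model assigned to the actual next
# token at four positions of a short sequence.
confident_model = [0.40, 0.35, 0.50, 0.30]
uncertain_model = [0.05, 0.02, 0.10, 0.04]

print(round(perplexity(confident_model), 1))   # ~2.6  (lower is better)
print(round(perplexity(uncertain_model), 1))   # ~22.4 (the model is more "surprised")
```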

2. Pros of ChatGPT

1. Availability: ChatGPT is available 24/7, providing immediate access to information and assistance.
2. Fast and efficient: ChatGPT can process information quickly and provide responses in a matter of seconds, making it a fast and efficient way to obtain information.
3. No human bias: ChatGPT is an artificial intelligence model and does not have any inherent biases that a human expert may have.
4. Multilingual: ChatGPT can communicate in various languages, making it accessible to a wider range of users.

3. Cons of ChatGPT

1. Limited knowledge: ChatGPT’s knowledge is limited to the data it has been trained on, and it may not have access to the most up-to-date or comprehensive information.
2. Lack of empathy: ChatGPT does not have the emotional intelligence or empathy that a human expert may possess, making it less effective in dealing with emotional or sensitive issues.
3. Inability to understand context: ChatGPT may struggle to understand the context of a question or situation, which can lead to inaccurate or irrelevant responses.
4. Risk of misinformation: ChatGPT may provide inaccurate or incomplete information, especially if it has been trained on biased or unreliable data. It is important to verify information obtained from ChatGPT with other sources.
GPT and ChatGPT have demonstrated impressive performance on a wide range of natural language processing (NLP) tasks, but there are still some limitations and opportunities for improvement. Here are some potential solutions and future research directions for improving the performance of GPT and ChatGPT in NLP applications: 7–10

3.1. Better handling of long-range dependencies

The transformer architecture is well suited to processing sequential data, but it can struggle with the long-range dependencies that occur in certain types of text, such as scientific papers or legal documents. One potential solution is to use hierarchical models that can process information at different levels of granularity, such as paragraphs, sections, or documents. Another approach is to incorporate external knowledge, such as ontologies or knowledge graphs, to help the model understand the context of the text.
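As a rough sketch of the hierarchical idea described above, the snippet below splits a long document into paragraph-level chunks, processes each chunk independently, and then combines the chunk-level results at a higher level. The `encode_chunk` function is a placeholder for whatever model call would actually be used; the chunking logic is the point of the example.

```python
def split_into_chunks(document, max_words=200):
    """Group paragraphs into chunks small enough to fit a model's
    context window, without splitting any paragraph in half."""
    chunks, current = [], []
    for paragraph in document.split("\n\n"):
        words = paragraph.split()
        if current and len(current) + len(words) > max_words:
            chunks.append(" ".join(current))
            current = []
        current.extend(words)
    if current:
        chunks.append(" ".join(current))
    return chunks

def encode_chunk(chunk):
    """Placeholder for a per-chunk model call (summary or embedding);
    here it simply keeps the chunk's first sentence."""
    return chunk.split(".")[0].strip() + "."

def hierarchical_summary(document):
    # Lower level: handle each chunk independently, so no single model
    # call has to span the entire document.
    chunk_summaries = [encode_chunk(c) for c in split_into_chunks(document)]
    # Higher level: combine the per-chunk results into a document-level view.
    return " ".join(chunk_summaries)
```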

3.2. Incorporation of multimodal data

While GPT and ChatGPT have primarily been used for processing textual data, there is growing interest in incorporating other types of data, such as images, audio, or video. One approach is to use multimodal models that can learn representations of different types of data and integrate them into a unified framework. Another approach is to use pre-training techniques that can leverage large amounts of unlabeled data across multiple modalities.

3.3. Better handling of rare or out-of-vocabulary words

GPT and ChatGPT models rely on a fixed vocabulary of words, and may struggle with rare or out-of-vocabulary words. One potential solution is to use subword or character-level representations that can capture more fine-grained information about the morphology of words. Another approach is to use techniques such as dynamic vocabulary expansion or knowledge distillation to handle rare or unseen words.
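A toy illustration of the subword idea: the tokenizer below greedily segments an unknown word into the longest subword pieces it knows, falling back to single characters, so rare words still receive a usable representation instead of a single unknown token. The vocabulary here is invented for the example; real systems learn their merges (e.g., byte-pair encoding) from data.

```python
def subword_tokenize(word, vocab):
    """Greedy longest-prefix-first segmentation into known subwords.
    Single characters are assumed to be in the vocabulary."""
    pieces, start = [], 0
    while start < len(word):
        end = len(word)
        while end > start and word[start:end] not in vocab:
            end -= 1                  # shrink the candidate piece
        if end == start:              # not even the single character is known
            pieces.append("<unk>")
            end = start + 1
        else:
            pieces.append(word[start:end])
        start = end
    return pieces

# Toy vocabulary: a few common subwords plus individual characters.
vocab = {"token", "ization", "un", "believ", "able"} | set("abcdefghijklmnopqrstuvwxyz")

print(subword_tokenize("tokenization", vocab))   # ['token', 'ization']
print(subword_tokenize("unbelievable", vocab))   # ['un', 'believ', 'able']
print(subword_tokenize("qwertyz", vocab))        # falls back to single characters
```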
3.4. Development of more efficient and scalable training algorithms

GPT and ChatGPT models are extremely large and require significant computational resources to train. One potential solution is to use more efficient training algorithms, such as those based on sparse attention or adaptive computation. Another approach is to develop distributed training techniques that can spread the computational load across multiple devices or clusters.
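To give a feel for the sparse-attention idea mentioned above, the NumPy sketch below restricts each position to attend only to a small window of neighbouring positions, so the per-layer cost grows roughly with n·w instead of n² for sequence length n and window size w. This is a simplified, assumed formulation for illustration, not the exact scheme used by any particular model.

```python
import numpy as np

def sliding_window_attention(q, k, v, window=2):
    """Each query position attends only to keys within `window`
    positions on either side, instead of the whole sequence."""
    n, d = q.shape
    out = np.zeros_like(v)
    for i in range(n):
        lo, hi = max(0, i - window), min(n, i + window + 1)
        scores = q[i] @ k[lo:hi].T / np.sqrt(d)   # scores only over the local window
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()                  # softmax over the window
        out[i] = weights @ v[lo:hi]
    return out

# Toy example: a sequence of 8 positions with 4-dimensional vectors.
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((8, 4)) for _ in range(3))
print(sliding_window_attention(q, k, v, window=2).shape)   # (8, 4)
```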
3.5. Exploration of novel evaluation metrics

While perplexity and human evaluations are commonly used to evaluate the performance of GPT and ChatGPT models, there may be other metrics that are better suited to specific NLP applications. For example, for text generation tasks, metrics such as diversity, novelty, or coherence may be more informative than perplexity. Developing new evaluation metrics that are more closely aligned with the goals of specific NLP applications could help to improve the overall performance of GPT and ChatGPT models.
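One concrete diversity metric of the kind suggested above is "distinct-n": the fraction of n-grams across a set of generated responses that are unique. The minimal implementation below assumes whitespace tokenization and is meant only as a sketch; higher values indicate more varied output.

```python
def distinct_n(texts, n=2):
    """Ratio of unique n-grams to total n-grams across generated texts.
    Values near 1.0 indicate diverse output; values near 0.0 indicate repetition."""
    total, unique = 0, set()
    for text in texts:
        tokens = text.lower().split()
        ngrams = list(zip(*(tokens[i:] for i in range(n))))
        total += len(ngrams)
        unique.update(ngrams)
    return len(unique) / total if total else 0.0

responses = [
    "the weather is nice today",
    "the weather is nice today",        # an exact repeat lowers diversity
    "it looks like rain this evening",
]
print(distinct_n(responses, n=1))   # unigram diversity
print(distinct_n(responses, n=2))   # bigram diversity
```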

4. Conclusion

In summary, GPT and ChatGPT are large language models that use a transformer architecture and are trained using unsupervised learning on a large corpus of text data. The performance of these models is typically evaluated using metrics such as perplexity and human evaluations of the quality of the generated text. Overall, GPT and ChatGPT have already achieved impressive performance on a wide range of NLP tasks, but there is still significant room for improvement. Continued research and development in these areas will likely lead to further improvements in the performance and applicability of these models.

5. Source of Funding

None.

6. Conflict of Interest

None.

References

1. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN. Attention is all you need. Adv Neural Inf Proc Syst. 2017;5:5998–6008.
2. Radford A, Narasimhan K, Salimans T, Sutskever I. Improving language understanding by generative pre-training. 2018; Available from: https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding.
3. Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I. Language models are unsupervised multitask learners. OpenAI Blog. 2019;1(8):1–24.
4. Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P. Language models are few-shot learners; 2020.
5. Chen TQ, Lu Y, Chen Y, Du X. Generative Pretraining From Pixels. Proc Mach Learn Res. 2020;119:1691–703.
6. Radford A, Mikolov T. Improving language understanding by generative pre-training; 2018. Available from: https://s3-us-west-2.amazonaws.com/openai-assets/researchcovers/languageunsupervised/languageunderstandingpaper.pdf.
7. Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P. Language models are few-shot learners; 2020.
8. Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D. RoBERTa: A Robustly Optimized BERT Pretraining Approach. Comp Language. 2019; Available from: https://arxiv.org/abs/1907.11692.
9. Lewis M, Liu Y, Goyal N, Ghazvininejad M, Mohamed A, Levy O, et al. BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. 2020; Available from: https://arxiv.org/abs/1910.13461.
10. Raffel C, Shazeer N, Roberts A, Lee K, Narang S, Matena M. Exploring the limits of transfer learning with a unified text-to-text transformer. J Mach Learn Res. 2019;1(6):1–67.

Author biography

Nimit Jagdishbhai, Assistant Professor

Krishna Yatin Thakkar, Assistant Professor

Cite this article: Jagdishbhai N, Thakkar KY. Exploring the capabilities and limitations of GPT and Chat GPT in natural language processing. J Manag Res Anal 2023;10(1):18-20.
