ITT
Dear Contributors, please take a moment to review the following customer feedback for the
Image-to-Text (ITT) project! The main takeaway is that prompts must appropriately
challenge the model to reason through the question to get to the answer! In other words,
the answer should NOT be obvious! As always, prompts should draw upon real-world
scenarios where you might ask an AI for guidance, using simple, direct, and natural-
sounding language.
Please do NOT attempt tasking any further before reading this feedback!
Here’s an example of a too-simple prompt that was submitted in the Scene Understanding
competency:
Prompt Example
While this prompt is appropriate for the Scene Understanding competency in that it asks about
the position of the car relative to the garage door, the answer is almost immediately evident just
by looking at the image 👎. Please avoid overly simple prompts like this, particularly in the Scene
Understanding and Counting competencies! This kind of prompt neither challenges the model nor
helps it improve.
Here are some principles that you might consider incorporating into prompts for the Scene
Understanding competency:
● Hierarchical Spatial Reasoning: E.g., “Can you tell which objects are on the floor, which
are on furniture, and which are stacked on each other?”
● Occlusion and Depth Reasoning: E.g., “Which building looks closest to the viewer, and
which looks farthest away?”
● Relational Layout with Directional Anchoring: E.g., “Using the tree in the middle as a
reference, where are the animals located around it: left, right, behind, or in front?”
● Symmetry/Alignment Detection: E.g., “Are the benches lined up evenly with the
fountain? If not, how are they positioned differently?”
● Navigation and Pathfinding Reasoning: E.g., “If someone walks from the red door to
the green gate, what’s the best clear path they can take? Are there any obstacles?”
● Object Orientation and Facing Direction: E.g., “Who’s facing the camera, who’s turned
sideways, and who has their back to us?”
● Nested Spatial Structures: E.g., “What items are inside other items in this image, like a
spoon in a cup or a cup in a cabinet? Can you describe the full nesting?”
● Motion & Temporal Spatial Inference: E.g., “Looking at the positions of the child and the
ball, who’s likely to reach it first?”
(Note that while the prompt example above does test the model’s ability to understand the
relational layout of the image, it only asks about two objects relative to each other. A better
prompt might ask about the relational layout of multiple objects in the image, relative to a
reference point.)
Prompts should draw upon real-world scenarios where you might ask an AI for guidance, using
simple, direct, and natural-sounding language. While it is sometimes necessary to use more
technical language for clarity, prompts should generally sound as if someone in the real world is
asking an AI model about a problem they are encountering and how to solve it.
In the same vein, many Attempters have relied heavily on formatting requests to fulfill
Complexity, resulting in unnatural-sounding prompts. You may still include formatting requests
in your prompts where they make sense, but do NOT include them simply to fulfill Complexity.
The Complexity reviewer measure will be updated soon to reflect that prompts should contain
sufficient complexity without relying on formatting requests.
Make sure your prompts are specific and precise in what you are asking for! Vague
prompts can be interpreted in many ways, leading to responses that may be off-target and thus
difficult to rate. Be as specific as possible about the kind of information that you are
seeking in the response!
4. No Unattainable Requests! 🙅
Prompts should be answerable based on information that is visually provided by the image.
Please do not submit prompts that cannot be answered by information contained in the
image. This includes requests whose correct answer is not among the options shown in an image.
The following “unattainable” prompt was submitted in the Patterns competency. However, it is
not possible to identify a correct answer based on the given options:
Prompt Example
The correct answer is not found in the second image: the actual answer is that the white horse
inside the white diamond is the odd knight.
Additionally, the prompt does not provide any explicit instructions on what the model should do if
none of the options are correct. This lack of clear guidance may mislead the model when the
answer is unattainable.