0% found this document useful (0 votes)

217 views11 pages

?️_?️ Vision SFT Handbook

The Vision SFT Handbook provides an overview of a project aimed at enhancing chatbot capabilities in processing images through prompt and response creation. It includes detailed workflows for both attempters and reviewers, guidelines for task-specific variables, and emphasizes the importance of localization and conciseness in responses. The document is regularly updated with new information and examples to improve task quality.

Uploaded by

hackdovux

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

217 views11 pages

?️_?️ Vision SFT Handbook

Uploaded by

hackdovux

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

👁️‍🗨️ Vision SFT Handbook

Everything you need to know for high-quality tasks.

Welcome to Vision SFT!

This document will give you:

● An overview of the project
● Step-by-step instructions
● Guidelines for creating high-quality tasks
● A ton of useful examples
● A cheat sheet for while you task!

🆕 Change Log: Please review every day!

Here’s what’s new! This is a living document, it will get updated as we learn more and
hear more about questions you have.

Date What got updated?

April 10 ● Added more info on Review & Edit steps in

the Reviewers: Rating Criteria page
● Updated the example for Scene Texts based
on updated info on copyright.
● Updated clarification on Infographics Help
Type
● Added helpful details to the Chatbot help
type instructions.

April 11 Clarified Grounding requirements in the Attempter

Workflow and the Prompt Writing Tips
Project Overview:

In this project we are helping develop a chatbot’s ability to process and understand
images. We’ll do this by writing prompts referring to a picture, then create strong
responses that incorporate details from those images.

Contributors will help write and edit prompts and responses and compare human- vs
generated- response quality.

This data will help our chatbots give us better answers when using images as
references across a variety of use cases and help types.

The Goals 🥇
1. 🌉 Help the model learn to write amazing prompts and responses with an image for
reference.
2. 📈 Write responses that are as good as or better than state-of-the-art chatbot
responses
3. ⏳ Give quick, complete, and useful answers to users.
a. A "complete" answer that includes every possible detail is less important
than a concise, relevant response that directly addresses the user's need.

Task Steps:
Attempter Workflow (L-1):

📖 🖼️ ✍️ ✍️
Read Variables Find an Image Write Prompt(s) Write Response(s)

● Goal: As the first person to attempt this task, your goal is to find an approved
image and create a fantastic, lifelike prompt given your task-specific variables,
then draft a response to go with it that is equally as vibrant and engaging. Your
aim is to write an even better response than a chatbot could.
● Steps:
a. Review the task-specific instructions for the variables you need to
incorporate, specifically:
i. Help type
ii. Image type
iii. Text Load
iv. Text Language
v. Locale
b. Review the prompt instructions for the task to ensure you write the
prompt with all the necessary considerations
c. Find an image that meets the requirements and attach it to the prompt
entry box
d. Write an excellent prompt that references the image and asks for the
relevant help type. The model shouldn’t be able to answer your prompt
without knowing what’s in the image. DO NOT describe your image in
the prompt itself, that would defeat the purpose of testing if the Bot
can understand the picture.

FOR THE PROMPT-ONLY PROJECT, THIS IS THE FINAL STEP. FOR
THE PROMPT+RESPONSE PROJECTS, KEEP GOING.

e. Write a tailored, high-quality response to your own prompt that fulfills
the needs of the user.
i. If you are performing Multiturn Tasks, an additional Prompt box
will appear after your response box. You do NOT need to choose a
new photo– continue the conversation based on the original photo.
Continue writing prompt/response pairs until you’ve reached the
minimum number of turns, or have come to a natural close of the
conversation, whichever is longer, with a maximum of 6 turns.
ii. It can take the rubric a few seconds to load the next Prompt box
or the Submit Task button after the last Response. When you’re
done writing pairs, select End Session
f. Submit the task

Reviewer Workflow (L0):

📖 📖 ✍️ 📖 ✍️ 🎚️ ⚖️ ✔️ 💯
Read Variables Review the image Edit if Review the Edit if Rate Choose the Best Log Your Give
& prompt(s) necessary Response(s) necessary Responses Answer(s) Edits Feedback

● Goal: Quality check the prompt/response pair from each the Attempter and a
Chatbot, and provide quality updates. You’ll rank the responses side-by-side
and tell us which response is better.
● Steps:
a. Review the task-specific instructions and variables to ensure you have a
good understanding of what the prompt and response need to include
b. Review the prompt(s) potentially adding edits to improve them
■ FOR THE PROMPT-ONLY PROJECT, THIS IS THE ONLY PIECE
YOU WILL REVIEW – THERE WILL BE NO RESPONSE OR
RATINGS.
c. Review the response(s), potentially adding edits to improve them
d. Grade each response on specific quality criteria, such as image
understanding, instruction following, etc.
■ If you are performing Multiturn tasks, click on each turn response
to open up the rubric and preference ranking for that turn. All turns
need to be edited/ rated.
e. Provide a preference rating for which response is better and a
justification.
■ On multiturn selection, you’ll give a preference rating for EACH
answer.
f. Notate how much editing you needed to do on the prompt/response, and
give an overall quality grade for the task.
g. Give Feedback to the Attempter, if needed. Grade the quality of the task
with a justification and leave inline feedback wherever it’s needed.
h. Submit the task (“Approve with changes”)

Task-Specific Variables

Each task has five(5) variables that need to be considered when writing and reviewing
the prompt and responses.

Locale Help Type Image Type Text Load Text Language

🌎 🤝 📷 📚 🗣️
● 🌎 Locale
○ Where the prompt and response should be localized. Localization is
critical to these tasks, so you should use references to regionally-specific
customs, places, slang, nomenclature, etc.
■ For example, Mexican Spanish, Chilean Spanish, and Spain
Spanish will all have slightly different flavors and references that
are specific to their area, and can’t be completely answered by
simply translating or understanding a generic Spanish response.
● 🤝 Help type
○ What the user asking the chatbot for. Also known as a “use case,” this
helps define what the user’s ultimate goal is.

Click here to learn about Help Types and Image Examples

● 📷 Image type:
○ What type of picture you (as the user) are using for reference with the
chatbot. Each Image Type is a broad category, so get creative with what
you search for within each type!

Click here to learn about Image Types

● 📚 Text Load:
○ Each Image Type is also associated with a Text Load – how much text
should be in the image. For image types that are Text-Heavy, you’ll be
given a Text Language.
● 🗣️ Text Language
○ If your image is Text-Heavy, the language it is in will matter. Make sure to
pay attention to the text language that needs to be in your photo!
Prompt Creation Guidelines:
When crafting prompts, always adhere to the intended Help Type. This means
understanding what the user expects from the model.

● For instance, in a Creative Writing scenario, the user seeks an original text
output, regardless of its perceived "creativity" or "artistic" merit.
● In Extraction, the user needs a rapid answer derived from a provided source
text, without having to read it entirely.
● For Chatbot use cases, the user aims to engage in interactive experiences, such
as role-playing, game-playing, or general entertainment-focused conversations.

USE CASES CAN OVERLAP!

A task suitable for Extraction may also be appropriate for Closed Q&A. Similarly,
Chatbot or Brainstorming tasks can function as Open Q&A. Focus on creating tasks
that fit the primary use case, rather than trying to pick the only "correct" use case.

Specific things to watch out for:

Faces and people tasks should not directly ask to identify a figure.

These are:

1. Do NOT ask direct people identification, such as: who is this? + [a photo of a
person], What is this person known for? + [a photo of a person] even for
public/historical figures.
2. Do NOT ask resemblance questions, such as: which famous soccer player does
this person look like? + [a photo of a person]
3. Do NOT ask inference of protected status from images, such as: What is the
sexual orientation of the person from the image? + [a photo of a person]

However, if there are clear identifications of the person, you are allowed to ask
questions about it.
Examples are:

1. “What’s the sexual orientation of this person” + [an photo of Alan uring with
text “Alan uring” besides him]
2. “what’s the sexual orientation of this person” + [an image with only text “Alan
uring” on it].
3. “This is a picture of Barack Obama. What were his greatest accomplishments?”

Chatbot tasks should not be scripted conversations unless explicitly requested. The
interaction should resemble a basic conversation where the prompt represents one
side, and the model represents the other.

Good Example:

● Prompt:

Unset
"I’ve often wondered why the sea is so blue around the Amalfi
Coast. Please answer in English in the voice of Jacques
Cousteau."

● Response:

Unset
"The blue sea of Amalfi is famous, and one of my favorites to go
diving in! It is so blue because the water is so clear and
because of the brightness of the sun."

Bad Example:

● Prompt:
Unset
"I’ve often wondered why the sea is so blue around the Amalfi
Coast. Please answer in English in the voice of Jacques
Cousteau."

● Response: (rewrite these if you see them)

Unset

- Jacques: Oh yes this is a great question, let’s

discuss!

- You: Yes Jacques it’s a pleasure to be here.

- Jacques: Well you see the snorkeling makes the sea

bluer.

- You: I never knew that.

Localization Highlights:

Localization is VERY IMPORTANT.

Tasks must be tailored to your specific language and country. This includes:

● Using local spellings (e.g., "favour" and "summarise" in en_GB and

Commonwealth countries, avoiding "ß" in Swiss German).
● Employing local word choices (e.g., "robots" instead of "traffic lights" in South
Africa, "mobile phone" vs. "cell phone" in Commonwealth countries).
● Focusing on local topics, such as:
○ Local customs, traditions, events, and personalities.
○ Local locations, geography, and attractions.
○ Local regulations, concerns, and government matters.
○ Local business incentives, training programs, and resources.
○ Any other topics of specific local interest or need.

Avoid tasks that can be answered by simply translating an English response.

Do not include tasks about global celebrities (unless they have a local connection, such
as Bono's house in Dublin). Steer clear of universally common topics like Hollywood
movies, major video games, or widely available car models. Do not create tasks about
locales other than your own (e.g., do not write about Hong Kong if you are in
Singapore).

Response Creation Guidelines:

Conciseness is essential!

Responses should:

● Lead with the answer, using the pyramid principle to provide immediate value.
● Avoid meaningless pleasantries like "Of course!" or "Sure, I’d be happy to help!"
(except in Chatbot use cases where conversational politeness is expected).
● Eliminate unhelpful repetition, such as restating the prompt or summarizing the
response at the end.

There can be exceptions to these guidelines. These include:

● Prompts requesting a 1-week itinerary or workout plan, which naturally require

seven days.
● Prompts that explicitly ask for more than six bullet points or a long text
response (e.g., Creative Writing tasks like "write me a story" or "write a
persuasive essay").
● Classification text including more than 6 entries.
● Source reference text for Rewriting is in a format with 7 or more bullet points.

The 64 Ways Personal Contemplations On The Gene Keys (Richard Rudd) (Z-Library)
97% (64)
The 64 Ways Personal Contemplations On The Gene Keys (Richard Rudd) (Z-Library)
634 pages
The Art Of Prompt Engineering With Chatgpt A Hands-on Guide Pdf Download
No ratings yet
The Art Of Prompt Engineering With Chatgpt A Hands-on Guide Pdf Download
4 pages
CSP 621 - Mindsets and Behaviors Action Plan 1
No ratings yet
CSP 621 - Mindsets and Behaviors Action Plan 1
3 pages
?️_?️ Vision SFT Handbook 2
No ratings yet
?️_?️ Vision SFT Handbook 2
14 pages
? Flamingo WFE
No ratings yet
? Flamingo WFE
18 pages
ChatGPT Prompt
No ratings yet
ChatGPT Prompt
5 pages
Multiverse Course
No ratings yet
Multiverse Course
26 pages
Edu Given Prompt Evals - Attempter Instruction
No ratings yet
Edu Given Prompt Evals - Attempter Instruction
24 pages
[Cbt] Multimodal RLHF Tasking Specifications
No ratings yet
[Cbt] Multimodal RLHF Tasking Specifications
27 pages
The Snake Eyes Project Tasking Handbook
No ratings yet
The Snake Eyes Project Tasking Handbook
27 pages
02-Intro to Prompt Design - Part 2 SLIDES (2)
No ratings yet
02-Intro to Prompt Design - Part 2 SLIDES (2)
16 pages
Boot Camp Digital ChatGPT Prompt Optimization Cheat Sheet
No ratings yet
Boot Camp Digital ChatGPT Prompt Optimization Cheat Sheet
6 pages
ChatGPT Prompt200+ ChatGPT Prompts To Explore
75% (4)
ChatGPT Prompt200+ ChatGPT Prompts To Explore
57 pages
Cohere_ Ideal Model Behavior
No ratings yet
Cohere_ Ideal Model Behavior
43 pages
dolphin Genesis Image-to-Text
No ratings yet
dolphin Genesis Image-to-Text
48 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
25 pages
Prompt Eng
No ratings yet
Prompt Eng
22 pages
01-merged
No ratings yet
01-merged
15 pages
# ChatGPT Prompt Mastery_20250519_204724_0000
No ratings yet
# ChatGPT Prompt Mastery_20250519_204724_0000
44 pages
Exposition_Wholesale
No ratings yet
Exposition_Wholesale
5 pages
ChatGPT User Guide
No ratings yet
ChatGPT User Guide
9 pages
ChatGPT Course PDF
No ratings yet
ChatGPT Course PDF
161 pages
Everything I'll Forget About Prompting LLMs
No ratings yet
Everything I'll Forget About Prompting LLMs
36 pages
ChatGPT3 Free Prompt List
No ratings yet
ChatGPT3 Free Prompt List
4 pages
Instructions _ Winter Wonderland RLHF
No ratings yet
Instructions _ Winter Wonderland RLHF
31 pages
Instructions _ Winter Wonderland RLHF
No ratings yet
Instructions _ Winter Wonderland RLHF
46 pages
Generative AI Testing Intern Assignment
No ratings yet
Generative AI Testing Intern Assignment
5 pages
Ebook No. 1 - Introduction To ChatGPT
100% (4)
Ebook No. 1 - Introduction To ChatGPT
52 pages
Goldfish Crackers
No ratings yet
Goldfish Crackers
9 pages
Master Prompt engineering Like Pro
No ratings yet
Master Prompt engineering Like Pro
31 pages
Instructions _ Winter Wonderland RLH
No ratings yet
Instructions _ Winter Wonderland RLH
50 pages
ChatGPT User Guide
100% (1)
ChatGPT User Guide
12 pages
Introduction to AI Prompt Hub
No ratings yet
Introduction to AI Prompt Hub
16 pages
Learn to use AI Prompt mechanics for high-quality output
No ratings yet
Learn to use AI Prompt mechanics for high-quality output
6 pages
AI Chatbot - Response Comparisons v2.2S
No ratings yet
AI Chatbot - Response Comparisons v2.2S
9 pages
GPT Prompt Engineering Handbook: Ernest Simon
75% (4)
GPT Prompt Engineering Handbook: Ernest Simon
22 pages
Chatgpt Prompt Engineering
50% (2)
Chatgpt Prompt Engineering
12 pages
10 ChatPT prompts
No ratings yet
10 ChatPT prompts
14 pages
Prompt Eng Techniques
100% (2)
Prompt Eng Techniques
17 pages
My book _20250520_164912_0000
No ratings yet
My book _20250520_164912_0000
39 pages
LLM SFT Data Guideline v2.0
No ratings yet
LLM SFT Data Guideline v2.0
13 pages
Chatbots
No ratings yet
Chatbots
15 pages
Int344 Nlp Ete Unit 6 QnA Building Models Chatbot
No ratings yet
Int344 Nlp Ete Unit 6 QnA Building Models Chatbot
10 pages
【R107】gptm-prompts【Raffaele Gaito】
No ratings yet
【R107】gptm-prompts【Raffaele Gaito】
2 pages
Cheat_Sheet_Beginners_Guide_to_ChatGPT
No ratings yet
Cheat_Sheet_Beginners_Guide_to_ChatGPT
1 page
unit2 (1)
No ratings yet
unit2 (1)
22 pages
Chatgpt+Prompting+Guide+ +Smfs+Course
No ratings yet
Chatgpt+Prompting+Guide+ +Smfs+Course
16 pages
Chatgpt Slides
100% (1)
Chatgpt Slides
112 pages
ChatGPT Cheat Sheet PDF
100% (1)
ChatGPT Cheat Sheet PDF
1 page
ChatGPT Accelerator Challenge #1 Beginner Level Prompts (1)
No ratings yet
ChatGPT Accelerator Challenge #1 Beginner Level Prompts (1)
4 pages
Effective Chatbots Using Machine Learning and Natural Language Processing
No ratings yet
Effective Chatbots Using Machine Learning and Natural Language Processing
10 pages
Image+Multi-turn+Conversational+QA+Correction 1 (1)
No ratings yet
Image+Multi-turn+Conversational+QA+Correction 1 (1)
7 pages
Chatgpt Guide
No ratings yet
Chatgpt Guide
56 pages
ChatGPT_Prompt_Guide
No ratings yet
ChatGPT_Prompt_Guide
3 pages
ChatGPT Prompts 1
No ratings yet
ChatGPT Prompts 1
41 pages
Unit-5 Suthanthira Devi
No ratings yet
Unit-5 Suthanthira Devi
155 pages
ChatGPT Mastery: Integrating AI into Your Workflow for Advanced Users
From Everand
ChatGPT Mastery: Integrating AI into Your Workflow for Advanced Users
GN
No ratings yet
Mastering ChatGPT: Effective Prompts and Best Practices.
From Everand
Mastering ChatGPT: Effective Prompts and Best Practices.
Steven Mcananey
No ratings yet
Mastering Prompt Engineering
From Everand
Mastering Prompt Engineering
Youngsoo Chae
No ratings yet
Mastering C# 8.0: Master C# Skills with Hands-on Code Examples (English Edition)
From Everand
Mastering C# 8.0: Master C# Skills with Hands-on Code Examples (English Edition)
Joydip Kanjilal
No ratings yet
Java/J2EE Design Patterns Interview Questions You'll Most Likely Be Asked: Second Edition
From Everand
Java/J2EE Design Patterns Interview Questions You'll Most Likely Be Asked: Second Edition
Vibrant Publishers
No ratings yet
Programming Problems: A Primer for The Technical Interview
From Everand
Programming Problems: A Primer for The Technical Interview
Bradley Green
4.5/5 (3)
113ElectricManifesting Part1
100% (1)
113ElectricManifesting Part1
19 pages
Pe RD Ev MO DU LE: Week 1
No ratings yet
Pe RD Ev MO DU LE: Week 1
5 pages
Effects of Internet Addiction and Common Subtance To A Student Behavior 2
No ratings yet
Effects of Internet Addiction and Common Subtance To A Student Behavior 2
2 pages
PDF - Social Studies G8 Enrichment - Folktales and Myths and Legends
No ratings yet
PDF - Social Studies G8 Enrichment - Folktales and Myths and Legends
12 pages
(Tailieudieuky.com) Đề Thi Chọn Học Sinh Giỏi Tiếng Anh 9 Tỉnh Hà Tĩnh Năm Học 2019-2020 Có Đáp Án
0% (1)
(Tailieudieuky.com) Đề Thi Chọn Học Sinh Giỏi Tiếng Anh 9 Tỉnh Hà Tĩnh Năm Học 2019-2020 Có Đáp Án
3 pages
3rd Grade Synonyms Crossword 2
No ratings yet
3rd Grade Synonyms Crossword 2
2 pages
Summer in Calcutta A Feministic, Autobiographical and Psychoanaytical Discussion of Kamala Das
No ratings yet
Summer in Calcutta A Feministic, Autobiographical and Psychoanaytical Discussion of Kamala Das
3 pages
Full Text of - Simone de Beauvoir - Brigitte Bardot and The Lolita Syndrome
100% (1)
Full Text of - Simone de Beauvoir - Brigitte Bardot and The Lolita Syndrome
24 pages
Karen Sova Thesis Final
No ratings yet
Karen Sova Thesis Final
57 pages
Third Quarter Exam - CT2 11
No ratings yet
Third Quarter Exam - CT2 11
4 pages
RE2705 Urban Economics AY2021/2022 Brief of Group Project Assignment
No ratings yet
RE2705 Urban Economics AY2021/2022 Brief of Group Project Assignment
7 pages
Long Term Romantic Relationships
No ratings yet
Long Term Romantic Relationships
14 pages
The Theory of Accounting Engineering İsmail Tekbaş
No ratings yet
The Theory of Accounting Engineering İsmail Tekbaş
3 pages
Differentiating Biases From Prejudices (Part 2)
No ratings yet
Differentiating Biases From Prejudices (Part 2)
27 pages
Testing Nonverbal IQ in Children With Autism Spectrum
No ratings yet
Testing Nonverbal IQ in Children With Autism Spectrum
8 pages
Sem Título
No ratings yet
Sem Título
1 page
Working in A Team Environment: Basic Competency
No ratings yet
Working in A Team Environment: Basic Competency
25 pages
Table S1. Effects of COVID-19 Questionnaire (ECQ)
No ratings yet
Table S1. Effects of COVID-19 Questionnaire (ECQ)
8 pages
Thinking Learning Style
No ratings yet
Thinking Learning Style
9 pages
7 Elements in Selling Process
No ratings yet
7 Elements in Selling Process
14 pages
Casework
No ratings yet
Casework
48 pages
Practicing College Learning Strategies - 7th Edition Latest Edition Download
100% (11)
Practicing College Learning Strategies - 7th Edition Latest Edition Download
17 pages
SLAC Training Proposal on RPMS
No ratings yet
SLAC Training Proposal on RPMS
7 pages
Gift
No ratings yet
Gift
5 pages
Prepared By:: Nagrampa Hazel. Mercadero Jeht Carlo
No ratings yet
Prepared By:: Nagrampa Hazel. Mercadero Jeht Carlo
25 pages
Bholi X
No ratings yet
Bholi X
2 pages
Спіч
No ratings yet
Спіч
9 pages
FS 1 Episode 5,6,7 (Lorimar)
No ratings yet
FS 1 Episode 5,6,7 (Lorimar)
13 pages

?️_?️ Vision SFT Handbook

Uploaded by

?️_?️ Vision SFT Handbook

Uploaded by

👁️‍🗨️ Vision SFT Handbook

Everything you need to know for high-quality tasks.

Welcome to Vision SFT!

This document will give you:

🆕 Change Log: Please review every day!

Date What got updated?

April 10 ●​ Added more info on Review & Edit steps in

April 11 Clarified Grounding requirements in the Attempter

Reviewer Workflow (L0):

Locale Help Type Image Type Text Load Text Language

Click here to learn about Help Types and Image Examples

Click here to learn about Image Types

USE CASES CAN OVERLAP!

Specific things to watch out for:

●​ Response: (rewrite these if you see them)

- Jacques: Oh yes this is a great question, let’s

- You: Yes Jacques it’s a pleasure to be here.

- Jacques: Well you see the snorkeling makes the sea

- You: I never knew that.

Localization is VERY IMPORTANT.

●​ Using local spellings (e.g., "favour" and "summarise" in en_GB and

Avoid tasks that can be answered by simply translating an English response.

Response Creation Guidelines:

There can be exceptions to these guidelines. These include:

●​ Prompts requesting a 1-week itinerary or workout plan, which naturally require

You might also like

April 10 ● Added more info on Review & Edit steps in

● Response: (rewrite these if you see them)

● Using local spellings (e.g., "favour" and "summarise" in en_GB and

● Prompts requesting a 1-week itinerary or workout plan, which naturally require