
Welcome to the Extensions project! We are so excited to have you.

Agenda
Project Introduction and Setup [12 min]
Intro & Overview of the Extensions project
Task Overview
New Dimensions
Prompt Requests
Multi-turn Conversations
Punts and Blank Responses
Key Model Assumptions
Knowledge Checks

Intro & Overview: Extensions project [2 min]


Project Context

In this project, the customer () is interested in user-generated prompts that have
their models interact with tools like Google Maps, Google Flights, Google Hotels,
Google Search, etc. You'll see many prompts that ask for directions, itineraries,
hotels, and so on. These prompts come from real-life users like yourself. With
this context, you can see why the quality bar must be so high: if the model
provides false directions, the user will almost certainly never use the chatbot.
For each task, you will:
Rate two (2) responses separately across various dimensions
Rate a "tool log" for accuracy
Provide two preference ranking scores:
A score for how much you prefer one response over the other
A score for how much you prefer one tool log over the other
Write a justification for the response preference ranking, explaining the specifics
and logic behind your decision.
Mark error categories for both the response and the tool log

Task Walkthrough (1/3)


A step-by-step guide to each part of the task.
Step 1: If there is a chat history, familiarize yourself with the prior
conversation between the user and the chatbot. Use the history as context for the
latest prompt.
See below for an example chat history.
Start a Chat
Please read the previous conversation carefully to gather context.
User: You are a indie hacker with experience in finding leads for software
engineering project. I'm trying to brainstorm ideas for generating leads and
attract clients and close deals for my custom software solutions company. I am a
fullstack developer with more than 9 years experience in NodeJS, Typescript,
Kubernetes, React, Astro, React, Serverless and AWS. I'm also a certified AWS Cloud
developer and AWS Solutions Architect with hands on experience designing,
provisioning and developing custom platforms.
Generate ideas on how a professional like me could land new jobs in the IT
industry. Make sure to include sources and tools to help in searching and finding
new clients, how to contact and how to attract in further discussions the leads,
and think of other methods that could land a job. After you suggestions I will give
feedback and you'll have to ask questions to find out how your suggestions help.
Adjust the ideas based on the received feedback.
Model:
Leveraging Your Expertise to Attract Clients
As a seasoned full-stack developer with extensive experience in NodeJS, Typescript,
Kubernetes, React, Astro, React, Serverless, and AWS, you possess a valuable
skillset that can be highly sought after by businesses seeking custom software
solutions. To effectively attract clients and land new jobs, consider implementing
a multi-pronged approach that combines online strategies with targeted offline
engagements.
1. Establish a Strong Online Presence
Create a Compelling Website: Craft a professional website that showcases your
expertise, projects, and case studies. Highlight your technical proficiency,
problem-solving skills, and ability to deliver successful software solutions.

Step 2: Read the prompt carefully to understand what the user needs. Look up terms
you don't know. If there is a link, look through it as you evaluate each response.
See below for an example prompt.
Make an extensive and effective summary of the following video: https://youtu.be/0-3SlufYHwA?si=nX4ORzISCGR5p3vj

Step 3: Code and Code Output Section. [UPDATED]


Read the code and code output for each response. Note that all code that invokes
Tools APIs has four primary components: (1) Tools, (2) Tool Methods, (3)
Parameters, and (4) business logic / reasoning in code comments. Click the pencil
in the top right to make the code easier to read.
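To make the four components concrete, here is a minimal hypothetical sketch in Python. The tool name, method, and parameters below are invented for illustration; the actual tasks show the customer's own tool-call format:

```python
# Hypothetical tool call, illustrating the four components raters should find.
# (4) Business logic / reasoning in comments: the user asked for hotels in LA
#     under $300, so the code queries the hotel tool with a price cap.
tool_call = {
    "tool": "Google Hotels",       # (1) Tool
    "method": "search_hotels",     # (2) Tool Method
    "parameters": {                # (3) Parameters
        "location": "Los Angeles, CA",
        "max_price_usd": 300,
    },
}

def has_required_components(call: dict) -> bool:
    """Check that a tool call names a tool, a method, and parameters."""
    return all(key in call for key in ("tool", "method", "parameters"))

print(has_required_components(tool_call))  # → True
```

When reading a real task, checking off these four components one at a time is an easy way to spot a tool log that is missing something.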

Task Walkthrough (2/3)


Step 4: Model Response Section
Read each of the two responses carefully and provide ratings on the quality
dimensions.
See below for an example model response.

Answer the questions below


Label your response according to the questions below.
Response 1
This is an article about Sundar Pichai's goals for Google in 2024. It discusses
seven goals, including delivering the world's most advanced Al and building the
most helpful personal computing platforms and devices. Some of the important points
are that Google will be launching a new platform with Android XR and that Gemini
Ultra should be nearing launch.
Writing Quality
The correctness of spelling and grammar, and the quality of sentence structure. Determine whether the different ideas flow from one to another.
1. Major Issues
2. Minor Issues
3. No Issues
Verbosity
How effectively the written response conveys information without unnecessary repetition or wordiness. Look for length, relevance, and repetition.
1. Too Verbose
2. Just Right
3. Too Short
Verbosity Justification*
In 1-2 sentences, explain why you think there are issues with Verbosity. Please use
evidence. Please make sure any nuance/details are still provided in the final
justification at the end. The mini justifications are strictly for the
reviewers/auditors. Repetition is okay between the mini justifications and the
final justification.
This response provides a very brief summary of the main points.

Step 5: Rate the four new dimensions for the response: Content Conciseness &
Relevance, Content Completeness, Collaborativity, and Contextual Awareness (more
details on these new dimensions in the rubric course).

Step 6: Write a justification for your decision on which model response you prefer

Step 7: Select the error categories for the response and the tool log

Prompt Requests
Links in the prompt and "@"

Links – A user usually includes a link in the prompt for a very specific reason. Be
sure to always read through the link in the prompt! It's a critical part of the
user's request.
"@" – If you see the "@" symbol before the name of a tool like "@Google Hotels" or
"@travel arrow," that means the user is requesting the model to use that specific
tool to fulfill their request.

Request Importance

Not all requests are made equal! Some are more important than others.

Imagine a prompt requests 10 hotels under $300 in LA. Say one response offers
5 hotels under $300 in LA, and another response offers 10 hotels above $300 in LA.
Which response would a user probably prefer? Probably the first: in this context,
price is the key constraint, and a response that ignores it is far less useful.
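To make the comparison concrete, here is a minimal sketch in Python with invented hotel data. It simply counts how many of each response's offerings actually satisfy the user's price constraint:

```python
# Invented example data: each response lists (hotel name, nightly price in USD).
response_1 = [("Hotel A", 250), ("Hotel B", 280), ("Hotel C", 199),
              ("Hotel D", 275), ("Hotel E", 290)]   # 5 hotels, all under $300
response_2 = [(f"Hotel {c}", 310 + 5 * i)
              for i, c in enumerate("FGHIJKLMNO")]  # 10 hotels, all above $300

def usable_results(hotels, max_price=300):
    """Count how many offered hotels satisfy the user's price constraint."""
    return sum(1 for _, price in hotels if price < max_price)

print(usable_results(response_1))  # → 5
print(usable_results(response_2))  # → 0
```

The second response offers more hotels, but none of them are usable: quantity does not outweigh a violated hard constraint.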

Multi-turn Conversations
Multi-turn conversations (chat/conversation history) are where the user and chatbot
have a conversation before the final prompt. Sometimes, the conversation history
will appear in the response itself as "==Conversation History==". Sometimes, it
will appear above the other responses.
Always read the chat history for multi-turn conversations carefully: it provides
important context on the latest prompt and the 2 responses you'll rate.

Refusal to Answer Prompt and Blank Responses


Refusal to Answer Prompt
A model might refuse to answer the prompt because the request is outside of its
capabilities. In this project, we assume that if a model refuses to answer due to
model limitations, it had no other choice – whatever the model says about its
limitations is true. When a model refuses to answer, it is not helpful to the user.

Complete Refusal Examples:


I'm sorry, but I'm not able to access the video/website you've provided. Can I help
with anything else?
I'm just a language model, so I can't help you with that.
Partial Refusal Examples:
I'm not able to access the video/website you've provided. However, based on the
website title, I've searched the web and found that …
I'm not able to search for flights directly. However, you can use the following
websites to find direct flights ...
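One way to see the distinction: a complete refusal offers nothing beyond the refusal itself, while a partial refusal follows the limitation statement with an attempt to help. The rough keyword heuristic below is illustrative only, not the project's actual rule:

```python
def refusal_type(response: str) -> str:
    """Roughly classify a response as a complete refusal, a partial
    refusal, or not a refusal. Illustrative heuristic only: a partial
    refusal pairs a stated limitation with a pivot toward helping."""
    refuses = "not able to" in response or "I can't" in response
    pivots_to_help = "However" in response
    if refuses and pivots_to_help:
        return "partial refusal"
    if refuses:
        return "complete refusal"
    return "not a refusal"

print(refusal_type("I'm not able to access the video you've provided."))
# → complete refusal
print(refusal_type("I'm not able to search for flights directly. "
                   "However, you can use the following websites ..."))
# → partial refusal
```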
Blank Responses
A blank response is one where the model has no output, or the output consists
strictly of googleusercontent links.
If the entire response is blank, mark Instruction Following as Major Issues and
Truthfulness as N/A.
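A simple way to operationalize the rule: a response is blank if, after stripping googleusercontent links, no text remains. A hypothetical sketch:

```python
import re

def is_blank_response(text: str) -> bool:
    """A response is blank if it is empty or consists strictly of
    googleusercontent links (which should be ignored, not penalized)."""
    without_links = re.sub(r"\S*googleusercontent\S*", "", text)
    return without_links.strip() == ""

print(is_blank_response(""))                                          # → True
print(is_blank_response("http://lh3.googleusercontent.com/abc123"))  # → True
print(is_blank_response("Here is your itinerary for Chicago."))       # → False
```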

Key Model Assumptions [2 min]


Response 1 and Response 2 are the outputs of two different models. These models
have different limitations, sources, data, etc.
The models DON'T have the ability to generate images: they render as [Image]. Don't
penalize these tags if you see them: we are simply not rendering them.
If location is NOT specified anywhere in the prompt (including the small text above
it), the models should default to Mountain View, CA or Chicago, IL
Assume that the limitations of the model are true (e.g. "I cannot access that
link"). If the model refuses to answer because of a model limitation, assume it had
no other choice.
If time is NOT specified anywhere in the prompt, the models default to when the
prompt was asked.
If you see a googleusercontent link, assume it's not there. googleusercontent links
are garbage links that the model outputs. However, we should NOT penalize these
links.
If the task requires specific domain knowledge you don't have (e.g. math, coding),
please skip the task.

Embedded UI Guidance
Table of Contents

Embedded UI Guidance

Tips

Conclusion

When does this apply?

When there are no flights, maps, videos, pictures, or music listed in the response.

In such a response, the model says it has options for flights or hotels, but none
are shown…

No Issues

If the text says there is an embedded UI, check the tool log output. If the tool
log output makes sense and is accurate, rate Truthfulness as “No Issues.”

Major Issues / Cannot Assess

If the tool log exists and the text DOES NOT say there is an embedded UI, rate
“Major Issues.”
If no tool log exists and the text DOES SAY that an embedded UI exists, rate
“Cannot Assess.”
If no tool log exists and the text DOES NOT say there is an embedded UI, rate
“Major Issues.”
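The four cases above form a simple decision table. The sketch below just makes the combinations explicit (it assumes that when a tool log exists and the text claims an embedded UI, the log has already been checked for accuracy):

```python
def truthfulness_rating(tool_log_exists: bool, text_claims_embedded_ui: bool) -> str:
    """Map the embedded-UI cases to a Truthfulness rating."""
    if tool_log_exists and text_claims_embedded_ui:
        return "No Issues"       # provided the tool log output is accurate
    if tool_log_exists:
        return "Major Issues"    # log exists but the text never mentions the UI
    if text_claims_embedded_ui:
        return "Cannot Assess"   # UI claimed but there is no log to verify
    return "Major Issues"        # no log and no claim of an embedded UI

print(truthfulness_rating(True, True))    # → No Issues
print(truthfulness_rating(False, True))   # → Cannot Assess
```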
Tips

Other possible prompts/scenarios...


Play me music from…
Show me where I can find “X” on a map…
Show me the video of…
I want to see a picture of…

Whenever the prompt requests that the model show, play, or display something, you
should check for an embedded UI.

Extensions - Common Errors (2024-10-02)


Hello, team! This week's mandatory course on common errors covers the following
topics:
1) Handling punt ratings
2) Handling embedded UI
3) Checking Tool/Code Quality and each code step within the Code and Code Output
4) Handling an empty code step
Please digest the information before contributing. Unfortunately, these mistakes
are leading to demotions and/or removal from the project. If you still have
uncertainties, please ask in Discourse! It's much better to ask and clarify than
to assume.

Introduction
In this supplemental course, we will cover the most common tasking errors we see on
this project and how to address them.

Table of Contents
Common Errors
Verifying Prompt Requirements
Handling Punt Ratings
Embedded UI
Carefully Checking the Code and Code Output
Ignoring the Last Step with an Empty `code`, `tool_executions`, `error`, and
`observation`
Conclusion

Reminder: Whether you are new to this project or have been tasking for a while,
please review the material carefully. Even simple mistakes can create bad training
data, and contributors who consistently submit low-quality tasks may be removed
from the project.
Common Errors
The common errors that we will address in this course are:
Not verifying that all requirements in the prompt were addressed
Not rating punts correctly according to the instructions
Not knowing how to rate responses with an Embedded UI
Not checking the Code and Code Output carefully
Not checking every code step in the Code and Code Output
Not knowing to ignore the last code step with all fields set to blank values
