
Hey all, great to meet you.

Here's the order I'd attempt fixes:

1. Switching the model to gpt-4-1106-preview will probably solve most instruction-following or hallucination issues.

2. Try more prompt tuning. All OpenAI models tend to rewrite all code unnecessarily. Tell them a few times in the prompt not to rewrite code from documentation.
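
For example, a sketch of what "a few times in the prompt" can look like (the exact wording here is illustrative, not a tested magic phrase):

```python
# A system prompt that repeats the "don't rewrite" instruction, plus a
# reminder restated near the user turn. Wording is illustrative only.

SYSTEM_PROMPT = """You are a Lua coding assistant.
Use the documentation snippets provided in context verbatim.
Do NOT rewrite code taken from the documentation.
If a documentation example already solves the task, copy it as-is.
Reminder: never rewrite documentation code from scratch."""

def build_messages(context: str, question: str) -> list:
    """Assemble a chat request, restating the instruction in the user turn."""
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": (
            f"Documentation context:\n{context}\n\n"
            f"Question: {question}\n"
            "Remember: do not rewrite documentation code."
        )},
    ]
```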

3. Try generating new docs of examples rather than Lua source code, which will limit the model's tendency to rewrite things from scratch.

4. Unfortunately, the model, prompt, and data are the only vectors you can improve with the Assistants API or a RAG wrapper. Moving to a custom prompt chain might give you more flexibility for controlling retrieved context.

5. If you go custom, you can still use a cloud vector store like Pinecone, except now you'll feed queried results into the prompt yourself. You can play around with the number of entries you sample or the similarity cutoff (a minimum similarity score between the query and document vectors) and see if that reduces hallucinations. (If you need some boilerplate code on embedding/querying, lmk.)
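
Rough sketch of the top-k plus cutoff filtering, using toy vectors (in practice Pinecone returns the similarity scores for you, and the embeddings would come from an embedding endpoint; the cutoff value here is arbitrary):

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def filter_hits(query_vec, hits, top_k=3, min_sim=0.75):
    """hits: list of (chunk_text, embedding) pairs.
    Keep at most top_k chunks, dropping anything below the cutoff."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in hits]
    scored.sort(reverse=True)  # highest similarity first
    return [text for sim, text in scored[:top_k] if sim >= min_sim]
```

Tightening `min_sim` trades recall for precision: fewer, more relevant chunks in the prompt usually means fewer hallucinated answers.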

6. If tuning the number/cutoff doesn't work, try reranking the results. See OpenAI's example here: https://cookbook.openai.com/examples/question_answering_using_a_search_api
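
The cookbook example uses an LLM as the reranker; the shape of the step is just "re-score retrieved chunks against the query, then re-order." Here's the skeleton with a cheap keyword-overlap scorer standing in for the LLM call:

```python
def rerank(query: str, chunks: list) -> list:
    """Re-order retrieved chunks by relevance to the query.
    The scorer here is naive keyword overlap; in the cookbook version
    the score comes from an LLM judging each chunk instead."""
    q_terms = set(query.lower().split())

    def score(chunk):
        return len(q_terms & set(chunk.lower().split()))

    return sorted(chunks, key=score, reverse=True)
```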

7. If that doesn't work, try HyDE — a technique that generates a 'fake' hypothetical answer to the query and retrieves against its embedding, which often returns better results (https://arxiv.org/abs/2212.10496).
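
The whole trick fits in a few lines. Here `generate` and `embed` are placeholders for your LLM and embedding calls (not real SDK functions), and the prompt wording is just one way to phrase it:

```python
def hyde_query_embedding(question, generate, embed):
    """HyDE: instead of embedding the raw question, generate a
    hypothetical answer and embed that. Documents tend to look more
    like answers than like questions, so the hypothetical answer's
    embedding often lands closer to the relevant chunks."""
    hypothetical = generate(
        f"Write a short Lua documentation passage that answers: {question}"
    )
    return embed(hypothetical)
```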


8. Try getting the model to generate a task list for complex coding tasks with a few atomic parts. Task-list generation is tricky — try adding at least five in-prompt (multi-shot) examples. Then complete each task recursively.
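
The recursion looks something like this — `decompose` and `complete` are placeholders for your LLM calls (the five multi-shot examples would live in `decompose`'s prompt), and the depth cap is an assumption to stop runaway splitting:

```python
def solve(task, decompose, complete, depth=0, max_depth=3):
    """Recursively break a task into atomic subtasks and complete each.
    `decompose(task)` returns a list of subtasks, or [] if the task is
    already atomic; `complete(task)` does the actual work."""
    subtasks = decompose(task) if depth < max_depth else []
    if not subtasks:
        return [complete(task)]
    results = []
    for sub in subtasks:
        results.extend(solve(sub, decompose, complete, depth + 1, max_depth))
    return results
```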

9. Another trick I didn't mention is sampling multiple results from the model at different temperatures and having the model pick the best one. A surprising amount of the time, it picks an attempt with a nonzero temperature (which means it often judges a response other than its first attempt to be optimal).
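
A minimal sketch of that best-of-n loop — `sample(prompt, temp)` and `judge(candidates)` are placeholders wrapping your chat calls (with `judge` prompted to return the index of the best candidate), and the temperature values are just a reasonable spread:

```python
def best_of_temperatures(prompt, sample, judge, temps=(0.0, 0.4, 0.8)):
    """Sample one completion per temperature, then ask the model
    (via `judge`) to pick the best. Returns the winning completion."""
    candidates = [sample(prompt, t) for t in temps]
    return candidates[judge(candidates)]
```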

10. If all those don't work, try fine-tuning a custom gpt-3.5-turbo-1106 model where you hand-annotate the desired output. Make sure to still combine it with RAG — fine-tuning teaches the model to follow your instructions, but it's poor at teaching it new data.
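
The training examples use the chat-format JSONL (one JSON object per line with a "messages" list). Since you're keeping RAG, include the retrieved docs in the training inputs too, so the fine-tuned model learns to answer from supplied context rather than from memory — the system/user wording below is illustrative:

```python
import json

def training_line(context, question, annotated_answer):
    """One JSONL line for chat fine-tuning. The retrieved context is
    baked into the user turn so the tuned model still expects RAG input."""
    return json.dumps({"messages": [
        {"role": "system", "content": "Answer using only the provided Lua docs."},
        {"role": "user", "content": f"Docs:\n{context}\n\nQ: {question}"},
        {"role": "assistant", "content": annotated_answer},
    ]})
```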
