Will changing system prompts in fine-tuning mess things up?

Kayla 95 Reputation points
2025-03-05T22:51:48.11+00:00

I'm building an application that uses my fine-tuned GPT model. The model is trained with a somewhat generalized system prompt because each conversation example includes slight variations (e.g., different personal details like names or phone numbers, which I censor and don't include in the prompt).

In my application, I use a similar system prompt but add a sentence or two to include conversation-specific details. However, this additional information is getting longer, and the model sometimes fails to recall it properly. To improve this, I’m considering incorporating these details into the fine-tuning data, meaning the system prompts in my training examples would vary.

Would this cause issues? So far, Azure OpenAI has only stated that changing the system message may lead to different results than expected, and ChatGPT has suggested it could cause unpredictable behavior. Has anyone tested this in practice? Is there a better way to get GPT to pay closer attention to the system prompt?

Azure AI services

Accepted answer
  1. Marcin Policht 38,000 Reputation points MVP
    2025-03-05T23:36:35.58+00:00

    Varying the system prompt across your fine-tuning examples in order to embed conversation-specific details could indeed lead to inconsistent model behavior. OpenAI's guidance only says that changing the system message may produce different results than expected, but in your case, where the system prompt would differ between training examples, it can also introduce unintended biases or unpredictability in the model's responses.

    In particular, the model learns patterns from the training data:

    - If the system prompt varies significantly between examples, the model may become overly sensitive to minor prompt changes, leading to unexpected shifts in output quality.
    - A fixed, well-crafted system prompt encourages generalization across conversations; if the model sees many different system prompts during training, it never develops a strong anchoring effect, making the system prompt harder to rely on at inference time.
    - If certain patterns appear frequently in the training system prompts, the model may bias its responses toward those patterns, limiting flexibility.

    The safer pattern is to keep one stable system prompt in every training record and put the variable facts into the conversation turns (see the sketch below).
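    For illustration, here is a minimal sketch in Python of what such a training record could look like; the prompt text, customer details, and file name are all made up:

    ```python
    import json

    # One fixed system prompt shared by every training example.
    SYSTEM_PROMPT = (
        "You are a customer support assistant. Use the customer details "
        "provided earlier in the conversation when answering."
    )

    # Hypothetical examples: the variable facts (name, phone number, ...) live in
    # the first user message, so the system prompt is identical across the dataset.
    examples = [
        {
            "messages": [
                {"role": "system", "content": SYSTEM_PROMPT},
                {"role": "user", "content": "Customer details: name Alice, callback number 555-0100.\n"
                                            "Hi, I need to reschedule my appointment."},
                {"role": "assistant", "content": "Of course, Alice. I've rescheduled it and will confirm at 555-0100."},
            ]
        },
        # ... more examples in the same shape ...
    ]

    # Chat fine-tuning expects one JSON object per line (JSONL).
    with open("training_data.jsonl", "w", encoding="utf-8") as f:
        for example in examples:
            f.write(json.dumps(example) + "\n")
    ```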

    Instead of embedding the variable details in the fine-tuning dataset, consider handling them at request time:

    - Structure your API requests so that the recent, conversation-specific details are part of the input messages (a few-shot style approach) rather than the system prompt.
    - If the system prompt is getting too long, move the conversation-specific details into an initial user or assistant message. This keeps the system prompt concise while keeping the key details in focus (see the sketch below).
    - During fine-tuning, you can train the model to repeat or summarize the important details when responding, which reinforces them without overloading the system prompt.
    - If some details are structured data (e.g., names, phone numbers), consider a retrieval approach where the model calls a function to fetch the relevant context dynamically instead of embedding everything in the prompt.
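    As a rough illustration of the second point, here is a sketch using the Azure OpenAI Python SDK; the endpoint variables, API version, deployment name, and the build_messages helper are placeholders for illustration, not anything from your setup:

    ```python
    import os
    from openai import AzureOpenAI  # assumes the current openai Python package

    client = AzureOpenAI(
        azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
        api_key=os.environ["AZURE_OPENAI_API_KEY"],
        api_version="2024-06-01",  # placeholder; use the version your resource supports
    )

    # The same short, stable system prompt the model was fine-tuned with.
    SYSTEM_PROMPT = (
        "You are a customer support assistant. Use the customer details "
        "provided earlier in the conversation when answering."
    )

    def build_messages(customer_details: str, user_question: str) -> list[dict]:
        """Keep the system prompt fixed and inject the conversation-specific
        details as an initial user/assistant exchange instead of growing the
        system prompt."""
        return [
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": f"Customer details for this conversation:\n{customer_details}"},
            {"role": "assistant", "content": "Understood. I'll use these details in this conversation."},
            {"role": "user", "content": user_question},
        ]

    response = client.chat.completions.create(
        model="my-finetuned-deployment",  # placeholder: your fine-tuned model's deployment name
        messages=build_messages(
            customer_details="Name: Alice\nCallback number: 555-0100",
            user_question="Can you confirm my appointment time?",
        ),
    )
    print(response.choices[0].message.content)
    ```

    The acknowledgment turn from the assistant is optional; it just gives the injected details a clear place in the conversation before the user's real question arrives.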

    From what I've seen in practical tests, varying system prompts in fine-tuning generally introduces inconsistencies. Instead, models tend to perform better when the system prompt remains stable, and additional details are provided dynamically within conversation turns.


    If the above response helps answer your question, remember to "Accept Answer" so that others in the community facing similar issues can easily find the solution. Your contribution is highly appreciated.

    hth

    Marcin
