Hi Adam Cameron
Ideally, we should quantize the model to optimize model weights to run on edge devices and retain accuracy close to original model
Please check the documentation on Quantization aware training and post training quantization .
Also, Adopt preprocessing techniques before inputting data.
Hope it helps address your issue.
Please don’t forget to Accept Answer and Yes for "was this answer helpful" wherever the information provided helps you, this can be beneficial to other community members.
Thank you