Estimating Token Consumption and Response Token Count in Databricks using dbrx-instruct
I’m trying to understand how to estimate the token consumption and response token count in Databricks using dbrx-instruct
. I want to create a function that can predict the number of tokens I’ll be requesting based on my query and how many tokens I’ll receive in response. This information is crucial for estimating the costs incurred.