Amazon interview question

How to evaluate the model-generated text quality?