I am extremely horrified by the prospect of GenAI grading.
You are roughly a decade late. Computers have been grading essays for a long time. The mcat for example hasn’t had human grading in about that long.
LLM system input is unsanitizable, according to NVidia:
The control-data plane confusion inherent in current LLMs means that prompt injection attacks are common, cannot be effectively mitigated, and enable malicious users to take control of the LLM and force it to produce arbitrary malicious outputs with a very high likelihood of success.
https://developer.nvidia.com/blog/securing-llm-systems-against-prompt-injection/
Reminds me of: https://www.wired.com/story/null-license-plate-landed-one-hacker-ticket-hell/
A guy thought it would be funny to change his license plate to NULL.
How do you sanitize ai prompts? With more prompts?
Two muffins are baking in an oven. One muffin turns to the other and says “sure is hot in here isn’t it?”
To which the other muffin replies “Holy crap! A talking muffin!”
Changing the muffins to cookies would not make it a different joke.