IE 11 Not Supported

For optimal browsing, we recommend Chrome, Firefox or Safari browsers.

Are AI chatbots better at math when they pretend to be Star Trek characters?

Answer: It appears so.

star_trek_11
While chatbots like OpenAI’s ChatGPT may be good at creating complete sentences, they are not so proficient at solving math problems. Rick Battle and Teja Gollapudi, both then at VMware’s natural language processing lab, wanted to find out if recent rumors were true that providing chatbots with positive encouragement when asking a math problem improves their answers.

They started by giving the bots prompts to solve grade-school math problems, some of which began with encouraging phrases like “you are an expert mathematician” and ended with things like “this will be fun!” The bots, however, did not consistently perform better with the positive reinforcement. So, they turned to AI to improve their methods.

They used an automated process to tweak the phrasing of the prompts based on whether or not the chatbots’ accuracy in solving the problems improved. This process was overall more effective in making the bots better at math, but the ones that were the best were the most surprising. When the chatbots were asked to start their answers with “Captain’s Log, Stardate [insert date here]:,” a phrase even casual Star Trek fans will recognize, the bots’ accuracy consistently improved.

“Surprisingly, it appears that the model’s proficiency in mathematical reasoning can be enhanced by the expression of an affinity for Star Trek,” the team wrote in their findings.