The AGC Debate: Are AI-Written Interview Answers a Red Flag or Smart Strategy?
Every day, we read stories of increased fake or AI-assisted applications. Tools like LazyApply are just one of many flooding the market, driving up applicant volumes to never-before-seen levels.?
As an overwhelmed hiring function, how do you find the needle in the haystack without using an army of recruiters to filter through the maze?
At Sapia.ai, we help global enterprises do just that. Many of the world’s most trusted brands, such as Qantas Group, have relied on our hiring platform as a co-pilot for better hiring since 2020.?
Our Chat Interview has given millions of candidates a voice they wouldn’t have had – enabling them to share in their own words why they’re the best fit for the role. To find the people who belong with their brands, our customers must trust that their candidates represent themselves. Thus, they want to trust that our AI is analysing real human answers —not answers from a machine.??
The Rise of GPT?
When ChatGPT went viral in November 2022, we immediately adopted a defensive strategy. We had long been flagging plagiarised candidate responses, but then, we needed to act fast to flag responses using artificially generated content (‘AGC’).?
Many companies were in the same position, but Sapia.ai was the only company with a large proprietary data set of interview answers that pre-dated GPT and similar tools: 2.5 billion words written by real humans.?
That data enabled us to build a world-first:- an LLM-based AGC detector for text-based interviews, recently upgraded to v2.0 with 99% accuracy and a false positive rate of 1%. An NLP classification model built on Sapia.ai proprietary data that operates across all Sapia.ai chat interviews.
Full Transparency with Candidates
Because we value candidate trust as much as customer trust, we wanted to be transparent with candidates about our ability to detect artificially generated content (AGC). As an LLM, we could identify AGC in real time and warn candidates that we had detected it.?
This has had a powerful impact on candidate behaviour. Since our AGC detector went live, we have seen that the real-time flagging acts as a real-time disincentive to use tools like ChatGPT to generate interview responses.?
The detector generates a warning if 3 or more answers are flagged as having artificially generated content. The Sapia.ai Chat Interview uses 5 open-ended interview questions for volume hiring roles, such as retail, contact centre, and customer service, and 6 questions for professional roles, such as engineers, data scientists, graduates, etc.
Let’s Take a Closer Look at the Data…?
We see that using our AGC detector LLM to communicate live with candidates in the interview flow when artificial content has been detected has a positive effect on deterring candidates from using AI tools to generate their answers.?
The rate of AGC use declines from 1 question flagged to 5 questions – raising the flag on one question is generally enough to deter candidates from trying again.?
领英推荐
The graph below shows the number of candidates, from a total of almost 2.7m, that used artificially generated content in their answers.??
Differences in AGC Usage Rate by Groups?
We see no meaningful differences in candidate behaviour based on the job they are applying for or based on geography.
However, we have found differences by gender and ethnicity – for example, men use artificially generated content more than women. The graph below shows the overall completion ratios by gender – for all interviews on the left and for interviews where the number of questions with AGC detected is 5 or more on the right.?
Perception of Artificially Generated Content by Hirers.?
We’re curious to understand how hirers perceive the use of these tools to assist candidates in a written interview. The creation of the detector was based on the majority of Sapia.ai customers wanting transparency & explainability around the use of these tools by candidates, often because they want to ensure that candidates are using their own words to complete their interviews and they want to avoid wasting time progressing candidates who are not as capable as their chat interview suggests.??
However, some of our customers feel that it’s a positive reflection of the candidate, showing that they are using the tools available to them to put their best foot forward.?
It’s a mix of perspectives.?
Our detector labels it as the use of artificially generated content. It’s up to our customers how they use that information in their decision-making processes.?
This concept of having a human in the loop is one of the key dimensions of ethical AI, and we ensure that it is used in every AI-related hiring product we build.?
Interested in the science behind it all? Download our published research on developing the AGC detector?