Can we detect ChatGPT-generated texts in Czech and Slovak languages?

Varování

Publikace nespadá pod Lékařskou fakultu, ale pod Fakultu informatiky. Oficiální stránka publikace je na webu muni.cz.
Autoři

ŠIGUT Petr FOLTÝNEK Tomáš

Rok publikování 2023
Druh Článek ve sborníku
Konference Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2023
Fakulta / Pracoviště MU

Fakulta informatiky

Citace
www http://nlp.fi.muni.cz/raslan/2023/paper10.pdf
Klíčová slova ChatGPT; AI-detection; Czech; Slovak
Popis The wide availability of generative AI exacerbates existing threats to society. It would not be easy even for linguists to tell whether the text we are reading was generated by a Large Language Model (LLM) or written by a human. Researchers have started developing tools that detect AI-generated content. This paper tested how two of these tools, Compilatio and GPT-2 Output Detector, performed with Czech, Slovak and English texts. There was only one tool somewhat capable of detecting AI-generated texts: Compilatio. Other tools were designed to work only with English texts. Hence, we also tested whether automatically translating the Czech and Slovak texts to English before uploading them to the detectors would have given any promising results. Ultimately, we showed that the texts generated by ChatGPT4 were less detectable than the texts generated by ChatGPT3.5.

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.

Další info