Generated Text Detection using Representation Learning with Token Prediction Model

Yeon-Seung Choo; Yong-Suk Park

Generated Text Detection using Representation Learning with Token Prediction Model

Yeon-Seung Choo

Yong-Suk Park

Vol. 51, No. 2, pp. 241-247, Feb. 2026

10.7840/kics.2026.51.2.241

Machine-Generated Text Detection

Model Alignment

representation learning

PDF Full-Text

Abstract

Large language models (LLMs) have become mainstream in the form of conversational chatbots, such as ChatGPT, and various social problems have begun to arise. Representative issues include AI plagiarism where individuals claim AI-generated content as their own and fake news creation and dissemination. To prevent the misuse of AI-generated text, several methods have been proposed to distinguish AI-generated content from human-written text. Popular approaches rely on zero-shot detection, which compares the negative probability curvature derived from token prediction results using various LLMs. More recently, learning-based methods have been proposed to replicate the output distribution of the target model for easier adaptation to various generators. However, these existing methods primarily depend on token-level probability estimation, making them vulnerable to variations in sentence expression, such as paraphrasing. In this paper, we propose a novel approach that applies representation learning to the prediction model to overcome these limitations. The proposed method aims to enhance robustness against paraphrasing and expression diversity, and we demonstrate its potential effectiveness through empirical evaluation.

Statistics

Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.

Cite this article

[IEEE Style]

Y. Choo and Y. Park, "Generated Text Detection using Representation Learning with Token Prediction Model," The Journal of Korean Institute of Communications and Information Sciences, vol. 51, no. 2, pp. 241-247, 2026. DOI: 10.7840/kics.2026.51.2.241.

[ACM Style]

Yeon-Seung Choo and Yong-Suk Park. 2026. Generated Text Detection using Representation Learning with Token Prediction Model. The Journal of Korean Institute of Communications and Information Sciences, 51, 2, (2026), 241-247. DOI: 10.7840/kics.2026.51.2.241.

[KICS Style]

Yeon-Seung Choo and Yong-Suk Park, "Generated Text Detection using Representation Learning with Token Prediction Model," The Journal of Korean Institute of Communications and Information Sciences, vol. 51, no. 2, pp. 241-247, 2. 2026. (https://doi.org/10.7840/kics.2026.51.2.241)

Vol. 51, No. 2 Index

Generated Text Detection using Representation Learning with Token Prediction Model

Submenu

Search
(IN TITLE, AUTHOR, ABSTRACT,KEYWORDS)

Advanced Search

Recent Publications
(LAST 3 YEARS)

Generated Text Detection using Representation Learning with Token Prediction Model

Generalized Kernel Restricted Boltzmann Machine

Study on Scalable Image Representation through Dimension Reduction and Features Encoding for Memory Circulation Mechanism Modeling

Design of Recommender Systems Exploiting Diversity Based on Network Representation Learning

Submenu

Search (IN TITLE, AUTHOR, ABSTRACT,KEYWORDS)

Advanced Search

POPULAR KEYWORDS(TOP 10 KEYWORDS)

Recent Publications(LAST 3 YEARS)

Search
(IN TITLE, AUTHOR, ABSTRACT,KEYWORDS)

POPULAR KEYWORDS
(TOP 10 KEYWORDS)

Recent Publications
(LAST 3 YEARS)