Abstract
Instruction-tuned language models produce text that commercial detectors identify as non-human, prompting the development of a paraphrasing pipeline that improves human-likeness while preserving semantics across different model sizes.
As AI-generated text enters the real-world at scale, institutions increasingly use commercial AI-text detectors, especially in education and academic-integrity workflows. We report a surprising empirical finding about such systems: when evaluated by GPTZero and Pangram, generated text from base models is often judged overwhelmingly human, whereas text generated by their instruction-tuned counterparts is not. Building on this observation, we propose Humanization by Iterative Paraphrasing (HIP), a detector-agnostic pipeline that minimally fine-tunes a base model into a paraphraser and applies it iteratively. Compared with the baselines we test, HIP yields a stronger trade-off between semantic preservation and detector evasion on commercial detectors. Across Llama-3 and Qwen-3 families, spanning model sizes from 0.6B to 70B, HIP consistently improves detector human-likeness. Our findings suggest that current detectors are tracking artifacts of instruction tuning and local context more than any invariant notion of machine-generated text. This, in turn, calls for detector designs that model these factors more explicitly.
Community
We found that current AI text detectors (GPTZero, Pangram) largely fail on base models: they track artifacts of instruction tuning rather than the general "machine-generated text".
Building on this, we introduce HIP (Humanization by Iterative Paraphrasing) which minimally fine-tune a base model into a paraphraser, then apply it iteratively to shift outputs toward human distributions, achieving state-of-the-art evasion-semantics tradeoff.
š¦ Tweet: https://x.com/YixuanEvenXu/status/2057171878754783429
š» Repo: https://github.com/YixuanEvenXu/humanization-by-iterative-paraphrasing
Get this paper in your agent:
hf papers read 2605.19516 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 14
YixuanEvenXu/Llama-3-70B-Instruct-HIP-adapter
Datasets citing this paper 1
YixuanEvenXu/HIP-training-and-evaluation-data
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper