New benchmark tests how AI detection models perform across languages and multilingual content transformations such as ...