ログイン
Language:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 02 情報科学
  2. 02 国際会議論文

Enhancing Hate Speech Classifiers through a Gradient-assisted Counterfactual Text Generation Strategy

http://hdl.handle.net/10061/0002001338
http://hdl.handle.net/10061/0002001338
eab1d742-410a-438b-9665-c7c0f8a2018d
アイテムタイプ 会議発表論文 / Conference Paper(1)
公開日 2026-02-19
タイトル
タイトル Enhancing Hate Speech Classifiers through a Gradient-assisted Counterfactual Text Generation Strategy
言語
言語 eng
資源タイプ
資源タイプ conference paper
アクセス権
アクセス権 open access
著者 Van Supranes, Michael

× Van Supranes, Michael

en Van Supranes, Michael

Search repository
Peng, Shaowen

× Peng, Shaowen

en Peng, Shaowen

Search repository
若宮, 翔子

× 若宮, 翔子

ja 若宮, 翔子

ja-Kana ワカミヤ, ショウコ

en Wakamiya, Shoko

Search repository
荒牧, 英治

× 荒牧, 英治

ja 荒牧, 英治

ja-Kana アラマキ, エイジ

en Aramaki, Eiji

Search repository
抄録
内容記述タイプ Abstract
内容記述 Counterfactual data augmentation (CDA) is a promising strategy for improving hate speech classification, but automating counterfactual text generation remains a challenge. Strong attribute control can distort meaning, while prioritizing semantic preservation may weaken attribute alignment. We propose Gradientassisted Energy-based Sampling (GENES) for counterfactual text generation, which restricts accepted samples to text meeting a minimum BERTScore threshold and applies gradient-assisted proposal generation to improve attribute alignment. Compared to other methods that solely rely on either prompting, gradient-based steering, or energy-based sampling, GENES is more likely to jointly satisfy attribute alignment and semantic preservation under the same base model. When applied to data augmentation, GENES achieved the best macro F1-score in two of three test sets, and it improved robustness in detecting targeted abusive language. In some cases, GENES exceeded the performance of prompt-based methods using a GPT-4o-mini, despite relying on a smaller model (Flan-T5-Large). Based on our cross-dataset evaluation, the average performance of models aided by GENES is the best among those methods that rely on a smaller model (Flan-T5-L). These results position GENES as a possible lightweight and open-source alternative.
書誌情報 en : Findings of the Association for Computational Linguistics: EMNLP 2025

p. 3529-3544, ページ数 16, 発行日 2025
会議情報
会議名 The 2025 Conference on Empirical Methods in Natural Language Processing
開始年 2025
開始月 11
開始日 04
終了年 2025
終了月 11
終了日 09
開催期間 2025-11-04 - 2025-11-09
開催地 Suzhou, China
開催国 CHN
出版者
出版者 Association for Computational Linguistics
出版者版DOI
関連タイプ isReplacedBy
識別子タイプ DOI
関連識別子 https://doi.org/10.18653/v1/2025.findings-emnlp.189
出版者版URI
関連タイプ isReplacedBy
識別子タイプ URI
関連識別子 https://aclanthology.org/2025.findings-emnlp.189/
権利
権利情報Resource https://creativecommons.org/licenses/by/4.0/
権利情報 ©2025 Association for Computational Linguistics. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
著者版フラグ
出版タイプ NA
戻る
0
views
See details
Views

Versions

Ver.1 2026-02-19 02:56:10.180180
Show All versions

Share

Share
tweet

Cite as

Other

print

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR 2.0
  • OAI-PMH JPCOAR 1.0
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX
  • ZIP

コミュニティ

確認

確認

確認


Powered by WEKO3


Powered by WEKO3