TY - JOUR
T1 - Categorical Metadata Representation for Customized Text Classification
AU - Kim, Jihyeok
AU - Amplayo, Reinald Kim
AU - Lee, Kyungjae
AU - Sung, Sua
AU - Seo, Minji
AU - Hwang, Seung Won
N1 - Publisher Copyright:
© 2019 Association for Computational Linguistics.
PY - 2019
Y1 - 2019
N2 - The performance of text classification has improved tremendously using intelligently engineered neural-based models, especially those injecting categorical metadata as additional information, e.g., using user/product information for sentiment classification. This information has been used to modify parts of the model (e.g., word embeddings, attention mech-anisms) such that results can be customized according to the metadata. We observe that current representation methods for categorical metadata, which are devised for human con-sumption, are not as effective as claimed in popular classification methods, outperformed even by simple concatenation of categorical features in the final layer of the sentence encoder. We conjecture that categorical features are harder to represent for machine use, as available context only indirectly describes the category, and even such context is often scarce (for tail category). To this end, we propose using basis vectors to effectively incor-porate categorical metadata on various parts of a neural-based model. This additionally decreases the number of parameters dramatic-ally, especially when the number of categorical features is large. Extensive experiments on various data sets with different properties are performed and show that through our method, we can represent categorical metadata more effectively to customize parts of the model, including unexplored ones, and increase the performance of the model greatly.
AB - The performance of text classification has improved tremendously using intelligently engineered neural-based models, especially those injecting categorical metadata as additional information, e.g., using user/product information for sentiment classification. This information has been used to modify parts of the model (e.g., word embeddings, attention mech-anisms) such that results can be customized according to the metadata. We observe that current representation methods for categorical metadata, which are devised for human con-sumption, are not as effective as claimed in popular classification methods, outperformed even by simple concatenation of categorical features in the final layer of the sentence encoder. We conjecture that categorical features are harder to represent for machine use, as available context only indirectly describes the category, and even such context is often scarce (for tail category). To this end, we propose using basis vectors to effectively incor-porate categorical metadata on various parts of a neural-based model. This additionally decreases the number of parameters dramatic-ally, especially when the number of categorical features is large. Extensive experiments on various data sets with different properties are performed and show that through our method, we can represent categorical metadata more effectively to customize parts of the model, including unexplored ones, and increase the performance of the model greatly.
UR - http://www.scopus.com/inward/record.url?scp=85150169664&partnerID=8YFLogxK
U2 - 10.1162/tacl_a_00263
DO - 10.1162/tacl_a_00263
M3 - Article
AN - SCOPUS:85150169664
SN - 2307-387X
VL - 7
SP - 201
EP - 215
JO - Transactions of the Association for Computational Linguistics
JF - Transactions of the Association for Computational Linguistics
ER -