Fitzpatrick Institute for Photonics
Pratt School of Engineering
Duke University


Publications [#342165] of Lawrence Carin

Papers Published

  1. Shen, D; Wang, G; Wang, W; Min, MR; Su, Q; Zhang, Y; Li, C; Henao, R; Carin, L, Baseline needs more love: On simple word-embedding-based models and associated pooling mechanisms, ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), vol. 1 (January, 2018), pp. 440-450 [doi]
    (last updated on 2024/12/31)

    Abstract:
    Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations. However, there has not been a rigorous evaluation regarding the added value of sophisticated compositional functions. In this paper, we conduct a point-by-point comparative study between Simple Word-Embedding-based Models (SWEMs), consisting of parameter-free pooling operations, relative to word-embedding-based RNN/CNN models. Surprisingly, SWEMs exhibit comparable or even superior performance in the majority of cases considered. Based upon this understanding, we propose two additional pooling strategies over learned word embeddings: (i) a max-pooling operation for improved interpretability; and (ii) a hierarchical pooling operation, which preserves spatial (n-gram) information within text sequences. We present experiments on 17 datasets encompassing three tasks: (i) (long) document classification; (ii) text sequence matching; and (iii) short text tasks, including classification and tagging.
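    The pooling operations named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' code: the function names (swem_aver, swem_max, swem_hier), the window size, and the toy embedding matrix are assumptions made for illustration, and the hierarchical variant is assumed here to mean average pooling over sliding local windows followed by a global max over the window results.

```python
import numpy as np

def swem_aver(emb):
    """Average pooling: mean of the word embeddings along the sequence axis."""
    return emb.mean(axis=0)

def swem_max(emb):
    """Max pooling: per-dimension maximum over all words in the sequence."""
    return emb.max(axis=0)

def swem_hier(emb, window=3):
    """Hierarchical pooling (assumed form): average each sliding window of
    `window` consecutive words, then take a global max over the windows.
    Averaging inside a window keeps some local (n-gram) word-order signal."""
    length = emb.shape[0]
    if length <= window:
        # Sequence shorter than one window: fall back to plain averaging.
        return emb.mean(axis=0)
    local = np.stack([emb[i:i + window].mean(axis=0)
                      for i in range(length - window + 1)])
    return local.max(axis=0)

# Toy input (hypothetical): 4 words, 2-dimensional embeddings.
emb = np.array([[1.0, 0.0],
                [0.0, 2.0],
                [3.0, 1.0],
                [0.0, 0.0]])

aver_vec = swem_aver(emb)
max_vec = swem_max(emb)
hier_vec = swem_hier(emb, window=2)
```

    None of the three functions has trainable parameters, so in a model built this way the only learned weights are the word embeddings themselves and whatever classifier consumes the pooled vector.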

