Boosting dynamic ensemble’s performance in Twitter

Costa, Joana; Silva, Catarina; Antunes, Mário; Ribeiro, Bernardete

http://hdl.handle.net/10400.8/14074

Use this identifier to reference this record.

Name:	Description:	Size:	Format:
Boosting dynamic ensemble’s performance in Twitter.pdf	Many text classification problems in social networks, and other contexts, are also dynamic problems, where concepts drift through time, and meaningful labels are dynamic. In Twitter-based applications in particular, ensembles are often applied to problems that fit this description, for example sentiment analysis or adapting to drifting circumstances. While it can be straightforward to request different classifiers' input on such ensembles, our goal is to boost dynamic ensembles by combining performance metrics as efficiently as possible. We present a twofold performance-based framework to classify incoming tweets based on recent tweets. On the one hand, individual ensemble classifiers' performance is paramount in defining their contribution to the ensemble. On the other hand, examples are actively selected based on their ability to effectively contribute to the performance in classifying drifting concepts. The main step of the algorithm uses different performance metrics to determine both each classifier strength in the ensemble and each example importance, and hence lifetime, in the learning process. We demonstrate, on a drifted benchmark dataset, that our framework drives the classification performance considerably up for it to make a difference in a variety of applications.	502.18 KB	Adobe PDF	Download

Send Feedback

Authors

Abstract(s)

Many text classification problems in social networks, and other contexts, are also dynamic problems, where concepts drift through time, and meaningful labels are dynamic. In Twitter-based applications in particular, ensembles are often applied to problems that fit this description, for example sentiment analysis or adapting to drifting circumstances. While it can be straightforward to request different classifiers' input on such ensembles, our goal is to boost dynamic ensembles by combining performance metrics as efficiently as possible. We present a twofold performance-based framework to classify incoming tweets based on recent tweets. On the one hand, individual ensemble classifiers' performance is paramount in defining their contribution to the ensemble. On the other hand, examples are actively selected based on their ability to effectively contribute to the performance in classifying drifting concepts. The main step of the algorithm uses different performance metrics to determine both each classifier strength in the ensemble and each example importance, and hence lifetime, in the learning process. We demonstrate, on a drifted benchmark dataset, that our framework drives the classification performance considerably up for it to make a difference in a variety of applications.

Keywords

Dynamic ensembles Text classification Twitter

URI

http://hdl.handle.net/10400.8/14074

Citation

Costa, J., Silva, C., Antunes, M. et al. Boosting dynamic ensemble’s performance in Twitter. Neural Comput & Applic 32, 10655–10667 (2020). https://doi.org/10.1007/s00521-019-04599-7.