| Name: | Description: | Size: | Format: | |
|---|---|---|---|---|
| 428.25 KB | Adobe PDF |
Advisor(s)
Abstract(s)
Drift is a given in most machine learning applications. The idea that models must accommodate for changes, and thus be dynamic, is ubiquitous. Current challenges include temporal data streams, drift and non-stationary scenarios, often with text data, whether in social networks or in business systems. There are multiple drift patterns types: concepts that appear and disappear suddenly, recurrently, or even gradually or incrementally. Researchers strive to propose and test algorithms and techniques to deal with drift in text classification, but it is difficult to find adequate benchmarks in such dynamic environments. In this paper we present DOTS, Drift Oriented Tool System, a framework that allows for the definition and generation of text-based datasets where drift characteristics can be thoroughly defined, implemented and tested. The usefulness of DOTS is presented using a Twitter stream case study. DOTS is used to define datasets and test the effectiveness of using different document representation in a Twitter scenario. Results show the potential of DOTS in machine learning research.
Description
Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9492))
Source title
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics).
Conference name 22nd International Conference on Neural Information Processing, ICONIP 2015
Conference city: Istanbul Conference date: 9 November 2015 - 12 November 2015
Conference name 22nd International Conference on Neural Information Processing, ICONIP 2015
Conference city: Istanbul Conference date: 9 November 2015 - 12 November 2015
Keywords
Drift Learning algorithms Software tool Text classification
Pedagogical Context
Citation
Costa, J., Silva, C., Antunes, M., Ribeiro, B. (2015). DOTS: Drift Oriented Tool System. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9492. Springer, Cham. https://doi.org/10.1007/978-3-319-26561-2_72
Publisher
Springer Nature
CC License
Without CC licence
