Publicação
EasyBDI: Near Real-Time Data Analytics over Heterogeneous Data Sources
| datacite.subject.fos | Ciências Naturais::Ciências da Computação e da Informação | |
| datacite.subject.sdg | 08:Trabalho Digno e Crescimento Económico | |
| datacite.subject.sdg | 09:Indústria, Inovação e Infraestruturas | |
| datacite.subject.sdg | 10:Reduzir as Desigualdades | |
| dc.contributor.author | Silva, Bruno | |
| dc.contributor.author | Moreira, José | |
| dc.contributor.author | Costa, Rogério Luís de C. | |
| dc.date.accessioned | 2026-05-07T09:36:31Z | |
| dc.date.available | 2026-05-07T09:36:31Z | |
| dc.date.issued | 2021 | |
| dc.description | Link de acesso ao documento - https://openproceedings.org/2021/conf/edbt/p190.pdf | |
| dc.description | Conference date - 23 March 2021 - 26 March 2021; Conference code - 171234 | |
| dc.description.abstract | The large volume of currently available data creates several opportunities for sciences and industry, especially with the application of data analytics. But also raises challenges that make unfeasible the use of batch-based ETL processes. Indeed, near real-time data analytics is a requirement in several domains as an alternative to traditional data warehouses. In the last years, big data platforms have been developed to enable query execution over distributed data sources. However, they do not deal with subject-oriented analysis, do not provide data distribution transparency, or do not assist with schema mapping and integration. In this demonstration, we present EasyBDI. It's a near real-time big data analytics prototype that enables users to run queries over heterogeneous data sources based on global logical abstractions created by the system and provides some usual concepts of data warehouse systems, like facts and dimensions. We use two motivating scenarios, one based on three years of real data on photovoltaic energy production and consumption, and the other based on the SSB+ benchmark. We will also present implementation challenges, issues, solutions, and insights. | eng |
| dc.description.sponsorship | This work is partially funded by National Funds through the FCT (Foundation for Science and Technology) in the context of the projects UIDB/04524/2020, UIDB/00127/2020 and POCI-01-0247-FEDER-024541. | |
| dc.identifier.citation | Silva, B., Moreira, J. M., & Costa, R. L. D. C. (2021). EasyBDI: Near Real-Time Data Analytics over Heterogeneous Data Sources. In EDBT (pp. 702-705). DOI: https://doi.org/10.5441/002/edbt.2021.88. | |
| dc.identifier.doi | 10.5441/002/edbt.2021.88 | |
| dc.identifier.isbn | 978-3-89318-084-4 | |
| dc.identifier.uri | http://hdl.handle.net/10400.8/16246 | |
| dc.language.iso | eng | |
| dc.peerreviewed | yes | |
| dc.publisher | Open Proceedings | |
| dc.relation | Research Center in Informatics and Communications | |
| dc.relation | Institute of Electronics and Informatics Engineering of Aveiro | |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | Distribution transparency | |
| dc.subject | data analytics | |
| dc.subject | near real-time data warehousing | |
| dc.title | EasyBDI: Near Real-Time Data Analytics over Heterogeneous Data Sources | eng |
| dc.type | conference paper | |
| dspace.entity.type | Publication | |
| oaire.awardNumber | UIDB/04524/2020 | |
| oaire.awardNumber | UID/CEC/00127/2013 | |
| oaire.awardTitle | Research Center in Informatics and Communications | |
| oaire.awardTitle | Institute of Electronics and Informatics Engineering of Aveiro | |
| oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F04524%2F2020/PT | |
| oaire.awardURI | http://hdl.handle.net/10400.8/13663 | |
| oaire.citation.conferenceDate | 2021-03 | |
| oaire.citation.conferencePlace | Virtual, Online | |
| oaire.citation.endPage | 705 | |
| oaire.citation.startPage | 702 | |
| oaire.citation.title | Proceedings of the 24th International Conference on Extending Database Technology (EDBT) | |
| oaire.fundingStream | 6817 - DCRRNI ID | |
| oaire.fundingStream | 6817 - DCRRNI ID | |
| oaire.version | http://purl.org/coar/version/c_970fb48d4fbd8a85 | |
| person.familyName | de Carvalho Costa | |
| person.givenName | Rogério Luís | |
| person.identifier.ciencia-id | 7717-9573-0C0F | |
| person.identifier.orcid | 0000-0003-2306-7585 | |
| person.identifier.rid | A-7940-2016 | |
| person.identifier.scopus-author-id | 7801604983 | |
| project.funder.identifier | http://doi.org/10.13039/501100001871 | |
| project.funder.name | Fundação para a Ciência e a Tecnologia | |
| relation.isAuthorOfPublication | 5654d934-3fa0-4afb-9b3b-f2736104924c | |
| relation.isAuthorOfPublication.latestForDiscovery | 5654d934-3fa0-4afb-9b3b-f2736104924c | |
| relation.isProjectOfPublication | 67435020-fe0d-4b46-be85-59ee3c6138c7 | |
| relation.isProjectOfPublication | a81c1f82-40d2-40f8-9bec-058544e9fd17 | |
| relation.isProjectOfPublication.latestForDiscovery | 67435020-fe0d-4b46-be85-59ee3c6138c7 |
Ficheiros
Principais
1 - 1 de 1
A carregar...
- Nome:
- EasyBDI Near real-time data analytics over heterogeneous data sources.pdf
- Tamanho:
- 929.7 KB
- Formato:
- Adobe Portable Document Format
- Descrição:
- The large volume of currently available data creates several opportunities for sciences and industry, especially with the application of data analytics. But also raises challenges that make unfeasible the use of batch-based ETL processes. Indeed, near real-time data analytics is a requirement in several domains as an alternative to traditional data warehouses. In the last years, big data platforms have been developed to enable query execution over distributed data sources. However, they do not deal with subject-oriented analysis, do not provide data distribution transparency, or do not assist with schema mapping and integration. In this demonstration, we present EasyBDI. It's a near real-time big data analytics prototype that enables users to run queries over heterogeneous data sources based on global logical abstractions created by the system and provides some usual concepts of data warehouse systems, like facts and dimensions. We use two motivating scenarios, one based on three years of real data on photovoltaic energy production and consumption, and the other based on the SSB+ benchmark. We will also present implementation challenges, issues, solutions, and insights.
Licença
1 - 1 de 1
Miniatura indisponível
- Nome:
- license.txt
- Tamanho:
- 1.32 KB
- Formato:
- Item-specific license agreed upon to submission
- Descrição:
