Mostrar el registro sencillo del ítem
Kafka-ML: Connecting the data stream with ML/AI frameworks
dc.contributor.author | Martín-Fernández, Cristian | |
dc.contributor.author | Langendoerfer, Peter | |
dc.contributor.author | Soltani Zarrin, Pouya | |
dc.contributor.author | Díaz-Rodríguez, Manuel | |
dc.contributor.author | Rubio-Muñoz, Bartolomé | |
dc.date.accessioned | 2021-08-24T06:07:52Z | |
dc.date.available | 2021-08-24T06:07:52Z | |
dc.date.issued | 2022-01-01 | |
dc.identifier.citation | Future Generation Computer Systems, Volume 126, January 2022, Pages 15-33 | es_ES |
dc.identifier.uri | https://hdl.handle.net/10630/22726 | |
dc.description.abstract | Machine Learning (ML) and Artificial Intelligence (AI) depend on data sources to train, improve, and make predictions through their algorithms. With the digital revolution and current paradigms like the Internet of Things, this information is turning from static data to continuous data streams. However, most of the ML/AI frameworks used nowadays are not fully prepared for this revolution. In this paper, we propose Kafka-ML, a novel and open-source framework that enables the management of ML/AI pipelines through data streams. Kafka-ML provides an accessible and user-friendly Web user interface where users can easily define ML models, to then train, evaluate, and deploy them for inferences. Kafka-ML itself and the components it deploys are fully managed through containerization technologies, which ensure their portability, easy distribution, and other features such as fault-tolerance and high availability. Finally, a novel approach has been introduced to manage and reuse data streams, which may eliminate the need for data storage or file systems. | es_ES |
dc.description.sponsorship | This work is funded by the Spanish projects RT2018-099777-B-100 (“rFOG: Improving Latency and Reliability of Offloaded Computation to the FOG for Critical Services”), PY20_00788 (“IntegraDos: Providing Real-Time Services for the Internet of Things through Cloud Sensor Integration”) and UMA18FEDERJA-215 (“Advanced Monitoring System Based on Deep Learning Services in Fog”). Cristian Martín was with a postdoc grant from the Spanish project TIC-1572 (“MIsTIca: Critical Infrastructures Monitoring based on Wireless Technologies”) and his research stay at IHP has been funded through a mobility grant from the University of Malaga and IHP funding. Funding for open access charge: Universidad de Malaga/CBUA . We are grateful for the work of all the reviewers who have greatly contributed to improving the quality of this article. We would like to express our gratitude to Kai Wähner for his inspiration and ideas through numerous articles and GitHub repositories on Kafka and its combination with TensorFlow. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Elsevier | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.subject | Inteligencia artificial | es_ES |
dc.subject.other | Kafka-ML | es_ES |
dc.subject.other | Apache Kafka | es_ES |
dc.subject.other | Machine Learning | es_ES |
dc.subject.other | Artificial Intelligence | es_ES |
dc.subject.other | Data streams | es_ES |
dc.subject.other | Docker | es_ES |
dc.subject.other | Kubernetes | es_ES |
dc.subject.other | Distributed systems | es_ES |
dc.title | Kafka-ML: Connecting the data stream with ML/AI frameworks | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.centro | E.T.S.I. Informática | es_ES |
dc.identifier.doi | https://doi.org/10.1016/j.future.2021.07.037 | |
dc.type.hasVersion | info:eu-repo/semantics/publishedVersion | es_ES |
dc.departamento | Lenguajes y Ciencias de la Computación |