Exploring open government data and big data from a quality perspective.
Abstract
Objective: To establish the key elements in the relationship between big data and open government data, from a quality perspective. Methodology: The authors conducted an exploratory literature review to determine the lines of quality relationship between big data and open government data, emphasizing Colombia´s case. Results: The concept of quality is a common factor for big data and open government data, establishing the benefits, such as innovation, transparency, and economic growth, as well as the challenges related to quality aspects, such as guaranteeing the reliability of the origin of the data, facilitating the understanding of the data, and establishing quality standards. Conclusions: The relationship between big data and open government data from the perspective of data quality allows to exploit the potential immersed in the data, contribute significantly to the construction of knowledge, and thus provide answers to different problems or phenomena
References
Attard, J., Orlandi, F., & Auer, S. (2016). Value creation on open government data. In Proceedings of the 2016 49th Hawaii International Conference on System Sciences (HICSS) (pp. 2605-2614). IEEE Computer Society https://doi.org/10.1109/HICSS.2016.326
Attard, J., Orlandi, F., Scerri, S., & Auer, S. (2015). A systematic review of open government data initiatives. Government Information Quarterly, 32(4), 399-418. https://doi.org/10.1016/j.giq.2015.07.006
BSA The Software Alliance. (2017). ¿Por qué son tan importantes los datos? https://data.bsa.org/wp-content/uploads/2015/10/BSADataStudy_es.pdf
Cai, L., & Zhu, Y. (2015). The challenges of data quality and data quality assessment in the big data era. Data Science Journal, 14(0), 2. https://doi.org/10.5334/dsj-2015-002
Caro, A., Fuentes, A., & Soto, A. M. (2013). Desarrollando sistemas de información centrados en la calidad de datos. Ingeniare, 21(1), 54-69. https://doi.org/10.4067/s0718-33052013000100006
Ciancarini, P., Poggi, F., & Russo, D. (2016). Big data quality: A roadmap for open data. In Proceedings of the 2016 IEEE 2nd International Conference on Big Data Computing Service and Applications, BigDataService 2016 (pp. 210-215). Publisher IEEE. https://doi.org/10.1109/BigDataService.2016.37
Cooper, H. M. (1988). Organizing knowledge syntheses: A taxonomy of literature reviews. Knowledge in Society, 1(1), 104-126. https://link.springer.com/article/10.1007%2FBF03177550
Gandomi, A., & Haider, M. (2015). Beyond the hype: Big data concepts, methods, and analytics. International Journal of Information Management, 35(2), 137-144. https://doi.org/10.1016/j.ijinfomgt.2014.10.007
International Organization for Standardization (2019). ISO/IEC 25000:2014
Systems and software engineering — Systems and software Quality Requirements and Evaluation (SQuaRE) — Guide to SQuaRE https://www.iso.org/obp/ui#iso:std:iso-iec:25000:ed-2:v1:en
Kalampokis, E., Tambouris, E., & Tarabanis, K. (2011). Open government data: A stage model. In M. Janssen, H. J. Scholl, M. A. Wimmer & F. Bannister (Eds.), Electronic Government EGOV 2014, Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (vol. 6846, pp. 235-246). Springer. https://doi.org/10.1007/978-3-642-22878-0_20
Kitchenham, B., Pearl Brereton, O., Budgen, D., Turner, M., Bailey, J., & Linkman, S. (2009). Systematic literature reviews in software engineering - A systematic literature review. Information and Software Technology, 51(1), 7–15. https://doi.org/10.1016/j.infsof.2008.09.009
Koltay, T. (2020). Quality of open research data: Values, convergences, and governance. Information, 11(4), 175. https://doi.org/10.3390/info11040175
Kucera, J., & Chlapek, D. (2014). Benefits and Risks of Open Government Data. Journal of Systems Integration, 5(1), 30–41. https://doi.org/http://dx.doi.org/10.20470/jsi.v5i1.185
Kucera, J. (2015). Open government data publication methodology. Journal of Systems Integration, 6(2). https://doi.org/10.20470/jsi.v6i2.231
Loshin, D. (2014). Understanding big data quality for maximum information usability. [White paper] SAS www.dataqualitybook.com
Maestre-Gongora, G. P., & Bernal, W. N. (2019). Conceptual Model of Information Technology Management for Smart Cities: SmarTICity. Journal of Global Information Management (JGIM), 27(2), 159-175. http://doi.org/10.4018/JGIM.2019040109
Maestre Góngora, Gina Paola, & Nieto Bernal, Wilson. (2015). Factores Clave en la Gestión de Tecnología de Información para Sistemas de Gobierno Inteligente. Journal of technology management & innovation, 10(4), 109-117. https://dx.doi.org/10.4067/S0718-27242015000400012
Maestre-Góngora , G. ., Rangel-Carrillo, A., & Osorio-Sanabria, M. . (2021). El valor de los datos abiertos de gobierno: un enfoque desde la evaluación de calidad. Revista de Investigación, Desarrollo e Innovación, 11(3), 507–518. https://doi.org/10.19053/20278306.v11.n3.2021.13348
Mahecha, J. F., López, N. E., & Velandia, J. A. (2018). Assessing data quality in open data: A case study. In Proceedings of the 2017 Congreso Internacional de Innovacion y Tendencias En Ingenieria, CONIITI 2017 IEEE (Vol 1, pp. 1-5). https://doi.org/10.1109/CONIITI.2017.8273343
Marsh, R. (2005). Drowning in dirty data? It’s time to sink or swim: A four-stage methodology for total data quality management. Journal of Database Marketing & Customer Strategy Management, 12(2), 105-112. https://doi.org/10.1057/palgrave.dbm.3240247
Martin, S., Foulonneau, M., Turki, S., & Ihadjadene, M. (2013). Risk analysis to overcome barriers to open data. Electronic Journal of e-Government, 11(1), 348-359 https://academic-publishing.org/index.php/ejeg/article/view/576/539
Merino, J., Caballero, I., Rivas, B., Serrano, M., & Piattini, M. (2016). A data quality in use model for big data. Future Generation Computer Systems, 63, 123-130. https://doi.org/10.1016/j.future.2015.11.024
Miloslavskaya, N., & Tolstoy, A. (2016). Big data, fast data, and data lake concepts. Procedia Computer Science, 88, 300-305. https://doi.org/10.1016/j.procs.2016.07.439
Ministerio de Tecnologías de la Información y las Comunicaciones. (2016a). Mapa de Ruta Guía de datos abiertos en Colombia. https://estrategia.gobiernoenlinea.gov.co/623/articles-9404_recurso_1.pdf
Ministerio de Tecnologías de la Información y las Comunicaciones. (2016b). Guía de estándares de calidad e interoperabilidad de los datos abiertos del gobierno de Colombia.https://herramientas.datos.gov.co/sites/default/files/2020-11/A_guia_de_estandares_final_0.pdf
Ministerio de Tecnologías de la Información y las Comunicaciones. (2019a). WebSite Datos abiertos Colombia. www.datos.gov.co
Ministerio de Tecnologías de la Información y las Comunicaciones. (2019b). Guía para el uso y aprovechamiento de datos abiertos en Colombia. https://gobiernodigital.gov.co/623/articles-9407_guia_datos.pdf
Ministerio de Tecnologías de la Información y las Comunicaciones. (2019c). Requisitos de calidad para datos abiertos. https://sellodeexcelencia.gov.co/documents/UTSF_SDE_Requisitos_de_calidad_para_datos_abiertos_2019_12_02_v_2_0.pdf
Muente-Kunigami, A., & Serale, F. (2018). Los datos abiertos en América Latina y el Caribe. Los Datos Abiertos En América Latina y El Caribe. https://doi.org/10.18235/0001202
Mukherjee, S., & Shaw, R. (2016). Big data-concepts, applications, challenges, and future scope. International Journal of Advanced Research in Computer and Communication Engineering, 5(2). https://doi.org/10.17148/IJARCCE.2016.5215
Munné, R. (2016). Big data in the public sector. In J. Cavanillas, E. Curry & W. Wahlster (Eds.), New Horizons for a Data-Driven Economy, (pp. 195-208). Springer. https://doi.org/10.1007/978-3-319-21569-3_11
Power Data. (2019). Big data: ¿En qué consiste? Su importancia, desafíos, y gobernabilidad. https://www.powerdata.es/big-data
Rangel-Carrillo, A. M., Maestre-Góngora, G. P., & Osorio-Sanabria, M. A. (2020). Principios, lineamientos, dimensiones y atributos para la evaluación de calidad de Datos Abiertos de Gobierno. Aibi Revista De investigación, administración E ingeniería, 8(S1), 54-65. https://doi.org/10.15649/2346030X.950
Redman, T. (2016, September 22). Bad data costs the U.S. $3 trillion per year. Harvard Business Review. https://hbr.org/2016/09/bad-data-costs-the-u-s-3-trillion-per-year
Russom, P. (2011). Big data analytics. https://tdwi.org/research/2011/09/~/media/TDWI/TDWI/Research/BPR/2011/TDWI_BPReport_Q411_Big_Data_Analytics_Web/TDWI_BPReport_Q411_Big%20Data_ExecSummary.ashx
Osorio-Sanabria, M. A., Amaya-Fernández, F. O., & González-Zabala, M. P. (2020). Políticas, normas y estrategias que fomentan los datos abiertos en Colombia: un análisis de literatura. Revista Virtual Universidad Católica Del Norte, (62), 155–188. https://doi.org/10.35575/rvucn.n62a7
Talukder, M. S., Shen, L., Hossain Talukder, M. F., & Bao, Y. (2019). Determinants of user acceptance and use of open government data (OGD): An empirical investigation in Bangladesh. Technology in Society, 56, 147-156. https://doi.org/10.1016/j.techsoc.2018.09.013
TodoBI. (2019, October 25). 11 Consejos sobre bad data: El enemigo silencioso en business intelligece y big data. https://www.todobi.com/11-consejos-sobre-bad-data-el-enemigo/
Torres Saumeth, K., Ruiz Afanador, T., Solís Ospino, L., & Martínez Barraza, F. (2012). Calidad y su evolución: una revisión [Quality and its evolution: A review]. Dimensión Empresarial, 10(2), 100-107. https://doi:10.15665/rde.v10i2.213
United Nations Economic Commission for Europe. (2014). A suggested framework for the quality of big data deliverables of the UNECE big data quality task team. https://statswiki.unece.org/download/attachments/108102944/Big%20Data%20Quality%20Framework%20-%20final-%20Jan08-2015.pdf?version=1&modificationDate=1420725063663&api=v2
Wahyudi, A., Kuk, G., & Janssen, M. (2018). A process pattern model for tackling and improving big data quality. Information Systems Frontiers, 20, 457-469. https://doi.org/10.1007/s10796-017-9822-7
Williams, D., & Tang, H. (2020). Data quality management for industry 4.0: A survey. https://asq.org/quality-resources/articles/data-quality-management-for-industry?id=0c3073f0489d45a6891309b94261efab
Yi, M. (2018). Exploring the quality of government open data: Comparison study of the UK, the USA, and Korea. The Electronic Library, 37(1), 35-48. https://doi.org/10.1108/EL-06-2018-0124
Zuiderwijk, A. (2017). Analysing open data in virtual research environments: New collaboration opportunities to improve policy making. International Journal of Electronic Government Research, 13(4), 76-92. https://doi.org/10.4018/IJEGR.2017100105
Zuiderwijk, A., Janssen, M., & Susha, I. (2016). Improving the speed and ease of open data use through metadata, interaction mechanisms, and quality indicators. Journal of Organizational Computing and Electronic Commerce, 26(1-2), 116-146. https://doi.org/10.1080/10919392.2015.1125180
Downloads
Copyright (c) 2023 Revista Colombiana de Computación

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Funding data
-
Universidad Cooperativa de Colombia
Grant numbers CONADI INV 3139










