Study Case: The Database Selection Process for the Big Data-based System to Reduce Health Effects of Air Pollution in Ciudad Juárez, Mexico.

  • Adrián Vásquez Autonomous University of Juarez
  • Fernando Estrada Autonomous University of Juarez
  • Alicia Jiménez Autonomous University of Juarez
  • Angel Nieves Autonomous University of Juarez
  • Nabile Rodríguez Autonomous University of Juarez
  • Israel Hernández Universidad Autonoma de juarez


The Border 2020 is a U.S.-Mexico effort to address binational environmental problems along the border. This project involved the city of El Paso, Texas and Ciudad Juárez, México to improve the transboundary air quality. A large portion of the Ciudad Juarez population resides in areas with very few or none air quality monitoring stations and also people is not educated on the health effects of exposure to air pollution. This motivated an innovative community-based climate monitoring scheme to increase the awareness among people on the effects of air pollutions. The idea was to manufacture a large amount of low-cost air quality sensors, located at different strategic sites to cover a major portion of the city and then to measure and analyze meteorological variables and alert people about outdoor activities when their health is at risk. To achieve this, it was considered a big data-based system to collect, store, analyze and visualize a large amount of data. Selecting the appropriate database software to store large volumes of data is a key element in these projects. Recent advances in storage technology show two main approaches of databases: SQL relational and NoSQL non-relational databases. This paper discusses important factors to consider when selecting the database software for climate data and presents a performance comparison between SQL and NoSQL databases in specific scenarios involving operations such as inserting, deleting and updating a massive volume of both structured and unstructured data.

Keywords: Community Monitoring, Air Pollution, Environmental Quality Index, Databases, Big data.


[1] «EPA (United States Environmental Protection Agency",» 2017 01 19. [En línea]. Available: [Último acceso: 2018 09 28].

[2]«EPA (United States Environmental Protection Agency),» 15 03 2018. [En línea]. Available: [Último acceso: 28 09 2018].

[3] «United States Environmental Protection Agency,» 23 04 2018. [En línea]. Available: [Último acceso: 15 09 2018].

[4] «EPA (United States Environmental Protection Agency),» 23 04 2018. [En línea]. Available: [Último acceso: 10 09 2018].

[5] «INEGI,» [En línea]. Available: [Último acceso: 2018 08 10].

[6] H. J. Watson, «Tutorial: Big Data Analytics: Concepts, Technologies and Applications.,» Communications of the Association for Information Systems:, vol. 34, nº 65, 2014.

[7] T. G., «Usage-Driven Database Design: From Logical Data Modeling through Physical Schema Definition,» New Jersey, Apress, 2017, p. 374.

[8] N. a. S. R. Umanath, Data Modeling and Database Design, Boston: Cengage Learning, 2014.

[9] MongoDB, «Top 5 Considerations When Evaluating,» A MongoDB White Paper , New York, 2015.

[10] N. B. F. D. D. T. a. T. R. Schulz W., «Evaluation of relational and NoSQL database architectures to manage genomic annotations,» Journal of Biomediacal Informatics, vol. 64, pp. Pages 288-295, 2016.
Cómo citar
Vásquez, A., Estrada, F., Jiménez, A., Nieves, A., Rodríguez, N., & Hernández, I. (2019). Study Case: The Database Selection Process for the Big Data-based System to Reduce Health Effects of Air Pollution in Ciudad Juárez, Mexico. Mundo FESC, 9(17), 23-30. Recuperado a partir de