Characterization of urban risks in the press applying text mining for the enrichment of open data
PDF (Español (España))
HTML (Español (España))

Keywords

Urban Risk
Text Mining
Open Digital Newspapers
Open Data

How to Cite

Vilches-Blázquez, L. M., & Comesaña Ocampo, D. (2022). Characterization of urban risks in the press applying text mining for the enrichment of open data. Investigación Bibliotecológica. Archivonomía, bibliotecología información, 36(91), 85–107. https://doi.org/10.22201/iibi.24488321xe.2022.91.58538
Métricas de PLUMX

Abstract

News is freely spread and widely available to Internet users much more easily than traditional media. In the news, we can find an infinite number of hidden “minor data,” that can provide valuable information not collected in other sources of information. In this context, we have been interested in analyzing and characterizing the urban risks contained in the Uruguayan open newspapers using text mining techniques. This proposal makes it possible to create a news corpus based on risk events included in open data. The corpus covers 2003-2019 and is built from the digital open newspapers El Eco Digital, Montevideo Portal, and La Red 21. Various text mining techniques are applied to this corpus using the QDA-MinerLite software and the Python language (concretely, through the Scattertext library) to identify, characterize, and discover insights on these events. The corpus processing results help enrich the existing open data on risks in Uruguay, incorporating information on their effects, actors, and associated interventions.

https://doi.org/10.22201/iibi.24488321xe.2022.91.58538
PDF (Español (España))
HTML (Español (España))

Authors:

  • They must sent the publication authorization letter to Investigación Bibliotecológica: archivonomía, bibliotecología e información.
  • They can share the submission with the scientific community in the following ways:
    • As teaching support material
    • As the basis for lectures in academic conferences
    • Self-archiving in academic repositories.
    • Dissemination in academic networks.
    • Posting to author’s blogs and personal websites

These allowances shall remain in effect as long as the conditions of use of the contents of the journal are duly observed pursuant to the Creative Commons:Attribution-NonCommercial-NoDerivatives 4.0 license that it holds. DOI links for download the full text of published papers are provided for the last three uses.

Self-archiving policy

For self-archiving, authors must comply with the following

a) Acknowledge the copyright held by the journal Investigación Bibliotecológica: archivonomía, bibliotecología e información.

b) Establish a link to the original version of the paper on the journal page, using, for example, the DOI.

c) Disseminate the final version published in the journal.

Licensing of contents

The journal Investigación Bibliotecológica: archivonomía, bibliotecología e información allows access and use of its contents pursuant to the Creative Commons license: Attribution- Non-commercial-NoDerivatives 4.0.

Licencia de Creative Commons


Investigación Bibliotecológica: archivonomía, bibliotecología e información by Universidad Nacional Autónoma de México is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 Internacional License.
Creado a partir de la obra en http://rev-ib.unam.mx/ib.

 

This means that contents can only be read and shared as long as the authorship of the work is acknowledged and cited. The work shall not be exploited for commercial ends nor shall it been modified.

Limitation of liability

The journal is not liable for academic fraud or plagiarism committed by authors, nor for the intellectual criteria they employ. Similarly, the journal shall not be liable for the services offered through third party hyperlinks contained in papers submitted by authors.

In support of this position, the journal provides the Author’s Duties notice at the following link: Responsibilities of authors.

The director or editor of the journal shall notify authors in the event it migrates the contents of the journal’s official website to a distinct IP or domain.

 

Downloads

Download data is not yet available.