Abstract
Advances in digital tools and data collection methods ensure the continuing growth of textual data obtained through large-scale participation processes in urban contexts. To extract the thematic content of such underutilized textual datasets, topic modeling (TM) and content analysis have been deployed as promising AI-based Natural Language Processing (NLP) techniques. Yet these techniques have not been exploited in urban design domains, owing to the complexity of textual datasets and the lack of a systematic evaluation framework. In this paper, we addressed the challenges of utilizing large textual data by using a real-world dataset collected via a digital participation platform in Madrid, Spain. First, we identified the prominent data structures and potential information embedded in the dataset by using a document-oriented NoSQL database. In this step, we systematically discussed the pre-processing steps that convert the data into a series of structured data collections. Second, we evaluated three TM algorithms, i.e., LDA, LSI, and HDP, across a number of hyperparameters controlling the learning process. This step aimed to reveal the number of topics required to extract meaningful content through the algorithms. Lastly, we presented possible textual data visualization techniques to enable the use of textual information in digital participation processes. Consequently, this paper facilitates the use of large textual datasets by investigating data structures and processing, revealing the potential of different TM algorithms, and analyzing the results with the support of urban big data analytics and computational linguistic techniques for informed urban design processes.
| Original language | English |
|---|---|
| Title | Computer-Aided Architectural Design. INTERCONNECTIONS : Co-computing Beyond Boundaries |
| Subtitle | 20th International Conference, CAAD Futures 2023, Delft, The Netherlands, July 5–7, 2023, Selected Papers |
| Editors | Michela Turrin, Charalampos Andriotis, Azarakhsh Rafiee |
| Place of publication | Cham |
| Publisher | Springer |
| Pages | 271-286 |
| Number of pages | 16 |
| ISBN (electronic) | 978-3-031-37189-9 |
| ISBN (print) | 978-3-031-37188-2 |
| DOIs | |
| Status | Published - 5 Jul 2023 |
| Published externally | Yes |
| Event | CAAD Futures 2023 - TU Delft, Delft, Netherlands. Duration: 5 Jul 2023 → 7 Jul 2023 |
Publication series
| Name | Communications in Computer and Information Science (CCIS) |
|---|---|
| Volume | 1819 |
| ISSN (print) | 1865-0929 |
| ISSN (electronic) | 1865-0937 |
Conference
| Conference | CAAD Futures 2023 |
|---|---|
| Country/Region | Netherlands |
| City | Delft |
| Period | 5/07/23 → 7/07/23 |
Funding
This research is supported by the “Designing mobile-friendly cartograms for visualising geospatial data” grant from the Ministry of Education, Singapore, under its Academic Research Fund Tier 2 programme (award number MOE-T2EP20221-0007), and by the Singapore International Graduate Award (SINGA).
Fingerprint
Dive into the research topics of 'Transforming Large-Scale Participation Data Through Topic Modelling in Urban Design Processes'. Together they form a unique fingerprint.