This repository is the official implementation of Global urban visual perception varies across demographics and personalities.
To install requirements:
python install -r requirements.txt
We provide the Street Perception Evaluation Considering Socioeconomics (SPECS) dataset in hugging face.
This work uses a subset of images from the Global Streetscapes dataset.
After downloading its contents in data/ the folder should have the following subfolders:
data
├── abuja/
├── 10 CSV files with contextual information for these images
├── global-streetscapes/ (download contextual data from original [Global Streetscapes](https://huggingface.co/datasets/NUS-UAL/global-streetscapes))
├── labels/
├── final/ (6 XLSX files with final pairwise comparisons)
├── inferences/ (6 CSV files with inferred perception scores using the perception model used in [Global Streetscapes](https://huggingface.co/datasets/NUS-UAL/global-streetscapes))
├── processed/ (10 CSV files with computed perception Q scores)
├── svi/ (SVIs should be downloaded following the [wiki](https://github.com/matqr/specs/wiki) or with the metadata file)
├── img_paths.csv (path location )
├── metadata.csv (file with imagery metadata)
├── visual_complexity_all.csv
The wiki in this repository has the steps to reproduce the image subset used.
Survey design guidelines proposed by Gu et al. 2025.
| Section | Reporting Parameters | Ours |
|---|---|---|
| Survey Structure | Design, allocation, consistency evaluation | Our survey consisted of a demographic section of 11 questions and 50 pairwise comparison questions. |
| Raters | Demographics, sample size, number of ratings per image, recruitment process | We recruited 1,000 respondents (200 from each country) who reside in the city, or neighboring cities, where the images were extracted -- Santiago, greater Amsterdam area, Nigeria, Singapore, and San Francisco and Santa Clara -- and are above 21 years old. An equal distribution of respondents, based on annual income range, was prioritised for all but Chile and Nigeria. The gender ratio is 1:1 within countries, and the age group ratio is roughly 1:1:1:2 for 21-29, 30-39, 40-49, and above 50. More than 78% of participants from each country have lived there for more than five years, and less than 5% have lived there for less than a year. Background information from all participants is shown in Appendix Table: participants-bg in Appendix: participants-bg. Each participant completed 50 ratings, with each image receiving an average of 25 ratings (min. 10, max. 46) per indicator. |
| Scoring | Scoring methodology, rationale for method selection, evaluative indicators | We selected pairwise comparison as the ranking scoring method due to its proven stability over and fairly quick assessment. On the online survey, participants were asked to choose their preference between two images given an indicator. To avoid a forced choice scenario that could lead to bias (forced-choice), and to be consistent with previous and future studies, the option for an equally rated preference (i.e., "Both are the same to me") was shown too. We asked participants to rate a random pair of images five times for each of the 10 indicators, making up the 50 ratings per participant. Fig. screenshot-ui shows a screenshot of the online survey. |
| Image | Image type, image source, rationale for image choice, dataset composition | We chose perspective images with good quality, no reflection, non-panoramic, captured in clear weather, and taken during the day with front or back view direction. These types of images have proven rating stability (Gu 2025) and are easily obtained from open data sources. Using the Global Streetscapes, a large open, labelled, processed, and worldwide street-level imagery dataset (Hou et al. 2024); we obtained 400 images, 80 from each city -- Santiago, Amsterdam, Abuja, Singapore, and San Francisco, for subjective ratings in the survey. |
The notebook 6_preprocessing.ipynb takes the collected raw data, creates the demographic subgroups, computes the personality scores, and formats .
Notebooks with the prefix `7' analyse the statistical differences (Welch's Anova and Games Howell posthoc test) between demographic and personalities groups. The list of demographics subgroups analysed within countries are
- Gender
- Age group
- Annual Household Income (AHI)
- Education
- Race and ethnicity
- Big 5 personality
Groups are made based on the specific demographic categories, see the Supplementary Information in the paper for more details, as well as interactions between demographic groups. For personalities, only responses form participants whose personality score is in the 25th quantile and 75th quantile are used for comparisons.
The .ipynb notebooks perform the Q Score calculations and generates the csv files and the R scripts perform the statistical tests and generate the plots used in the paper.
A model pretrained on the PlacePulse2.0 (PP2), referred to as ViT-PP2, dataset is used to infer the perception values of the SVI on this dataset.
These inferred scores and compared against the dataset scores for each city.
The script 8_inference.py is the same codebase used in the Global Streetscapes.
Run
python 8_inference.py
The notebook 9a_multiplecitySVI-singlecityparticipant.ipynb computes the perception Q scores based on participants' location.
The full statistical analysis and figure used on the paper are done with 9a_multiplecitySVI-singlecityparticipant_plots.R.
The notebook 9b_singlecitySVI-multiplecityparticipant.ipynb computes the perception Q scores using all participants' ratings.
The full statistical analysis and figure used on the paper are done with 9b_singlecitySVI-multiplecityparticipant_plots.R.
The notebook 9c_perceptual-variations.ipynb computes the perception Q scores based on whether the participants are residents of the images' locations or non-residents.
The full statistical analysis and figures used on the paper are done with 9c_perceptual-variations_plots.R.
All previous three plots are then merged and polished with 9d_merging_plots.R.
The statistical analyses and plots are ran with 10_indicators-correlations_plots.R (Pearson's correlation) and 10a_indicators-correlations-u-shape.R (polynomial regressions).
If you use our dataset or work, please cite the paper:
Quintana M, Gu Y, Liang X, Hou Y, Ito K, Zhu Y, Abdelrahman M, Biljecki F (2025): Global urban visual perception varies across demographics and personalities. Nature Cities 2(11): 1092-1106. doi: 10.1038/s44284-025-00330-x
BibTeX:
@article{quintana2025,
title = {Global Urban Visual Perception Varies across Demographics and Personalities},
author = {Quintana, Matias and Gu, Youlong and Liang, Xiucheng and Hou, Yujun and Ito, Koichi and Zhu, Yihan and Abdelrahman, Mahmoud and Biljecki, Filip},
year = {2025},
journal = {Nature Cities},
volume = {2},
number = {11},
pages = {1092--1106},
issn = {2731-9997},
doi = {10.1038/s44284-025-00330-x},
url = {https://www.nature.com/articles/s44284-025-00330-x},
}
Besides the published paper, a free version (postprint / author-accepted manuscript) can be downloaded here.

