- Add profile of 6,126 people in collective dwellings (v4002=63) with demographics: gender, race, education, age, civil status - Add detailed analysis of 503 minors: 349 likely prisoners (v0502=20), 154 dependents of staff/prisoners - Add breakdown of female prisoners: higher education and whiter than male prisoners - Fix language inconsistencies (Spanish, Chinese, English terms) - Add documentation for br_ibge_censo_2022 setor_censitario (v* variables) - Add documentation for prison population identification across census datasets
4.8 KiB
br_ibge_censo_2022.setor_censitario
Overview
The setor_censitario table contains aggregated census data at the census tract (setor censitário) level from Brazil's 2022 Demographic Census (Censo Demográfico 2022), published by IBGE.
The table has 1,411 raw v* columns (v00001 through v01411) plus 7 named alias columns. None of the v* columns have descriptions in the basedosdados-schema.json context file.
Named Columns (aliases for V0001–V0007)
These are human-readable aliases pointing to the basic dictionary:
| Schema Column | IBGE Code | Description |
|---|---|---|
pessoas |
V0001 | Total de pessoas |
domicilios |
V0002 | Total de Domicílios (DPPO + DPPV + DPPUO + DPIO + DCCM + DCSM) |
domicilios_particulares |
V0003 | Total de Domicílios Particulares (DPPO + DPPV + DPPUO + DPIO) |
domicilios_coletivos |
V0004 | Total de Domicílios Coletivos (DCCM + DCSM) |
media_moradores_domicilios |
V0005 | Média de moradores em Domicílios Particulares Ocupados |
porcentagem_domicilios_imputados |
V0006 | Percentual de Domicílios Particulares Ocupados Imputados |
domicilios_particulares_ocupados |
V0007 | Total de Domicílios Particulares Ocupados (DPPO + DPIO) |
DPPO = Domicílios Particulares Permanentes Ocupados
DPPV = Domicílios Particulares Permanentes Vagos
DPPUO = Domicílios Particulares de Uso Ocasional
DPIO = Domicílios Particulares Improvisados Ocupados
DCCM = Domicílios Coletivos com Morador
DCSM = Domicílios Coletivos sem Morador
Raw v* Columns (V00001–V01411)
These are the 1,411 detailed aggregated census variables. They cover 8 major themes:
| Range | Theme | Count |
|---|---|---|
| V00001–V00089 | Características do Domicílio – Parte 1 | 89 |
| V00090–V00495 | Características do Domicílio – Parte 2 (crosstabs) | 406 |
| V00496–V00643 | Características do Domicílio – Parte 3 | 148 |
| V00644–V01005 | Alfabetização | 362 |
| V01006–V01041 | Demografia | 36 |
| V01042–V01223 | Parentesco | 182 |
| V01224–V01316 | Óbitos (2019-2022) | 93 |
| V01317–V01411 | Cor ou Raça | 95 |
Theme Details
V00001–V00089: Características do Domicílio – Parte 1 Type of dwelling, number of residents, rooms, bathrooms, sanitation, water supply, electricity, waste collection, appliances, etc.
V00090–V00495: Características do Domicílio – Parte 2 Detailed dwelling characteristics cross-tabulated by type of dwelling and race/color of the responsible person.
V00496–V00643: Características do Domicílio – Parte 3 More detailed dwelling characteristics.
V00644–V01005: Alfabetização Literacy rates by age group, sex, race/color, and other demographics.
V01006–V01041: Demografia Population demographics (age, sex distribution).
V01042–V01223: Parentesco Kinship/relationship structures within households.
V01224–V01316: Óbitos Deaths in the household (reference period 2019-2022).
V01317–V01411: Cor ou Raça Race/ethnicity breakdown of the population.
Special Populations (Separate Variable Ranges)
In addition to the 1,411 base variables, IBGE publishes separate dictionaries for:
- PCT – Indígenas (V01500–V02xxx): 1,029 variables for Indigenous populations
- PCT – Quilombolas (V03000–V03xxx): 951 variables for Quilombola populations
These are stored in separate sheets in the IBGE dictionary file.
Where to Find Full Variable Descriptions
Official IBGE Dictionary
Download the official dictionary Excel file:
https://ftp.ibge.gov.br/Censos/Censo_Demografico_2022/Agregados_por_Setores_Censitarios/dicionario_de_dados_agregados_por_setores_censitarios_20250417.xlsx
It contains 5 sheets:
- Dicionario Basico (V0001–V0007): Core counters — these map to the named schema columns
- Siglas Basico: Abbreviations for the basic variables
- Dicionario nao PCT (V00001–V01411): The main detailed variable dictionary
- Dicionario PCT - Indigenas (V01500–V02xxx): Indigenous population variables
- Dicionario PCT - Quilombolas (V03000–V03xxx): Quilombola population variables
Other Sources
- basedosdados website: https://basedosdados.org/dataset/br-ibge-censo-2022
- IBGE SIDRA: https://sidra.ibge.gov.br (search "Agregados por Setores Censitários")
- IBGE Census 2022 page: https://www.ibge.gov.br/estatisticas/sociais/populacao/28740-censo-demografico-2022.html
Notes
- The
basedosdados-schema.jsoncontext file lists these columns as{"name":"v00001","type":"INTEGER"}with no description field — this is a known documentation gap. - The
br_ibge_censo_2022.dicionariotable in the DuckDB only contains 30 entries forcadastro_enderecos— the 1,411 sector-level variable descriptions are missing from it. - For the 2010 census (
br_ibge_censo_demografico), descriptions are included in the schema for most tables.