- Add profile of 6,126 people in collective dwellings (v4002=63) with demographics: gender, race, education, age, civil status - Add detailed analysis of 503 minors: 349 likely prisoners (v0502=20), 154 dependents of staff/prisoners - Add breakdown of female prisoners: higher education and whiter than male prisoners - Fix language inconsistencies (Spanish, Chinese, English terms) - Add documentation for br_ibge_censo_2022 setor_censitario (v* variables) - Add documentation for prison population identification across census datasets
108 lines
4.8 KiB
Markdown
108 lines
4.8 KiB
Markdown
# br_ibge_censo_2022.setor_censitario
|
||
|
||
## Overview
|
||
|
||
The `setor_censitario` table contains aggregated census data at the **census tract (setor censitário)** level from Brazil's 2022 Demographic Census (Censo Demográfico 2022), published by IBGE.
|
||
|
||
The table has **1,411 raw `v*` columns** (`v00001` through `v01411`) plus 7 named alias columns. None of the `v*` columns have descriptions in the `basedosdados-schema.json` context file.
|
||
|
||
## Named Columns (aliases for V0001–V0007)
|
||
|
||
These are human-readable aliases pointing to the basic dictionary:
|
||
|
||
| Schema Column | IBGE Code | Description |
|
||
|---|---|---|
|
||
| `pessoas` | V0001 | Total de pessoas |
|
||
| `domicilios` | V0002 | Total de Domicílios (DPPO + DPPV + DPPUO + DPIO + DCCM + DCSM) |
|
||
| `domicilios_particulares` | V0003 | Total de Domicílios Particulares (DPPO + DPPV + DPPUO + DPIO) |
|
||
| `domicilios_coletivos` | V0004 | Total de Domicílios Coletivos (DCCM + DCSM) |
|
||
| `media_moradores_domicilios` | V0005 | Média de moradores em Domicílios Particulares Ocupados |
|
||
| `porcentagem_domicilios_imputados` | V0006 | Percentual de Domicílios Particulares Ocupados Imputados |
|
||
| `domicilios_particulares_ocupados` | V0007 | Total de Domicílios Particulares Ocupados (DPPO + DPIO) |
|
||
|
||
DPPO = Domicílios Particulares Permanentes Ocupados
|
||
DPPV = Domicílios Particulares Permanentes Vagos
|
||
DPPUO = Domicílios Particulares de Uso Ocasional
|
||
DPIO = Domicílios Particulares Improvisados Ocupados
|
||
DCCM = Domicílios Coletivos com Morador
|
||
DCSM = Domicílios Coletivos sem Morador
|
||
|
||
## Raw `v*` Columns (V00001–V01411)
|
||
|
||
These are the **1,411 detailed aggregated census variables**. They cover 8 major themes:
|
||
|
||
| Range | Theme | Count |
|
||
|---|---|---|
|
||
| V00001–V00089 | Características do Domicílio – Parte 1 | 89 |
|
||
| V00090–V00495 | Características do Domicílio – Parte 2 (crosstabs) | 406 |
|
||
| V00496–V00643 | Características do Domicílio – Parte 3 | 148 |
|
||
| V00644–V01005 | Alfabetização | 362 |
|
||
| V01006–V01041 | Demografia | 36 |
|
||
| V01042–V01223 | Parentesco | 182 |
|
||
| V01224–V01316 | Óbitos (2019-2022) | 93 |
|
||
| V01317–V01411 | Cor ou Raça | 95 |
|
||
|
||
### Theme Details
|
||
|
||
**V00001–V00089: Características do Domicílio – Parte 1**
|
||
Type of dwelling, number of residents, rooms, bathrooms, sanitation, water supply, electricity, waste collection, appliances, etc.
|
||
|
||
**V00090–V00495: Características do Domicílio – Parte 2**
|
||
Detailed dwelling characteristics cross-tabulated by type of dwelling and race/color of the responsible person.
|
||
|
||
**V00496–V00643: Características do Domicílio – Parte 3**
|
||
More detailed dwelling characteristics.
|
||
|
||
**V00644–V01005: Alfabetização**
|
||
Literacy rates by age group, sex, race/color, and other demographics.
|
||
|
||
**V01006–V01041: Demografia**
|
||
Population demographics (age, sex distribution).
|
||
|
||
**V01042–V01223: Parentesco**
|
||
Kinship/relationship structures within households.
|
||
|
||
**V01224–V01316: Óbitos**
|
||
Deaths in the household (reference period 2019-2022).
|
||
|
||
**V01317–V01411: Cor ou Raça**
|
||
Race/ethnicity breakdown of the population.
|
||
|
||
## Special Populations (Separate Variable Ranges)
|
||
|
||
In addition to the 1,411 base variables, IBGE publishes separate dictionaries for:
|
||
|
||
- **PCT – Indígenas** (V01500–V02xxx): 1,029 variables for Indigenous populations
|
||
- **PCT – Quilombolas** (V03000–V03xxx): 951 variables for Quilombola populations
|
||
|
||
These are stored in separate sheets in the IBGE dictionary file.
|
||
|
||
## Where to Find Full Variable Descriptions
|
||
|
||
### Official IBGE Dictionary
|
||
|
||
Download the official dictionary Excel file:
|
||
|
||
```
|
||
https://ftp.ibge.gov.br/Censos/Censo_Demografico_2022/Agregados_por_Setores_Censitarios/dicionario_de_dados_agregados_por_setores_censitarios_20250417.xlsx
|
||
```
|
||
|
||
It contains 5 sheets:
|
||
- **Dicionario Basico** (V0001–V0007): Core counters — these map to the named schema columns
|
||
- **Siglas Basico**: Abbreviations for the basic variables
|
||
- **Dicionario nao PCT** (V00001–V01411): The main detailed variable dictionary
|
||
- **Dicionario PCT - Indigenas** (V01500–V02xxx): Indigenous population variables
|
||
- **Dicionario PCT - Quilombolas** (V03000–V03xxx): Quilombola population variables
|
||
|
||
### Other Sources
|
||
|
||
- **basedosdados website**: https://basedosdados.org/dataset/br-ibge-censo-2022
|
||
- **IBGE SIDRA**: https://sidra.ibge.gov.br (search "Agregados por Setores Censitários")
|
||
- **IBGE Census 2022 page**: https://www.ibge.gov.br/estatisticas/sociais/populacao/28740-censo-demografico-2022.html
|
||
|
||
## Notes
|
||
|
||
- The `basedosdados-schema.json` context file lists these columns as `{"name":"v00001","type":"INTEGER"}` with **no description field** — this is a known documentation gap.
|
||
- The `br_ibge_censo_2022.dicionario` table in the DuckDB only contains 30 entries for `cadastro_enderecos` — the 1,411 sector-level variable descriptions are **missing** from it.
|
||
- For the 2010 census (`br_ibge_censo_demografico`), descriptions **are** included in the schema for most tables.
|