babynamesIL
contains the a dataset of names given to Israeli babies in the years 1948-2022. The data was downloaded from the Israeli Bureau of Statistics (LAMAS). Only names used for at least 5 children in at least one gender and sector (“Jewish”, “Muslim”, “Christian”, “Druze” and “Other”) are included.
The following objects are available:
babynamesIL
: For each year in from 1948 to 2022, the number of babies of each sex and sector given to each name.babynamesIL_totals
: a dataframe with the total number of of babies with each names per sex and sector.# Install the released version from CRAN
install.packages("babynamesIL")
# Install the development version from GitHub
devtools::install_github("aviezerl/babynamesIL")
library(babynamesIL)
# yearly data
head(babynamesIL)
#> # A tibble: 6 × 6
#> sector year sex name n prop
#> <chr> <dbl> <chr> <chr> <int> <dbl>
#> 1 Christian 1948 F לילה 19 0.129
#> 2 Christian 1948 F סמירה 14 0.0952
#> 3 Christian 1948 F מארי 13 0.0884
#> 4 Christian 1948 F נואל 11 0.0748
#> 5 Christian 1948 F אמל 9 0.0612
#> 6 Christian 1948 F מרים 7 0.0476
# totals per name
head(babynamesIL_totals)
#> # A tibble: 6 × 4
#> sector sex name total
#> <chr> <chr> <chr> <int>
#> 1 Christian F מריה 1063
#> 2 Christian F אמל 849
#> 3 Christian F מרים 751
#> 4 Christian F מארי 580
#> 5 Christian F רים 555
#> 6 Christian F מאיה 530
Add the missing values:
library(dplyr)
#>
#> Attaching package: 'dplyr'
#> The following objects are masked from 'package:stats':
#>
#> filter, lag
#> The following objects are masked from 'package:base':
#>
#> intersect, setdiff, setequal, union
babynamesIL %>%
tidyr::complete(sector, year, sex, name, fill = list(n = 0, prop = 0))
#> # A tibble: 3,204,000 × 6
#> sector year sex name n prop
#> <chr> <dbl> <chr> <chr> <int> <dbl>
#> 1 Christian 1948 F 'ג'ורג' 0 0
#> 2 Christian 1948 F אאוס 0 0
#> 3 Christian 1948 F אבא 0 0
#> 4 Christian 1948 F אבגיל 0 0
#> 5 Christian 1948 F אבטיסאם 0 0
#> 6 Christian 1948 F אבי 0 0
#> 7 Christian 1948 F אבי-אברהם 0 0
#> 8 Christian 1948 F אביאור 0 0
#> 9 Christian 1948 F אביאל 0 0
#> 10 Christian 1948 F אביב 0 0
#> # ℹ 3,203,990 more rows
Plot trajectory of names:
library(ggplot2)
babynamesIL %>%
tidyr::complete(sector, year, sex, name, fill = list(n = 0, prop = 0)) %>%
filter(name == "מעיין", sector == "Jewish") %>%
ggplot(aes(x = year, y = prop, color = sex)) +
geom_line() +
ggsci::scale_color_aaas() +
scale_y_continuous(labels = scales::percent) +
ggtitle("מעיין הבן מול מעיין הבת") +
theme_classic()
Data was downloaded from: here.
2022 data was downloaded from: here
A web app using the data is available here