WikiGenDex

Gender assignment algorithm based on Wikidata + Second WGND Dictionary

This README explains how to use the list of Wikidata names together with the World Gender Name Dictionary (WGND) second dictionary to assign gender to names. Note that we only look at the probability that a given name is associated with a given gender in a specific country. That is, we do not identify the gender of a name; we estimate the probability of a gender-name combination.
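As a minimal sketch of what "probability of a gender-name combination in a country" can mean in practice, the base R snippet below computes the share of each gender within every name-country pair and assigns a gender only when that share clears a cut-off. The table, its column names (name, country, gender, n), the counts and the 0.9 threshold are all hypothetical; they are not taken from the Wikidata lists or from algoritmo.Rmd.

```r
# Toy reference table: hypothetical counts, NOT the real Wikidata lists
ref <- data.frame(
  name    = c("andrea", "andrea", "maria", "alex", "alex"),
  country = c("IT", "IT", "ES", "US", "US"),
  gender  = c("M", "F", "F", "M", "F"),
  n       = c(95, 5, 100, 60, 40),
  stringsAsFactors = FALSE
)

# Total count per name-country pair, aligned row by row with `ref`
totals <- ave(ref$n, ref$name, ref$country, FUN = sum)

# Share of each gender within its name-country pair
ref$prob <- ref$n / totals

# Keep the most probable gender for each name-country pair
is_best <- ref$prob == ave(ref$prob, ref$name, ref$country, FUN = max)
best <- ref[is_best, ]

# Assign that gender only when it is probable enough (assumed cut-off)
threshold <- 0.9
best$assigned <- ifelse(best$prob >= threshold, best$gender, "Unknown")

print(best[, c("name", "country", "assigned", "prob")])
```

With this toy data, "alex" in "US" stays "Unknown" because neither gender reaches the assumed threshold.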

It is also important to keep in mind that this gender assignment approach takes a binary view of gender and names: only Men/Women/Unknown outcomes are possible. So far, algorithms have not been able to solve this issue, which obscures and hides non-binary realities.

Given that, we have divided the procedure into six steps, plus two optional ones (steps seven and eight):

1. Load packages, clean your data
2. Assign gender to Slavic names* using Wikidata names (WDNAMESslav)
3. Assign gender to non-Slavic names* using Wikidata names (WDNAMESrest)
4. Join Slavic and non-Slavic names, clean data
5. With those names that remain unknown, use a second list of Wikidata names (WDNAMES2)**
6. With those names that remain unknown, use the WGND second dictionary based on language-gender-name information
7. Create graph to show results (Optional)
8. Create world map to show global distribution of results (Optional)
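The fallback logic of the lookup steps (try one reference list, then pass only the still-unknown names to the next one) can be sketched in base R as follows. This is an illustration under stated assumptions, not the code in algoritmo.Rmd: the helper `fill_unknown`, the toy tables `wd_main`, `wd_second`, `code_lang` and `wgnd`, their column names and their contents are all hypothetical stand-ins for the real files.

```r
# Fill in gender only for rows still marked "Unknown", matching on `keys`.
# (Hypothetical helper, written for this sketch.)
fill_unknown <- function(people, ref, keys) {
  make_key <- function(df) do.call(paste, c(unname(as.list(df[keys])), sep = "|"))
  hit  <- match(make_key(people), make_key(ref))  # NA when no reference row matches
  todo <- people$gender == "Unknown" & !is.na(hit)
  people$gender[todo] <- ref$gender[hit[todo]]
  people
}

# Toy stand-ins for the reference lists (assumed structure and content)
wd_main   <- data.frame(name = c("maria", "john"), country = c("ES", "US"),
                        gender = c("F", "M"), stringsAsFactors = FALSE)
wd_second <- data.frame(name = "ana", country = "PT",
                        gender = "F", stringsAsFactors = FALSE)
code_lang <- data.frame(country = c("ES", "PT", "AT", "US"),
                        langcode = c("es", "pt", "de", "en"),
                        stringsAsFactors = FALSE)
wgnd      <- data.frame(name = "heidi", langcode = "de",
                        gender = "F", stringsAsFactors = FALSE)

# Names to classify; everything starts as "Unknown"
people <- data.frame(name = c("maria", "ana", "heidi", "xyz"),
                     country = c("ES", "PT", "AT", "US"),
                     gender = "Unknown", stringsAsFactors = FALSE)

# First Wikidata list, then the second list for what is still unknown
people <- fill_unknown(people, wd_main, c("name", "country"))
people <- fill_unknown(people, wd_second, c("name", "country"))

# Last resort: map country to language, then match on name + language
people$langcode <- code_lang$langcode[match(people$country, code_lang$country)]
people <- fill_unknown(people, wgnd, c("name", "langcode"))

print(people)
```

Because each call only touches rows that are still "Unknown", an earlier, more specific match is never overwritten by a later, more general one.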

*The reason for this division is that Slavic names may carry gender information in the last name, whereas non-Slavic names do not.
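To illustrate why a Slavic last name can be informative, the sketch below guesses gender from a handful of common surname endings (for example Ivanov/Ivanova, Kowalski/Kowalska). It is only an illustration of the linguistic point: the function `guess_from_surname` and its short, incomplete suffix list are assumptions made for this example, whereas the procedure described here relies on the Wikidata list WDNAMESslav rather than on suffix rules.

```r
# Illustrative only: a small, incomplete sample of gendered surname endings
guess_from_surname <- function(surname) {
  s <- tolower(surname)
  ifelse(grepl("(ova|eva|skaya|ska)$", s), "F",      # feminine endings first
         ifelse(grepl("(ov|ev|sky|ski)$", s), "M",   # then masculine endings
                "Unknown"))                          # no recognised ending
}

surnames <- c("Ivanova", "Ivanov", "Kowalska", "Kowalski", "Novak")
guesses  <- guess_from_surname(surnames)
print(data.frame(surname = surnames, guess = guesses))
```

A surname such as "Novak" carries no gendered ending in this list, so it stays "Unknown" and would have to be resolved from the first name.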

**The files needed for this step, wgnd_2_0_code_langcode and wgnd_2_0_name_gender_langcode, can be found in the original World Gender Name Dictionary repository.

Warning: Depending on your data (the number of columns, etc.), you may need to make small modifications to the code. Some familiarity with R is recommended.

Repository files:

README.md
WDNAMES2.csv
WDNAMESrest.csv
WDNAMESslav.csv
algoritmo.Rmd

egonzalezsalmon/WikidataGender
