Original post: https://bsky.app/profile/rebeccamwrites.bsky.social/post/3m6a62btty226
Here’s the dataset and their website if you are interested. They’ve looked at more games than just Final Fantasy.
Main findings
- 35% of words were spoken by female characters.
- 29% of characters were female, which suggests the imbalance is driven by a lack of female characters.
- 94% of games had more male dialogue than female dialogue.
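
To make the reasoning behind the second point a bit more concrete, here's a rough back-of-the-envelope check in Python. It only uses the two percentages quoted above. The big assumption (mine, not the dataset's) is that everything not counted as female is counted as male, which ignores any other categories the dataset may track. The function name is just made up for illustration.

```python
def per_character_share(word_share, character_share):
    """Share of words divided by share of characters.

    1.0 means the group's characters speak exactly the overall
    average number of words per character.
    """
    return word_share / character_share


# Figures quoted in the findings above.
female_words, female_chars = 0.35, 0.29

# Assumption: all remaining words and characters are treated as male.
male_words, male_chars = 1 - female_words, 1 - female_chars

female_rate = per_character_share(female_words, female_chars)
male_rate = per_character_share(male_words, male_chars)

print(f"female: {female_rate:.2f}x the average words per character")  # 1.21x
print(f"male:   {male_rate:.2f}x the average words per character")    # 0.92x
```

Under that assumption the average female character actually speaks a bit more than the average male character, so the overall gap in dialogue would come from there being fewer female characters, which fits the finding as stated.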


It looks like the Python scraper script pulls the game script from here: https://www.ffcompendium.com/h/faqs/ffx2scriptaschthehated.txt
I’m not familiar with X-2 or their method for parsing the game script, but the project doesn’t look too hard to set up, and you can submit issues on their GitHub. If you want to take a look, I would say go for it. If they made a mistake, I’m sure they would like to know.
Looks like all the Al Bhed dialogue might be double-counted. The script there contains: