Skip to content

cysec-lab/typosquatting_domain_email

Repository files navigation

Typo-Squatting Domains in Emails

Introduction

We propose a model for generating typo-squatting domains that people are prone to mistakenly send emails to by collecting real-world domain typos. To construct this model, we created an email address dataset based on actual domains and conducted an experiment in which 300 participants were asked to input 40 email addresses each. This allowed us to gather domain typos in email addresses.
This repository provides access to the typo domains collected from our study.

Target Domains in the Study

The typo domains collected in our study meet the following criteria:

  • Domains conforming to RFC5321 and RFC5322
  • Domains using top-level domains (TLDs) approved by ICANN
  • Domains with a Damerau-Levenshtein distance of 1 to 5 from the correct domain

The table below provides examples of typo domains:

Correct Domain Typo Domain Description
example.com exampl.com Deletion
example.com eaxmple.com Transposition
example.com example.co TLD Change
example.com example.co.jp Damerau-Levenshtein distance = 3
example.co.jp exampleco.jp Deletion of (.)

The table below presents examples of typo domains not collected in this study:

Correct Domain Excluded Typo Domain Description
example.com example..com Does not conform to RFC
example.com example.coom TLD not approved by ICANN
example.com exampletyposquatting.com Damerau-Levenshtein distance ≥ 6

Dataset

We provide the following datasets:

Citation

If you use our dataset, please cite our paper.

The citation should look like this in a paper written in English (or any non-Japanese language):

Soma Sugahara, Rannosuke Hoshina, Tetsutaro Uehara: "Proposal of a Typosquatting Domain Generation Model based on the analysis of Typographical Error Tendencies", IPSJ SIG Technical Reports, Vol. 2025-IOT-68, No.59, p. 1-8, 2025

The citation should look like this in a paper written in Japanese:

菅原颯真, 星名藍乃介, 上原哲太郎: "タイプミス傾向の分析に基づくタイポスクワッティングドメイン生成モデルの提案", 研究報告インターネットと運用技術(IOT), Vol. 2025-IOT-68, No.59, p. 1-8, 2025

Contact

Tetsutaro Uehara (College of Information Science and Engineering, Ritsumeikan University)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published