Guhhhhaa

Software Engineer

// cayman islands, ky · UTC-5

Software engineer building in the open from the Cayman Islands.

profile sourced from GitHub

Scouting report

Built a 452★ Chinese sci-fi NLP corpus from scratch

assessed from open-source footprint

Principal
74 signal

Guhhhhaa's standout is 4675-scifi (452★), a Chinese-language NLP corpus of ~4,675 science-fiction novels, backed by a companion corpus, wula-scifi (129★). Together the two corpora hold 581 of the profile's 695 stars, a solid body of genuinely useful ML dataset work. The work is solo, niche, and dataset-heavy rather than production engineering, with only 3 commits last year and 86% of repos stale. A strong fit for data/NLP teams that value curated corpora; less so for hands-on app delivery.

Authorship & open source

Solo author: wrote 100% of commits on 4675-scifi
0 merged PRs
3 commits / yr

What they build

Data / ML: 53%
Frontend: 31%
Backend: 13%

Industry experience

  • Data, ML & AI
  • Education & EdTech
  • Fintech & Payments

Signal breakdown

Originality: 77
Impact: 100
Consistency: 66
Polish: 29

Stars

695

top repo 452

Original repos

36

45% forks

Followers

42

On GitHub

9.5 yr

Live demos

2

Activity

Active

86% stale

Strengths

  • Verified author — wrote 100% of commits on 4675-scifi
  • 695 stars earned across projects
  • A standout project with 452 stars
  • Publishes working demos: 2 live
  • Data / ML focus, with secondary frontend work
  • Domain experience in Data, ML & AI and in Education & EdTech
  • Core stack: HTML, CSS, Jupyter, Python