Open
Description
- database is obsolete: i examined the url list, there is more than 1700~ site entries but only around 500 of them is active (got screenshots)
- false positives: some websites redirecting to main page when there is no username, maigret assuming it's a profile page
- consider puppeteer: most websites using client side rendering now, showing empty content to command line scrapers.
- consider 2b llm agents: it's almost impossible to maintain repository to align with 500~ websites, create a pipeline to a low latency agent to verify profile exists or not