GPTWiki automatically turns random internet content into clean, structured Wikipedia-style articles and uploads them to your MediaWiki site.
It uses OpenAI-compatible LLMs to:
- Extract real academic-style topics from raw text (e.g., random Reddit posts)
- Write full articles in MediaWiki format (6+ sections, encyclopedic tone)
- Upload everything to your wiki via API, 24/7, no human needed
You control the prompts and source. GPTWiki handles the writing, structuring, and publishing.
- Ingest any data: Reddit, Stack Overflow, RSS, crawled posts, etc.
- Auto-generate specific academic titles via LLM prompt
- Write full encyclopedic articles in MediaWiki markup
- Auto-generate categories
- Upload directly to MediaWiki via API
- Overwrite mode or skip existing pages
- Multithreaded background loop
- 100% customizable: bring your own data, prompt, and endpoint
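As a rough illustration of the upload and overwrite/skip features above, the sketch below builds the parameters for a MediaWiki `action=edit` request. Using the API's `createonly` flag for skip mode is an assumption about how this could be done, not a description of GPTWiki's internals. `build_edit_payload` and its arguments are hypothetical names.

```python
from typing import Dict


def build_edit_payload(
    title: str,
    wikitext: str,
    csrf_token: str,
    overwrite: bool = False,
) -> Dict[str, str]:
    """Build form fields for a MediaWiki action=edit request (hypothetical helper)."""
    payload = {
        "action": "edit",
        "format": "json",
        "title": title,
        "text": wikitext,
        "token": csrf_token,
    }
    if not overwrite:
        # Assumed skip mode: createonly asks MediaWiki to refuse the edit
        # when the page already exists, instead of replacing it.
        payload["createonly"] = "1"
    return payload


skip_mode = build_edit_payload("Graph theory", "== Overview ==", "TOKEN")
force_mode = build_edit_payload("Graph theory", "== Overview ==", "TOKEN", overwrite=True)
```

The resulting dictionary would be sent as a POST to the wiki's `api.php` from a logged-in session, with the CSRF token obtained beforehand via `action=query&meta=tokens`.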
- Collect Data: Use any text source. The built-in example uses the Reddit API to fetch recent CS-related discussions.
- Generate Title: The LLM turns the noisy input (e.g., a Reddit post) into a real academic-style topic.
- Generate Article: With your prompt template, GPTWiki creates a long MediaWiki-formatted article with 6+ sections.
- Auto Categorize: The LLM generates relevant categories (e.g., Artificial Intelligence, Cognitive Science) for indexing.
- Upload: Articles are uploaded to your MediaWiki instance via API. You can choose to:
  - Skip if the page exists
  - Force overwrite for updates
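The steps above can be sketched end to end as follows. This is a minimal sketch, not the code in `example.py`: the `llm` callable, the prompt wording, and the use of a plain dictionary as a stand-in for the wiki are all assumptions made so the flow can run without network access.

```python
from typing import Callable, Dict, List


def render_wikitext(body: str, categories: List[str]) -> str:
    """Append MediaWiki category links to an article body."""
    tags = "\n".join(f"[[Category:{name}]]" for name in categories)
    return f"{body.rstrip()}\n\n{tags}" if tags else body


def run_pipeline(
    seed: str,
    llm: Callable[[str], str],   # hypothetical: takes a prompt, returns text
    wiki: Dict[str, str],        # hypothetical stand-in for the MediaWiki site
    overwrite: bool = False,
) -> str:
    """Turn one raw seed into a stored article; returns 'uploaded' or 'skipped'."""
    # Generate Title
    title = llm(f"Give one academic-style topic title for: {seed}").strip()

    # Skip early so no article is generated for a page that will not be written.
    if title in wiki and not overwrite:
        return "skipped"

    # Generate Article
    body = llm(f"Write a MediaWiki-formatted article with 6+ sections about: {title}")

    # Auto Categorize
    raw = llm(f"List comma-separated categories for: {title}")
    categories = [c.strip() for c in raw.split(",") if c.strip()]

    # Upload (here: store in the dictionary)
    wiki[title] = render_wikitext(body, categories)
    return "uploaded"
```

Passing the LLM and the wiki in as arguments keeps each step replaceable, which matches the "bring your own data, prompt, and endpoint" idea. A real run would swap the dictionary for MediaWiki API calls.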
Although our example uses the Reddit API, you can plug in:
- Reddit / Stack Overflow
- Academic abstracts
- RSS feeds, blogs
- Internal datasets
- Even a simple .txt file
Just make sure each line is a string-like topic or article seed.
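For the plain `.txt` case, a loader along these lines would be enough. It is a sketch under the assumption of one seed per line; `clean_seeds`, `load_seeds`, and the file name in the usage note are hypothetical.

```python
from pathlib import Path
from typing import Iterable, List


def clean_seeds(lines: Iterable[str]) -> List[str]:
    """Strip whitespace, drop blank lines, and remove duplicates, keeping order."""
    seen = set()
    seeds = []
    for line in lines:
        seed = line.strip()
        if seed and seed not in seen:
            seen.add(seed)
            seeds.append(seed)
    return seeds


def load_seeds(path: str) -> List[str]:
    """Read a UTF-8 text file and return one cleaned seed per non-empty line."""
    return clean_seeds(Path(path).read_text(encoding="utf-8").splitlines())
```

Calling `load_seeds("topics.txt")` would then yield a list of strings that can be fed to the title-generation step one at a time.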
EdwardWiki: 100% generated with GPTWiki
Fetches Reddit posts, converts to advanced research topics, writes full-length Wikipedia-style articles, and auto-publishes to MediaWiki.
See example.py, a full working script that:
- Uses the Reddit API for topic input
- Generates titles via prompt
- Produces long-form MediaWiki content
- Classifies and uploads to your MediaWiki
- Runs in a multithreaded infinite loop
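The multithreaded loop could be structured roughly as below. This is one possible shape using only the standard library, not necessarily what `example.py` does. `run_batch`, `background_loop`, `fetch_seeds`, and `handle` are hypothetical names, and the `stop` event is added so the otherwise endless loop can be shut down cleanly.

```python
import queue
import threading
from typing import Callable, Iterable


def run_batch(seeds: Iterable[str], handle: Callable[[str], None], num_threads: int = 4) -> None:
    """Process one batch of seeds with a small pool of worker threads."""
    work: "queue.Queue[str]" = queue.Queue()
    for seed in seeds:
        work.put(seed)

    def worker() -> None:
        while True:
            try:
                seed = work.get_nowait()
            except queue.Empty:
                return  # batch drained, this worker is done
            try:
                handle(seed)
            except Exception as exc:
                # One failing seed should not stop the rest of the batch.
                print(f"failed on {seed!r}: {exc}")

    threads = [threading.Thread(target=worker, daemon=True) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()


def background_loop(
    fetch_seeds: Callable[[], Iterable[str]],
    handle: Callable[[str], None],
    stop: threading.Event,
    num_threads: int = 4,
    pause: float = 60.0,
) -> None:
    """Fetch and process batches repeatedly until `stop` is set."""
    while not stop.is_set():
        run_batch(fetch_seeds(), handle, num_threads)
        stop.wait(pause)  # sleep between batches, waking early if stopped
```

Here `handle` would wrap the title, article, category, and upload steps for a single seed, and `fetch_seeds` would pull the next batch from Reddit or another source.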
GPTWiki is inspired by LSJBOT, the bot that created millions of Wikipedia articles using static datasets.
Unlike LSJBOT, GPTWiki demonstrates that modern LLMs combined with live APIs (e.g., Reddit) can generate structured, high-quality encyclopedia articles with zero human editing.
It proves that full-scale, autonomous wiki generation is now viable using public data and AI.