Automatically generate weekly reports from your STT transcription database using GitHub Actions
With STT Report Generator, you can automatically generate weekly reports from your speech-to-text (STT) transcription database. The tool connects to your database, extracts key metrics about your transcription tasks, and generates both CSV data files and Markdown summary reports. The entire process is automated with GitHub Actions and runs on a schedule you configure.
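As a rough illustration of the two outputs described above, the following sketch turns a list of task rows into CSV text and a Markdown summary. It is not the project's actual code: the column names (`task_id`, `status`, `duration_seconds`), the sample rows, and both helper functions are assumptions made for the example.

```python
import csv
import io
from collections import Counter

# Hypothetical column names; the real schema may differ.
FIELDNAMES = ["task_id", "status", "duration_seconds"]


def build_csv(rows):
    """Serialize task rows (a list of dicts) to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDNAMES, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def build_markdown_summary(rows):
    """Summarize task counts per status and total audio minutes as Markdown."""
    counts = Counter(row["status"] for row in rows)
    total_minutes = sum(row["duration_seconds"] for row in rows) / 60
    lines = [
        "# Weekly STT Report",
        "",
        f"- Total tasks: {len(rows)}",
        f"- Audio minutes: {total_minutes:.1f}",
        "",
        "| Status | Tasks |",
        "| --- | --- |",
    ]
    for status, count in sorted(counts.items()):
        lines.append(f"| {status} | {count} |")
    return "\n".join(lines) + "\n"


# Made-up sample data standing in for a database query result.
sample_rows = [
    {"task_id": 1, "status": "transcribed", "duration_seconds": 120},
    {"task_id": 2, "status": "reviewed", "duration_seconds": 90},
    {"task_id": 3, "status": "transcribed", "duration_seconds": 30},
]

print(build_csv(sample_rows))
print(build_markdown_summary(sample_rows))
```

Keeping the CSV and the summary as separate functions over the same rows means the raw data and the human-readable report cannot drift apart.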
This project is intended for team members and managers who need regular reports on speech-to-text transcription progress without manual data extraction and analysis.
Before using STT Report Generator, ensure you have:
- A GitHub account with access to create repository secrets
- A database containing STT transcription tasks data
- Database credentials with read access to the transcription data
Get started with STT Report Generator by setting up the GitHub Actions workflow and database connection.
- Fork or clone this repository
- Set up your database connection secrets
a. Go to your repository's Settings > Secrets and variables > Actions
b. Create the following secrets for your PostgreSQL database:
   - HOST: Your database host (e.g., dpg-xxxxxx.oregon-postgres.render.com)
   - DBNAME: Your database name
   - DBUSER: Your database username
   - PASSWORD: Your database password
- Customize the database query (if needed)
Open generate_report.py and modify the query_transcription_data() function to match your database schema.
- Adjust the schedule in .github/workflows/report-generator.yml
The default setting runs every Monday at 9:00 AM UTC:
    schedule:
      - cron: '0 9 * * 1' # Every Monday at 9AM UTC
You can modify this cron expression to change the schedule.
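When editing the cron expression, it helps to remember that its five fields are, in order, minute, hour, day of month, month, and day of week, and that GitHub Actions evaluates schedules in UTC. The sketch below labels each field of the default expression; `split_cron` is a hypothetical helper written for this example, not part of the project, and the monthly variant is illustrative only.

```python
# Field names in the order a five-field cron expression lists them.
CRON_FIELDS = ("minute", "hour", "day_of_month", "month", "day_of_week")


def split_cron(expression):
    """Label the five fields of a cron expression (hypothetical helper)."""
    parts = expression.split()
    if len(parts) != len(CRON_FIELDS):
        raise ValueError(f"expected 5 fields, got {len(parts)}: {expression!r}")
    return dict(zip(CRON_FIELDS, parts))


# The default schedule: minute 0, hour 9, any day of month, any month, Monday (1).
default_schedule = split_cron("0 9 * * 1")
print(default_schedule)

# An illustrative variant: 06:30 UTC on the first day of every month.
monthly_schedule = split_cron("30 6 1 * *")
print(monthly_schedule)
```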
- Manual trigger:
a. Go to the Actions tab in your repository
b. Select the "Weekly STT Report Generation" workflow
c. Click "Run workflow"
- Automatic schedule:
Reports will automatically generate according to your configured schedule
| Issue | Solution |
| --- | --- |
| Workflow fails with database connection error | Check that your HOST, DBNAME, DBUSER, and PASSWORD secrets are set correctly and that the database is accessible from GitHub Actions |
| Query fails with column not found error | Modify the query in generate_report.py to match your database schema |
| Reports not appearing in repository | Check the Actions tab for workflow run logs and ensure GitHub Actions has write permissions to your repository |
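For connection errors, a quick first check is whether every expected setting actually reaches the script. The sketch below assumes the workflow exposes the four secrets from the setup steps as environment variables with the same names (HOST, DBNAME, DBUSER, PASSWORD); that mapping and the `load_db_settings` helper are assumptions for illustration, not the project's confirmed implementation.

```python
import os

# Secret names from the setup steps, assumed to be passed as environment variables.
REQUIRED_SETTINGS = ("HOST", "DBNAME", "DBUSER", "PASSWORD")


def load_db_settings(env=None):
    """Return connection settings, or raise naming every missing variable."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_SETTINGS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing database settings: " + ", ".join(missing))
    # Keys follow common PostgreSQL driver keyword names (host, dbname, user, password).
    return {
        "host": env["HOST"],
        "dbname": env["DBNAME"],
        "user": env["DBUSER"],
        "password": env["PASSWORD"],
    }


# Example with placeholder values instead of real credentials.
example_env = {
    "HOST": "db.example.com",
    "DBNAME": "stt",
    "DBUSER": "reporter",
    "PASSWORD": "not-a-real-password",
}
settings = load_db_settings(example_env)
print(sorted(settings))
```

Failing early with the names of the missing variables makes a misconfigured secret easier to spot in the workflow log than a generic driver error. The returned dictionary could then be passed to a PostgreSQL driver, for example as `psycopg2.connect(**settings)` if the project uses psycopg2.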
If you'd like to help out, check out our contributing guidelines.
For more information:
- File an issue.
- Email us at openpecha[at]gmail.com.
- Join our Discord.
STT Report Generator is licensed under the MIT License.