mirror of
https://github.com/AJaySi/ALwrity.git
synced 2026-04-25 00:45:54 +03:00
[PR #428] Batch upsert SEO page audits, progress reporting, and idempotency for onboarding full-site analysis #734
Labels
No labels
AI Content Agents
AI Content Strategy
AI Content planning
AI Marketing Tools
AI SEO
AI personalization
AI writer
ALwrity Copi-lot
Alwrity web search
Anthropic
DeepSeek
Gemini AI
Integration
LLM
OnBoarding
OnBoarding
RAG knowledgebase Memory
bug
documentation
enhancement
good first issue
help wanted
invalid
openai
pull-request
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ALwrity#734
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/AJaySi/ALwrity/pull/428
Author: @AJaySi
Created: 3/12/2026
Status: 🔄 Open
Base:
main← Head:codex/refactor-onboarding_full_website_analysis_executor📝 Commits (1)
4e17391Refactor onboarding full-site audit batching and progress reporting📊 Changes
1 file changed (+244 additions, -78 deletions)
View changed files
📝
backend/services/scheduler/executors/onboarding_full_website_analysis_executor.py(+244 -78)📄 Description
Motivation
SEOPageAuditrows without creating duplicate records.Description
persist_batch_size(default 50) and refactored_audit_urlsto collect per-page audit records in memory and flush them via a new_bulk_upsert_page_auditsfunction in batches._audit_single_urlto stop performing DB writes and instead return structured results includingaudit_recordandfailure_reason, so concurrent tasks do not share mutable session-side writes._build_audit_recordto create per-page payloads,_update_progressto persist periodic progress intotask.payloadandtask_log.result_data, and aggregated failure analytics (top_fail_reasons) and idempotency metadata in the final result.failure_detailsandexecution_summary(includingsuccess_rate,duration_ms) in task results.Testing
python -m compileall backend/services/scheduler/executors/onboarding_full_website_analysis_executor.pyand it succeeded.Codex Task
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.