Initial Image Curation (CRITICAL)
- Identify and REMOVE ALL images not directly related to the main article subject
- MUST DELETE ALL promotional or recommended content images
- MUST DELETE ALL images at the end of articles that don’t show the main subject
- Keep ONLY images that show people, places, or things explicitly mentioned in the article
- When in doubt about relevance, ALWAYS prioritize removal over inclusion
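The removal-first rule above can be read as a default-deny filter. The sketch below is only an illustration: the image record shape and the `relevance` labels are assumptions, not part of any schema defined here, and the relevance judgment itself is assumed to happen in an earlier step.

```javascript
// Hypothetical image record: { url, description, relevance }.
// `relevance` is assumed to be "direct", "promotional", "unrelated",
// or left undefined when the reviewer could not decide.
function curateImages(images) {
  // Default-deny: keep an image only when it is positively marked as
  // directly relevant. A missing or unknown label counts as doubt,
  // so the image is removed.
  return images.filter((image) => image.relevance === "direct");
}
```

Filtering on a positive match, rather than excluding known bad labels, means any image that was never classified is dropped, which mirrors "prioritize removal over inclusion".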
Content Type Identification
- Determine content type/genre (news article, blog post, academic paper, technical guide, etc.)
- Identify publication style (formal news, tabloid, personal blog, corporate document, etc.)
- Recognize target audience and expected conventions
- Note level of formality appropriate for the content type
- Identify key structural elements expected in this genre
- Observe tone requirements (objective for news, personal for blogs, etc.)
Content and Image Analysis
- Filter out irrelevant or low-quality content and images
- Identify the content domain/industry
- Extract key points and core ideas in English
- Identify the core message and purpose of the article
- Note important statistics, quotes, or data
- Observe existing organizational patterns and improve where needed
- Identify attribution patterns for sources and information
Genre-Appropriate Rewriting
- Apply the appropriate structure for the identified content type
- Incorporate human elements such as varied sentence structure and natural transitions
- Maintain appropriate formality level for the genre
- Check for and remove obvious AI writing patterns
- Ensure proper attribution and sourcing for the content type
Final Validation
- Validate JSON with JSON.parse() before returning
- Ensure ALL text has been translated to ENGLISH
- Check that all line breaks are written as the escape sequence \n, with no literal line breaks in the output
- Verify that the JSON is well-formed and contains all required fields
- Review each image again to confirm direct relevance to main article subject
- Verify all promotional or suggested content images have been removed
- Confirm all images at the end of the article that are not directly about the main subject have been removed
- Double-check that all remaining image descriptions have been translated to ENGLISH
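The mechanical checks in this list (parsing, line-break escaping, required fields) can be sketched as a small helper. This is a minimal sketch under stated assumptions: the function name `validateOutput` and the entries in `REQUIRED_FIELDS` are hypothetical, since the required fields are not named here. The translation and image-relevance checks still need a human or model review and are not covered.

```javascript
// Hypothetical field names; replace with whatever the real output requires.
const REQUIRED_FIELDS = ["title", "content", "images"];

function validateOutput(rawJson) {
  const errors = [];

  // The checklist asks for no literal line breaks at all, so any raw CR or LF
  // in the payload is flagged, even one that JSON itself would tolerate
  // between tokens. Line breaks inside text must appear as the \n escape.
  if (/[\r\n]/.test(rawJson)) {
    errors.push("payload contains a literal line break; use the \\n escape");
  }

  // JSON.parse throws a SyntaxError on malformed input.
  let parsed;
  try {
    parsed = JSON.parse(rawJson);
  } catch (err) {
    errors.push("invalid JSON: " + err.message);
    return { ok: false, errors };
  }

  if (parsed === null || typeof parsed !== "object" || Array.isArray(parsed)) {
    errors.push("top-level value must be an object");
    return { ok: false, errors };
  }

  for (const field of REQUIRED_FIELDS) {
    if (!(field in parsed)) errors.push("missing required field: " + field);
  }

  return { ok: errors.length === 0, errors };
}
```

Building the payload with JSON.stringify and no indentation argument already yields a single-line string with line breaks escaped as \n, so output produced that way should pass the line-break check without extra handling.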