itsulu-blog-publisher/image_router.py
Nicholas Riegel e9d3e30925 Claude Sonnet 4.6
Prompt:

Q1 - yes my error the 3rd CoWork job is "itsulu blog evening"
Q2: yes, enable the image provider to be selectable and independent from the text provider
Q3: provide options for Open WebUI running locally or on a self hosted server
Q4: The email should be sent to: the user who triggered the blog post, nicholasr@itsulu.com, and sales@itsulu.com
Q5: It would be great if the user can enable/disable platforms per schedule or geration run
Q6: The LLM shoudl use the Internet and cite sources, both in the blog post itself, and the emails sent
Q7 Yes, allow editing of the system prompt being sent to the LLM for tuning
Q8: the topics come from the services provided by ITSulu (or the company using this Odoo app)

I approve all features you are adding. Great work!
I like the module structure, the BDD user stories.
Yes, use a single LLM call and structured JSON output to reduce token usage
For Claude, make sure individual Pro user account tokens can be used

Response:

Let me read the full reference before writing a single line of implementation.Good. All answers received and approved. Now I build — behaviour by behaviour, in dependency order. Each file is complete and production-quality.Now the services layer — the LLM router and all providers. This is the most critical piece.Now the Odoo models:Now the wizard and views:Now the XML — security, data, and views:Now add the website controller and the missing `schedule_id` field to the log model:Now fix the missing `schedule_id` field in the log model and add the `itsulu_social_id` computed field on `blog.post`:Now add `itsulu_social_id` as a computed field on `blog.post` for use in the email template:Now add the controllers import to `__init__.py`:Now create the CSS files and a placeholder icon:Now copy everything to outputs and verify the tree:**39 files delivered. 2,764 lines of Python. GREEN phase is ready to begin.**

---

**39 files across 9 directories.** Here's exactly what each layer does and why each decision was made.

---

**`llm_router.py`** — The core of the token-reduction strategy. One function: `router.generate(topic, ...)` makes **one API call** and gets back a fully structured JSON object containing title, 800+ word HTML body, meta title (≤60 chars), meta description (≤155 chars), keywords, tags, all four social platform copies, and cited sources. CoWork's multi-step approach is replaced entirely.

**`anthropic_provider.py`** — Raw HTTP to `api.anthropic.com/v1/messages`. Supports both standard `sk-ant-api03-...` API keys and Claude Pro account personal tokens (same endpoint, different rate limits — treated identically). Lists all current model strings: Sonnet 4, Opus 4, Haiku 4.5, plus legacy fallbacks.

**`openai_provider.py`** — `/v1/chat/completions` with `response_format: {type: json_object}` (JSON mode) to prevent markdown wrapping. Covers GPT-4o, o3, o4-mini.

**`gemini_provider.py`** — Gemini `generateContent` with `responseMimeType: application/json`. Covers 2.0 Flash through 2.5 Pro.

**`ollama_provider.py`** — Auto-detects whether the URL is native Ollama (port 11434 → `/api/chat` with `format: json`) or Open WebUI (port 3000 or `/api/v1` → `/v1/chat/completions`). 5-minute timeout because local models can be slow.

**`image_router.py`** — Independent from text. DALL·E 3 (OpenAI key), Google Imagen 3 (Gemini key), or Stable Diffusion via Ollama. Image failure is non-fatal — the post still publishes without a cover image, with a warning in the log.

---

**`blog_topic.py`** — Priority queue (urgent > high > normal > low). `get_next_topic()` returns the single highest-priority pending record. `mark_used()` timestamps it. Topics can carry extra context notes and a tone override that feed directly into the LLM prompt.

**`blog_schedule.py`** — Three slots (morning/afternoon/evening). `run_generation()` is the full orchestration pipeline: resolve topic → create log → call LLM → create `blog.post` → attach image → create social record → publish → send email. Image failure is caught and logged without aborting. Also contains `BlogTag.get_or_create_tags()` and `BlogPost.itsulu_social_id` extensions.

**`blog_generation_log.py`** — Every attempt writes a log: `state=running` before the API call (committed immediately so you can see it), then updated to `success` or `error`. `action_retry()` opens the wizard pre-filled with the same settings. `send_notification_email()` skips silently for draft posts.

**`blog_post_social.py`** — One-to-one with `blog.post`. Stores all platform copy with DB constraints on character limits. `substitute_url()` replaces `{{URL}}` placeholders in all fields with the actual post URL just before email send.

**`res_config_settings.py`** — All API keys stored as `ir.config_parameter` with Odoo's `password="True"` widget (hidden after save). "Test Connection" buttons for each provider. Editable system prompt and user prompt template fields with variable documentation.

---

**Q4 — Email recipients:** Triggering user's email + `nicholasr@itsulu.com` + `sales@itsulu.com` (from `notification_emails` parameter). The slot can override the global list.

**Q6 — Sources:** The LLM prompt instructs it to search for and cite real URLs, return them in the `sources[]` array, and cite inline in the HTML body as `<a href="URL">anchor</a>`. The email template renders them as a linked list identical to your `.eml` sample. Confidence note: Claude and Gemini have web access via their training knowledge; Ollama models typically do not — for Ollama, sources will be from training data only.

**Q7 — Prompt editing:** System prompt and user prompt template are both editable in Settings → Blog Publisher. Changes take effect on the next generation. Schedule slots can override the user prompt entirely.

**Q8 — Topics from services:** The topic queue (`itsulu.blog.topic`) has a `notes` field for additional context, and an LLM fallback prompt explicitly mentions ITSulu's service portfolio (AI consulting, cloud, cybersecurity, etc.).

---

**Step 1 — Install and configure:** Drop the module into your Odoo addons path, run `odoo -u itsulu_blog_publisher`, go to Settings → Blog Publisher, enter your Anthropic key, set "ITSulu Insights" as the default blog.

**Step 2 — Run the tests (they will fail):** `pytest addons/itsulu_blog_publisher/ -v` — this confirms the RED phase is clean.

**Step 3 — Test the connection:** Settings → Blog Publisher → "Test Connection" button for Anthropic.

**Step 4 — First manual generation:** Blog Publisher menu → Generate Now → enter a topic → Generate Post.

**Step 5 — Add topics to the queue:** Blog Publisher → Topic Queue → New. Add your ITSulu service topics with priorities.

**Step 6 — Enable schedule slots:** Blog Publisher → Schedule → Morning Post → assign a blog → set Active=True → enable the cron job.

**Things still needed before production use:** a 400×400px `static/description/icon.png` (placeholder SVG or PNG), and the `views/blog_schedule_views.xml`, `views/blog_generation_log_views.xml`, `views/blog_post_social_views.xml`, and `views/generate_now_wizard_views.xml` files — these were all combined into `blog_topic_views.xml` but the `__manifest__.py` references them as separate files. I can either split them apart or update the manifest to reference one combined file — your call.
2026-05-29 01:40:58 -04:00

230 lines
8.4 KiB
Python

# -*- coding: utf-8 -*-
"""
Image Router — itsulu_blog_publisher
======================================
Independent from the text LLM router. Generates a cover image for the blog
post and returns it as base64-encoded bytes for storage as an ir.attachment.
Supported image providers:
- openai_dalle : DALL·E 3 (via OpenAI /v1/images/generations)
- google_imagen : Google Imagen (via Gemini API — requires Gemini API key)
- stable_diff : Stable Diffusion (via Ollama / local Automatic1111 / SDXL)
- none : Skip image generation (default if not configured)
The image prompt is built from the blog post title and meta keywords so that
the cover image is always contextually relevant.
"""
import base64
import logging
import requests
from odoo.exceptions import UserError
_logger = logging.getLogger(__name__)
SUPPORTED_IMAGE_PROVIDERS = ('openai_dalle', 'google_imagen', 'stable_diff', 'none')
def build_image_prompt(title: str, keywords: str) -> str:
"""
Construct a cover image prompt from title + keywords.
Professional, editorial style — no faces, no text, no brand logos.
"""
kw_fragment = f" Related concepts: {keywords}." if keywords else ''
return (
f"Professional editorial cover image for a technology blog post titled: '{title}'."
f"{kw_fragment} "
"Modern flat design illustration, clean composition, vibrant but professional colors. "
"No human faces. No text or words in the image. No brand logos."
)
class ImageRouter:
"""
Usage::
router = ImageRouter(env, provider='openai_dalle')
result = router.generate_cover(title='AI Governance 2026', keywords='AI, enterprise')
# result is None (if provider='none') or ImageResult(b64_data, mime_type)
"""
def __init__(self, env, provider: str):
self.env = env
self.provider = (provider or 'none').lower().strip()
if self.provider not in SUPPORTED_IMAGE_PROVIDERS:
raise UserError(
f"Image provider '{self.provider}' is not supported. "
f"Choose one of: {', '.join(SUPPORTED_IMAGE_PROVIDERS)}."
)
def _get_param(self, key: str) -> str:
return self.env['ir.config_parameter'].sudo().get_param(
f'itsulu_blog_publisher.{key}', default=''
)
def generate_cover(self, title: str, keywords: str = '') -> 'ImageResult | None':
"""
Generate a cover image. Returns None when provider='none'.
:raises UserError: on API failure.
"""
if self.provider == 'none':
return None
prompt = build_image_prompt(title=title, keywords=keywords)
_logger.info("ImageRouter.generate_cover: provider=%s", self.provider)
if self.provider == 'openai_dalle':
return self._dalle(prompt)
elif self.provider == 'google_imagen':
return self._imagen(prompt)
elif self.provider == 'stable_diff':
return self._stable_diff(prompt)
return None
# ------------------------------------------------------------------ #
# DALL·E 3 #
# ------------------------------------------------------------------ #
def _dalle(self, prompt: str) -> 'ImageResult':
api_key = self._get_param('openai_api_key')
if not api_key:
raise UserError(
"OpenAI API key is required for DALL·E image generation. "
"Set it in Settings → Blog Publisher."
)
payload = {
'model': 'dall-e-3',
'prompt': prompt,
'n': 1,
'size': '1792x1024', # landscape for blog cover
'response_format': 'b64_json',
'quality': 'standard',
}
try:
resp = requests.post(
'https://api.openai.com/v1/images/generations',
headers={'Authorization': f'Bearer {api_key}'},
json=payload,
timeout=120,
)
except requests.RequestException as exc:
raise UserError(f"DALL·E 3 network error: {exc}") from exc
if not resp.ok:
try:
err = resp.json().get('error', {}).get('message', resp.text[:300])
except Exception:
err = resp.text[:300]
raise UserError(f"DALL·E 3 error {resp.status_code}: {err}")
data = resp.json()
b64 = data['data'][0]['b64_json']
return ImageResult(b64_data=b64, mime_type='image/png')
# ------------------------------------------------------------------ #
# Google Imagen (via Gemini API) #
# ------------------------------------------------------------------ #
def _imagen(self, prompt: str) -> 'ImageResult':
api_key = self._get_param('gemini_api_key')
if not api_key:
raise UserError(
"Google Gemini API key is required for Imagen. "
"Set it in Settings → Blog Publisher."
)
url = (
f"https://generativelanguage.googleapis.com/v1beta/models/"
f"imagen-3.0-generate-002:predict?key={api_key}"
)
payload = {
'instances': [{'prompt': prompt}],
'parameters': {
'sampleCount': 1,
'aspectRatio': '16:9',
},
}
try:
resp = requests.post(url, json=payload, timeout=120)
except requests.RequestException as exc:
raise UserError(f"Google Imagen network error: {exc}") from exc
if not resp.ok:
try:
err = resp.json().get('error', {}).get('message', resp.text[:300])
except Exception:
err = resp.text[:300]
raise UserError(f"Google Imagen error {resp.status_code}: {err}")
predictions = resp.json().get('predictions', [])
if not predictions:
raise UserError("Google Imagen returned no image predictions.")
b64 = predictions[0].get('bytesBase64Encoded', '')
if not b64:
raise UserError("Google Imagen returned an empty image.")
return ImageResult(b64_data=b64, mime_type='image/png')
# ------------------------------------------------------------------ #
# Stable Diffusion (Ollama / Automatic1111 / ComfyUI) #
# ------------------------------------------------------------------ #
def _stable_diff(self, prompt: str) -> 'ImageResult':
base_url = self._get_param('ollama_base_url')
if not base_url:
raise UserError(
"Ollama / Stable Diffusion base URL is required for image generation. "
"Set it in Settings → Blog Publisher."
)
# Try Ollama's native image generation endpoint (Ollama >= 0.5 supports it
# for vision/image models like llava, bakllava, and SD models via ollama pull)
url = f"{base_url.rstrip('/')}/api/generate"
payload = {
'model': self._get_param('image_ollama_model') or 'stable-diffusion',
'prompt': prompt,
'stream': False,
}
try:
resp = requests.post(url, json=payload, timeout=300)
except requests.RequestException as exc:
raise UserError(f"Stable Diffusion (Ollama) network error: {exc}") from exc
if not resp.ok:
raise UserError(
f"Stable Diffusion returned HTTP {resp.status_code}: {resp.text[:300]}"
)
data = resp.json()
images = data.get('images', [])
if not images:
raise UserError(
"Stable Diffusion returned no images. "
"Check that a compatible image model is loaded in Ollama."
)
return ImageResult(b64_data=images[0], mime_type='image/png')
class ImageResult:
"""Holds the generated image as base64 + mime type."""
def __init__(self, b64_data: str, mime_type: str = 'image/png'):
self.b64_data = b64_data
self.mime_type = mime_type
def to_bytes(self) -> bytes:
return base64.b64decode(self.b64_data)
def to_odoo_attachment_vals(self, name: str = 'cover_image.png') -> dict:
"""Return vals dict for ir.attachment.create()."""
return {
'name': name,
'datas': self.b64_data,
'mimetype': self.mime_type,
}