Adds a `db_backup` sidecar that dumps tracksolid_db every night at
02:30 UTC (configurable via BACKUP_HOUR/BACKUP_MINUTE), gzips the
output, and uploads to s3://fleet-db/daily/<dbname>_<ts>.sql.gz on
the rustfs S3-compatible instance (s3.rahamafresh.com). Prunes
objects older than BACKUP_KEEP_DAYS (default 30).
Required .env additions (Coolify UI):
RUSTFS_ENDPOINT=https://s3.rahamafresh.com
RUSTFS_ACCESS_KEY=...
RUSTFS_SECRET_KEY=...
RUSTFS_BUCKET=fleet-db
Mitigates data loss when Coolify service recreation wipes the
service-ID-scoped timescale-data volume.
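The key layout and retention cutoff described above can be sketched as pure helpers. Function names and the exact timestamp format are assumptions for illustration, not the sidecar's actual code:

```python
from datetime import datetime, timedelta, timezone

def object_key(dbname: str, now: datetime) -> str:
    """Build a daily/<dbname>_<ts>.sql.gz object key (assumed ts format)."""
    return f"daily/{dbname}_{now.strftime('%Y%m%d_%H%M%S')}.sql.gz"

def expired_keys(keys, now: datetime, keep_days: int = 30):
    """Return keys whose embedded timestamp is older than keep_days."""
    cutoff = now - timedelta(days=keep_days)
    out = []
    for key in keys:
        # daily/tracksolid_db_20260101_023000.sql.gz -> 20260101_023000
        parts = key.rsplit("_", 2)
        stamp = "_".join(parts[-2:]).removesuffix(".sql.gz")
        ts = datetime.strptime(stamp, "%Y%m%d_%H%M%S").replace(tzinfo=timezone.utc)
        if ts < cutoff:
            out.append(key)
    return out
```

The real sidecar would feed `object_key` to `pg_dump | gzip` plus an S3 upload, and pass the bucket listing through `expired_keys` before deleting.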
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Three changes that together close the FK-violation loop on /pushalarm:
1. import_drivers_csv.py: when an IMEI is in the CSV but not in
tracksolid.devices, INSERT a new row instead of skipping. Unblocks
the 140 X3/JC400P devices listed as a HIGH open item in CLAUDE.md §10.
2. webhook_receiver_rev.py: new _ensure_device() helper upserts a stub
devices row (status='unknown') before inserting an alarm. Handles the
third class of devices — not in API sync, not in CSV (e.g. the
X3-63282 Kampala device flagged in CLAUDE.md §10).
3. CSV refreshed from Downloads (Apr 21 version, 140 active rows).
Also fixes the alarm error log previously showing "None": the handler was
reading deviceImei instead of the integration push's imei field.
CSV import already applied live on the instance (63 → 201 devices).
Webhook patch requires a Coolify redeploy to pick up _ensure_device().
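The upsert pattern behind _ensure_device() can be sketched as follows; the column list is assumed from this message, not the live schema, and the FakeCursor is a stand-in so the pattern can be shown without a DB:

```python
class FakeCursor:
    """Records SQL instead of hitting a database (illustration only)."""
    def __init__(self):
        self.calls = []
    def execute(self, sql, params=None):
        self.calls.append((sql, params))

def ensure_device(cur, imei: str) -> None:
    """Upsert a stub devices row (status='unknown') before inserting an
    alarm, so an unseen IMEI cannot trigger an FK violation. Fully-synced
    rows are left untouched by ON CONFLICT DO NOTHING."""
    cur.execute(
        "INSERT INTO tracksolid.devices (imei, status) "
        "VALUES (%s, 'unknown') ON CONFLICT (imei) DO NOTHING",
        (imei,),
    )
```

Calling this once per incoming alarm is cheap: the conflict path touches no row data, and the stub row is later enriched by the daily API sync.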
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Diagnostic logging revealed the real Jimi integration push format:
Content-Type: application/x-www-form-urlencoded
Body: msgType=jimi.push.device.alarm&data=<URL-encoded JSON>
Differences from docs:
- data is one JSON object per POST (not a data_list array)
- alarm uses imei+alarmTime, NOT deviceImei+gateTime
_parse_request now reads form field `data` (falls back to `data_list`) and
JSON-decodes a single object or array. push_alarm handler accepts either
field naming for forward-compat.
Removes diagnostic INFO log now that format is confirmed.
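The parsing order can be sketched like this; the function name and exact precedence are assumptions, and the real helper works on a FastAPI Request rather than raw bytes:

```python
import json
from urllib.parse import parse_qs

def parse_push(content_type: str, body: bytes) -> list:
    """Return a list of push items from either a JSON body
    ({"data_list": [...]}) or a form-encoded body (data=<URL-encoded JSON>)."""
    if content_type.startswith("application/json"):
        payload = json.loads(body)
        return payload.get("data_list", [])
    form = parse_qs(body.decode())
    raw = (form.get("data") or form.get("data_list") or ["[]"])[0]
    decoded = json.loads(raw)
    # Jimi sends one JSON object per POST; normalise to a list either way.
    return decoded if isinstance(decoded, list) else [decoded]
```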
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Temporary diagnostic to see what format Jimi actually sends on /pushalarm.
New container is parsing to empty items (pushes arrive but no DB insert),
so we need to see the real body shape. Remove once format is confirmed.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Jimi's integration push API (tracksolidprodocs.jimicloud.com) sends
Content-Type: application/json with body {"token":"...","data_list":[...]},
not form-encoded. FastAPI Form() silently defaulted to "" so all pushes
were discarded with "Failed to parse data_list:" warnings.
Replaces per-endpoint Form() params with a shared _parse_request() helper
that tries JSON body first, falls back to form-encoded. All seven push
endpoints (pushobd, pushfaultinfo, pushalarm, pushgps, pushhb,
pushtripreport, pushevent) updated.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Add a second Grafana dashboard focused on daily operational KPIs and live
dispatch, keeping the NOC Live dashboard untouched.
- grafana/provisioning/dashboards-json/daily_operations_dashboard.json
New dashboard covering §7 Blueprint Panels 3-8 and the §4 dispatch lens:
freshness banner, today-at-a-glance stat row, active vehicles map,
currently-idle table, vehicles-not-moved-today, per-vehicle daily KPI
roll-up, driver behaviour leaderboard, distance trend, alarm frequency,
idle cost MTD, utilisation heatmap, SLA row (collapsed, data-gated).
- 07_analytics_views.sql
Nine views in tracksolid.* wrapping the BA-file [DASHBOARD]-tagged
queries. Each view carries COMMENT ON VIEW with its spec section.
SELECT granted to grafana_ro. Smoke-tested against live DB.
- run_migrations.py
Register 06 and 07 in MIGRATIONS list with idempotent seed checks so
future fresh deploys apply them correctly.
- CLAUDE.md
Retire the tracksolid_2 schema references (schema no longer exists);
§9 Fleet State dated 2026-04-19 with correct pipeline status (running,
875 runs/24h, 0 failures) and accurate position_history row counts
(hypertable stats don't show in pg_stat_user_tables).
- docs/superpowers/specs/2026-04-19-daily-operations-dashboard-design.md
Design spec covering architecture, views, panel layout, deployment,
rollback, and known data gaps.
Full slide-by-slide copy for elicitation pitch: 6 pain questions, feature
reveal, business case, optional add-ons (RustFS + DuckDB), and one-pager.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
RustFS (S3-compatible blob) and DuckDB (historical analytics) added as
optional add-on tiers with elicitation pain questions and tier model.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Ingest scripts were connecting to the old tracksolid_2 database instead of
the timescale_db container in this stack. Grafana was already correct
(uses service name timescale_db:5432). Also strip leading space and quotes
from DATABASE_URL and API_BASE_URL so os.getenv() returns clean values.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Maps host port 5888 → container port 5432 so the DB can be reached
directly from the MacBook (requires UFW allow 5888/tcp on the server).
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
- §3: note tracksolid_2 as live schema, tracksolid as empty target;
add DB direct access tip (31.97.44.246:5888, leading space in .env)
- §4: add import_drivers_csv.py and migration 06 to codebase map
- §5: document tracksolid_2 live tables with column differences
(assigned_team vs cost_centre, city vs assigned_city); add ops.*
- §8: add rule 9 (Forgejo API auth via keychain) and rule 10
(always check active schema before querying)
- §9: update fleet state — pipeline stopped Apr 6, CSV fleet pending,
0 driver names, 19 stale positions
- §10: replace driver-name manual item with deploy + CSV import tasks
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Audit fixes across the ingestion stack:
Observability
- Move log_ingestion out of batch loops in poll_alarms and poll_parking
(was emitting N cumulative log rows per run instead of one).
- Add missing log_ingestion + t0 to poll_trips.
- Count inserted via cur.rowcount instead of naive +=1 so ON CONFLICT
DO NOTHING no longer inflates the metric.
Resilience
- SAVEPOINT-per-item added to poll_alarms, poll_live_positions,
poll_trips, poll_parking so one bad row no longer aborts the batch
(webhook handlers already had this; pollers were inconsistent).
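The SAVEPOINT-per-item loop looks roughly like this; the fakes stand in for a psycopg2 connection so the control flow can be shown without a database:

```python
class FakeCursor:
    """Records SQL instead of executing it (illustration only)."""
    def __init__(self):
        self.calls = []
    def execute(self, sql, params=None):
        self.calls.append(sql)
    def __enter__(self):
        return self
    def __exit__(self, *exc):
        return False

class FakeConn:
    def __init__(self):
        self.cur = FakeCursor()
    def cursor(self):
        return self.cur
    def commit(self):
        pass

def insert_batch(conn, items, insert_one):
    """Wrap each item in its own SAVEPOINT so one bad row rolls back
    alone instead of aborting the whole batch's transaction."""
    inserted = 0
    with conn.cursor() as cur:
        for item in items:
            cur.execute("SAVEPOINT item_sp")
            try:
                insert_one(cur, item)
            except Exception:
                cur.execute("ROLLBACK TO SAVEPOINT item_sp")
            else:
                cur.execute("RELEASE SAVEPOINT item_sp")
                inserted += 1
    conn.commit()
    return inserted
```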
Performance
- /pushgps and poll_track_list now use psycopg2.extras.execute_values
with ON CONFLICT DO NOTHING — 10-50x write throughput on larger
batches.
- sync_devices and sync_driver_audit fetch jimi.track.device.detail
concurrently via ThreadPoolExecutor(max_workers=8), cutting the
daily registry sync from ~24s to ~3s for an 80-device fleet.
- poll_track_list split into two phases: parallel API fetch (4 workers,
no DB connection held) then one batched write. Previously the DB
connection was held across every per-IMEI HTTP call, risking pool
starvation.
Security
- _validate_token uses hmac.compare_digest for constant-time token
comparison (closes timing side-channel).
- _parse_data_list caps incoming items at WEBHOOK_MAX_ITEMS (default
5000) so a pathological push cannot blow memory.
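Both hardening steps fit in a few lines; the names below are illustrative:

```python
import hmac

WEBHOOK_MAX_ITEMS = 5000  # default assumed from this commit

def validate_token(received: str, expected: str) -> bool:
    """Constant-time comparison closes the timing side-channel that a
    plain == comparison leaks on early mismatch."""
    return hmac.compare_digest(received.encode(), expected.encode())

def cap_items(items: list) -> list:
    """Drop anything beyond WEBHOOK_MAX_ITEMS so a pathological push
    cannot exhaust memory downstream."""
    return items[:WEBHOOK_MAX_ITEMS]
```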
Tests
- Fix test_null_alarm_type_skipped: its INSERT-count assertion was
catching the ingestion_log insert written by log_ingestion. Filter
that out so the test checks only data-table inserts.
- Full suite: 66 passed.
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
57 unit tests covering clean helpers, API signing, and field mapping fixes
(FIX-E06, FIX-M16, BUG-01, BUG-03); integration tests for webhook endpoints
with mocked DB; Forgejo CI workflow with TimescaleDB service container.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
CLAUDE.md: cached context file covering project identity, tech stack,
codebase map, schema quick-ref, API gotchas, fix history, working rules,
fleet state, and open items. Structured for maximum cache efficiency —
stable content first, dynamic state at the end.
docs/CONNECTIONS.md: connection parameter shapes (no secrets) for SSH,
DB, API, container resolution, Forgejo, Grafana, n8n.
docs/PROJECT_CONTEXT.md: client business context (telco field service,
3 cities, service types), data quality gaps, KPI framework by domain,
integration roadmap.
docs/KPI_FRAMEWORK.md: living KPI register with status tracking,
thresholds, client feedback log, and review checklist. To be co-developed
with client iteratively.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Post-deployment snapshot at ~00:15 EAT 2026-04-12. Key changes vs 260410:
- 3 trips recorded (FRED KMGW 538W HULETI, 6.94 km total) — pipeline validated
- FIX-M16 distance unit fix confirmed: implied speed matches API avgSpeed exactly
- 70 track_list fixes in 24h (was 13) — dense trail from active driving
- KDK 829A GP returned to primary depot from secondary Nairobi East cluster
- Uganda anomaly (X3-63282) persists — flagged for management
- Driver name root cause confirmed: not assigned in Tracksolid Pro UI
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
[FIX-M16] jimi.device.track.mileage returns distance in metres despite
docs claiming km. Confirmed: avgSpeed × runTimeSecond / 3600 = distance/1000.
poll_trips() now divides raw value by 1000 before storing as distance_km.
3 existing bad rows corrected in prod DB (distance_km / 1000).
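The confirmation identity reduces to one line; the numbers in the comment are illustrative, not fleet data:

```python
def implied_speed_kmh(distance_raw_m: float, run_time_s: float) -> float:
    """If the API's distance is metres, km travelled divided by hours
    should reproduce the API's avgSpeed (km/h)."""
    return (distance_raw_m / 1000.0) / (run_time_s / 3600.0)

# A 600 s trip at avgSpeed 40 km/h implies a raw distance of
# 40 * 600 / 3600 * 1000 metres; feeding that back recovers ~40 km/h.
```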
[FIX-M17] sync_devices() ON CONFLICT clause was only updating 5 of 26
fields, silently dropping driver_phone, sim, iccid, vehicle_name, status
etc. on subsequent syncs. Expanded to update all device fields so driver
assignments made in Tracksolid Pro UI propagate to DB on next daily sync.
Add sync_driver_audit.py: one-shot script to compare API vs DB device
registry, report driver/IMEI gaps, and force a full field upsert.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Coolify only copies docker-compose.yaml and .env to its working directory —
the ./grafana/provisioning bind mount source was always empty on the server,
so Grafana started with no datasource or dashboard configured (causing the
'Failed to load home dashboard' error).
Fix: build a custom Grafana image (grafana/Dockerfile) that COPYs the
provisioning directory at image build time. Grafana substitutes
${GRAFANA_DB_RO_PASSWORD} at startup from the env var now in Coolify's store.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
grafana_ro DB role was created with placeholder password 'SET_PASSWORD_IN_ENV'
and GRAFANA_DB_RO_PASSWORD was never set in .env, so Grafana's TracksolidDB
datasource could not authenticate — causing 'Failed to load home dashboard'.
Fix:
- Add GRAFANA_DB_RO_PASSWORD to .env with a secure generated password
- Add sync_role_passwords() to run_migrations.py — runs ALTER ROLE on every
startup so DB password stays in sync with the env var (idempotent)
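A sketch of the startup sync; the allowlist guard is needed because role names cannot be bound as query parameters, and the FakeCursor is a stand-in for illustration:

```python
class FakeCursor:
    """Records SQL instead of executing it (illustration only)."""
    def __init__(self):
        self.calls = []
    def execute(self, sql, params=None):
        self.calls.append((sql, params))

ALLOWED_ROLES = {"grafana_ro"}  # identifiers can't be parameterised

def sync_role_passwords(cur, passwords: dict) -> None:
    """Run ALTER ROLE for each managed role so the DB password always
    matches the env var; re-running with the same value is a no-op."""
    for role, pw in passwords.items():
        if role not in ALLOWED_ROLES:
            raise ValueError(f"unmanaged role: {role}")
        cur.execute(f"ALTER ROLE {role} WITH PASSWORD %s", (pw,))
```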
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
LOGIN/LOGOUT events from Jimi now persist to tracksolid.device_events.
Table already existed with correct schema (imei, event_type, event_time,
timezone, unique constraint). Follows same SAVEPOINT + log_ingestion
pattern as all other DB-writing endpoints.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
BUG-01: OBD event_time — try unix_to_ts before clean_ts (Jimi sends epoch ints)
BUG-02: push_alarm — guard alarm_type not null (NULL breaks ON CONFLICT dedup)
BUG-03: push_trip_report — _parse_trip_ts handles Jimi BCD format YYMMDDHHmmss
BUG-04: SAVEPOINT per item in all 5 DB endpoints (FK violation on one item no
longer aborts the whole batch; SAVEPOINT now inside try for safety)
BUG-05: Add /pushevent endpoint (log-only; was returning 404 to Jimi)
FIX: push_fault_info — skip null fault_code (NULL != NULL in PG unique index)
FIX: log_ingestion — pass SQL NULL not string "None" when no error occurred
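The YYMMDDHHmmss parse from BUG-03 can be sketched in one line; the real _parse_trip_ts presumably also falls back to other formats, and two-digit years map per strptime's %y convention:

```python
from datetime import datetime

def parse_trip_ts(raw: str) -> datetime:
    """Parse Jimi's 12-digit YYMMDDHHmmss trip timestamp.
    %y maps 00-68 to 2000-2068, 69-99 to 1969-1999."""
    return datetime.strptime(raw, "%y%m%d%H%M%S")
```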
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Reflects accurate field names, behaviours, and status from production:
Polling endpoints:
- 5.1 location.list: add full response schema (direction, gpsSignal, gpsNum,
powerValue, elecQuantity, posType, locDesc); add implementation note
(311 calls, ~19 devices/sweep, ~200ms, missing devices silently omitted)
- 5.4 track.mileage: add maxSpeed field (BUG-03); add distance unit note
(BUG-02 — values are km from API, corrected via migration 04)
- 5.5 track.list: add altitude/satellite fields; add POLL-01 implementation
note (30-min schedule, 35-min lookback, source='track_list', ~137s/call)
- 5.7 parking: clarify acc_type=0 required; note durSecond vs stopSecond;
add POLL-02 production status (60 calls, 0 rows, overnight expected)
- Rate limits: document track.list latency (~137s per call)
Alarms:
- 6.1: replace vague note with explicit poll-vs-push field name table
(alertTypeId/alarmTypeName vs alarmType/alarmName); confirm BUG-01 fix
verified in production (type 3 / "Vibration alert" now stored correctly)
Webhooks:
- 10.1 /pushevent: mark implemented (PUSH-01), db table
- 10.2 /pushhb: mark as not yet wired, table ready
- 10.4 /pushalarm: mark implemented, cross-ref field name table
- 10.7 /pushoil: mark implemented (PUSH-02), unit int→text note
- 10.9 /pushtem: mark implemented (PUSH-03)
- 10.10 /pushlbs: mark implemented (PUSH-04)
- 10.20 /pushobd: mark implemented, document OBD scalar extraction
- 10.21 /pushfaultinfo: mark not yet wired, table ready
- 10.22 /pushtripreport: mark implemented
Appendix B: full rewrite — split into polling and push tables with
accurate status (✅/⚠️/not used), call counts, and DB table references
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Full live-query refresh against tracksolid_db at 07:38 EAT 2026-04-11.
All data sourced directly from the server via 10 targeted psql queries.
Report covers: all 17 table row counts, full 63-device registry with
odometer/SIM/expiry, live position detail for all 19 reporting devices
with GPS signal quality, geographic cluster map, position_history by
source (poll=124 / track_list=13 = 137 total), alarm detail confirming
BUG-01 fix, ingestion log health (399 calls, 0 failures), subscription
status breakdown, silent device full list (44 devices), schema additions
verification, Grafana readiness matrix, and P0/P1/P2 action plan.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Updated report reflects state after migrations 04 and 05 are fully applied.
Includes: all 13 table row counts, fleet composition (63 devices / 4 models),
live position coverage (19/63), position history breakdown by source (poll vs
track_list), alarm detail (2 vibration alerts, BUG-01 fix confirmed), schema
health checklist, ingestion log polling summary, odometer service flags,
Uganda anomaly flag for X3-63282, data quality gap priority table, and
Grafana readiness assessment.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Containers share one DB: when ingest_movement applies 04 first, ingest_events
and webhook_receiver start later, find distance_m already renamed, and log a
spurious FATAL until the next restart picks up the recorded migration row.
Added sentinels for all four migrations so any container self-heals
on first startup regardless of which container ran first:
04 — trips.distance_km column exists
05 — tracksolid.device_events table exists
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Migration 02 and 03 were applied before the schema_migrations tracking
table existed, so they had no record and the runner tried to re-run them,
hitting non-idempotent TimescaleDB policy/trigger/cagg statements.
seed_pre_tracking_migrations() checks for sentinel schema objects and
inserts records for any migration that was clearly already applied:
- 02: tracksolid.devices table exists
- 03: position_history.altitude column exists
Called immediately after ensure_tracking_table() on every startup.
Safe on fresh databases (objects absent → nothing seeded → runs normally).
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
On a fresh database the tracksolid schema doesn't exist yet —
migration 02 creates it, but ensure_tracking_table() ran first.
Added CREATE SCHEMA IF NOT EXISTS tracksolid before the table DDL.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Add 04_bug_fix_migration.sql and 05_enhancement_migration.sql to list
- Use schema_migrations table to skip already-applied migrations (prevents
migration 04's RENAME from failing on re-run after first deployment)
- Expand CRITICAL_TABLES to include all 5 new tables from migration 05
- record_applied() writes to schema_migrations after each success
- Cleaner output: APPLY / SKIP / OK per file with summary count
On the next Coolify redeploy each container will skip 02-05 (already applied)
and apply any new migrations added in future commits.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
run_migrations.sh auto-discovers numbered SQL files (NN_*.sql),
tracks applied migrations in tracksolid.schema_migrations table,
and skips already-applied files — safe to run on every deployment.
Usage: bash /app/run_migrations.sh
Coolify: add to Post-deployment Command in service settings.
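The discover-and-skip logic (shell in the actual script) can be sketched in Python; the function name is illustrative:

```python
import re

def pending_migrations(files, applied):
    """Auto-discover numbered SQL files (NN_*.sql), sort by number,
    and skip anything already recorded in schema_migrations."""
    numbered = sorted(f for f in files if re.match(r"^\d{2}_.*\.sql$", f))
    return [f for f in numbered if f not in applied]
```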
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Captures first-night state of the tracksolid_db pipeline:
- 63 devices registered, 19 with live positions, 4 active today
- 3 vehicles with fresh GPS (<10 min): Westlands x2, Athi River x1
- X3-63282 located in Uganda — flagged for investigation
- KDK 829A GP (239k km) and Belta KCU-647D (234k km) flagged for service review
- Migration 04 and 05 not yet applied (distance_m column still present)
- Parking fix and trip polling not yet active (containers not redeployed)
- Prioritised action list for full operational readiness
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Coolify regenerates the container suffix on every redeploy, making
hardcoded names stale. All three docs now use:
TS_DB=$(docker ps --filter "name=timescale_db" --format "{{.Names}}" | head -1)
OPERATIONS_MANUAL.md: replaced bare connection string with full
tsdb() shell function, one-liner pattern, and multi-container
label-filter guidance.
tracksolid_DB_manual.md: updated header and connection example.
01_BusinessAnalytics.md: updated Step 5 migration commands.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Covers fleet utilisation, driver behaviour (speeding, harsh driving,
tardiness, after-hours movement), real-time dispatch queries, km per
driver per day, full business question inventory, Grafana dashboard
blueprint, and the 5-step roadmap to unlock remaining capabilities.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
POLL-01 (FIX-M14): Add poll_track_list() calling jimi.device.track.list
- Runs every 30 min with 35-min lookback window (5-min overlap prevents gaps)
- Inserts all device waypoints into position_history with source='track_list'
- Increases position density from ~1/min to 2-6 fixes/min per active vehicle
- Single shared DB connection for all devices per cycle (efficient)
POLL-03 (FIX-M15): Add get_device_locations() utility function
- Calls jimi.device.location.get for up to 50 specific IMEIs on demand
- Used for alarm enrichment, stale device recovery, dashboard precision refresh
Manual updates:
- position_history section rewritten to document dual ingestion sources
- Three new queries: data density check, harsh driving detection, route trace
- Known Data Issues: issues 10 and 11 added and marked Fixed
- API coverage table updated to reflect all three endpoints now in use
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Add sections 16–21: Daily, Weekly, Monthly, Quarterly analytics,
new table docs (device_events, fuel_readings, temperature_readings,
lbs_readings, geofences), and updated Known Data Issues
- Fix all distance queries: remove erroneous /1000000.0 division
(column is now distance_km in kilometres after migration 04)
- Update alarms section to reflect BUG-01 field mapping fix
- Update parking section to reflect POLL-02 acc_type/durSecond fix
- Rewrite "verify distance" section as accuracy cross-check query
- Expand row count query to include 5 new tables from migration 05
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
BUG-01 [FIX-E06]: jimi.device.alarm.list poll response uses alertTypeId/
alarmTypeName/alertTime, not the webhook field names. All 1,054 stored alarm
records had null alarm_type/alarm_name as a result. Corrected field mapping
in ingest_events_rev.py; also added alarm_name and source columns to INSERT.
BUG-02 [FIX-M11/M12]: trips.distance_m was storing millimetre-scale values
due to an erroneous * 1000 applied to the raw API value. Removed the
multiplication in poll_trips() and push_trip_report(). Column renamed to
distance_km in migration 04 (historical rows divided by 1,000,000 to correct
them to km).
All SQL in both ingestion files updated to reference distance_km.
POLL-02 [FIX-M13]: parking poll returned 0 rows because the required
account and acc_type=0 parameters were missing. Also fixed response field
mapping: durSecond was incorrectly read as 'seconds'.
Migration 04: corrects and renames distance_m → distance_km.
Migration 05: adds normalized OBD columns, alarm/device enrichment columns,
new tables (device_events, fuel_readings, temperature_readings, lbs_readings,
geofences), expands dwh_gold fact table, and adds refresh_daily_metrics() ETL.
tracksolid_DB_manual.md updated to reflect column rename and mark fixed issues.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Covers pre-deployment checklist, post-deploy verification steps for each
panel, database verification queries, troubleshooting guide, and
day-to-day NOC operations reference.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Adds a fully-provisioned Grafana dashboard for NOC operators to monitor
80 vehicles in real-time: live geomap with direction arrows, speed, driver
info, and color-coded plates. Includes datasource and dashboard provider
YAMLs, dashboard JSON (schemaVersion 39 / Grafana 11.0.0), and
docker-compose updates to mount provisioning at container start.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Port 8000 was already in use on the host. Updated uvicorn to listen
on 8888. Added 6 importable n8n workflow JSON files for Jimi push
data forwarding (OBD, faults, alarms, GPS, heartbeats, trips).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Comprehensive guide covering:
- Service architecture and scheduled tasks
- Per-service verification SQL queries grouped by service
- Health dashboard queries for monitoring
- Polling vs push coexistence and dedup strategy
- Environment variables, data retention, troubleshooting
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>