FactSheet for AI Systems

This FactSheet provides transparency regarding the Artificial Intelligence (AI) product offered by Rep'd. It details system purpose, training data, model architecture, and risk management strategies in alignment with GovAI Coalition standards.

1. System Overview

Vendor Name Rep'd
System Name Rep'd
Overview Rep'd is an AI-powered video platform for local government. We provide municipalities with an AI-powered Q&A system that answers resident questions instantly through pre-recorded staff videos and generative AI text responses. All sources are immediately visible to residents when a question is answered via AI-generated text, and every video is reviewable by staff before being published or shared with residents.
Purpose Rep'd helps municipalities:
  • Build Trust: Rep'd "humanizes" local government by introducing names, faces, and context to sensitive or complex issues.
  • Stop Misinformation: Rep'd provides local government with a megaphone to correct misinformation before it spreads.
  • Save Time: AI answers simple resident questions without staff involvement, and prevents misunderstandings from devolving into time-consuming crises.
Intended Domain Improving local government communications by way of saving municipality staff time while providing critical context and a human tone to complex or sensitive local issues.

2. Technical Specifications & Data

Training Data Our AI systems are bespoke for each municipal partner. Each system is trained via a combination of publicly-available municipal website URLs, domains, and documents. All sources are identified, approved, and provided to us by our municipal partners and are legally obtained. We add data to the training set as instructed by our municipal partners. Most partners provide the majority of source material upfront to minimize the need for constant retraining.
Test Data We run extensive testing on every aspect of the platform, including AI-powered translation, transcription, sentiment identification, and automated categorization. Testing is typically conducted in a private setting using specific source material provided by municipal partners (websites, domains, documents) prior to launch.
Model Information The AI system utilizes a multi-model architecture including:
  • AWS: Neural machine translation (AWS Translate) and speech-to-text processing (AWS Transcribe) for transcription and translation.
  • OpenAI: Hybrid model approach using GPT-4o for primary answer generation and quality assessment, with GPT-3.5-turbo for sentiment analysis, question categorization, and content classification. Text embeddings use text-embedding-3-small.
Update Procedure Municipal data is continuously updated through automated scraping processes that parse sitemaps, crawl website content, and process documents (PDFs, Excel files). New content is chunked and embedded into client-specific vector databases. OpenAI and AWS models are managed via API versioning with no direct client control. Staff can review and override AI responses before publication.
Inputs and Outputs Inputs: Text (user questions), video/audio files (for transcription), municipal website URLs, documents (PDFs, Excel files), and structured government data.
Outputs: AI-generated text answers with source citations, transcribed speech, translated text, sentiment scores (1-3 scale), question categories, confidence scores (0-1 scale with good/bad grades), and social media content.

3. Performance & Risk Management

Performance Metrics
  • Confidence Scores: AI-generated responses receive confidence scores (0-1 scale) with "good"/"bad" grade classifications.
  • Vector Similarity: Semantic search uses cosine similarity scores for source matching and relevance ranking.
  • Token Usage: API token consumption tracked for cost management.
  • Source Traceability: All answers include URL citations to original sources.
  • User Feedback: Thumbs up/down mechanisms with optional rejection notes stored for quality improvement.
  • Monitoring: Continuous API response validation and manual audits.
Bias Mitigation Translation and transcription biases are addressed through human review. Sentiment analysis uses standardized prompts with GPT-3.5-turbo. Government data retrieval is strictly limited to factual, sourced information from official municipal sources to avoid misrepresentation.
Robustness The system flags low confidence scores for human review via feedback mechanisms (thumbs up/down). Deep search functionality performs independent web verification for rejected answers. Continuous monitoring and manual validation enhance system robustness.
Optimal Conditions Operates optimally with structured inputs, well-formatted text, and clear audio. High-quality structured source material significantly improves retrieval accuracy.
Poor Conditions Performance declines with noisy audio or ambiguous text. Translation accuracy may lower for highly technical idioms. Hallucinations are minimized via RAG, but may occur if government source data lacks relevant information.
Explanation & Transparency Each AI response includes confidence scores and source traceability. All Government data retrieval outputs link back to original official sources, ensuring transparency. Logic is continuously refined for user comprehension.
Jurisdiction & Data Protection Data is processed in compliance with applicable privacy regulations. AWS services (Translate, Transcribe) and OpenAI APIs are used with appropriate data handling practices. Municipal data is stored in client-specific database schemas for data isolation. Security protocols safeguard sensitive information across all processes.

4. Impact Assessment

Monitoring & Human Oversight AI-generated responses are logged and periodically reviewed. Human reviewers can override outputs. While overrides do not currently retrain models automatically, they are used in manual audits to calibrate future system parameters.
Bias Management AWS and OpenAI apply built-in mitigation. We rely on verified government sources for data retrieval to minimize misinformation. Sentiment analysis and categorization use standardized prompts to ensure consistency across different question types.
Flagging Issues Users can report AI performance issues through integrated feedback mechanisms. Reports are manually reviewed, and adjustments are made to filters and overrides.