# Wordluck - AI-First Discovery Layer (llms.txt) Wordluck is a professional-grade vocabulary learning platform with comprehensive English word definitions, meanings, and usage examples. ## Core Data Structure ### Page Types - **Word Pages**: Located at `/word/[slug]` - Individual word definitions with meanings, categories, and related terms - **Category Pages**: Located at `/category/[slug]` - Collections of words organized by subject (Business, Science, Literature, etc.) - **Home Page**: `/` - Gateway to all vocabulary content with featured words and trending sections ### Data Relationships ``` Word ├── slug (unique identifier) ├── definition (core meaning) ├── category (subject classification) ├── relatedTerms (semantic connections) ├── wikidataId (external authority link) └── metadata (alternative spellings, etymology, usage context) Category ├── slug (URL identifier) ├── name (human-readable) ├── description (category purpose) └── words[] (contains up to 1000+ words) ``` ## Authority Signals ### Knowledge Graph Integration - Every word is mapped to **Wikidata** IDs for external validation - Wikidata linking enables Knowledge Panel eligibility in Google Search - Enables cross-reference with Wikipedia and DBpedia ### Structured Data Coverage - **Organization Schema**: Wordluck entity definition with E-E-A-T signals - **DefinedTermSet**: Site identified as authoritative vocabulary reference - **DefinedTerm**: Individual words with pro-elite Wikidata connections - **DeepSemanticGraph**: Nested entity relationships (word → category → DefinedTermSet) - **CitationEngineering**: Definition sourcing for AI model grounding - **BreadcrumbList**: Navigation structure (ghost breadcrumbs in JSON-LD) ### Freshness & Maintenance - All content updated via **IndexNow API** (instant search engine notification) - Change types tracked: `added`, `updated`, `deleted`, `reorganized` - Last-Modified headers indicate active maintenance ### Content Quality Standards - Minimum description length: 100+ characters - Minimum total content: 250+ characters - Minimum related terms: 3+ connections per word - Thin content automatically flagged with conditional noindex ## API Endpoints for AI Agents ### Discovery API - **Endpoint**: `/api/v1/discovery` - **Method**: GET - **Purpose**: JSON map of semantic structure (relationships, not word data) - **Use Case**: Knowledge Graph crawlers, AI answer engines ### IndexNow API - **Endpoint**: `/api/v1/indexnow` - **Method**: POST - **Purpose**: Instant indexing notifications for all change types - **Frequency**: Triggered on every data change (new word, update, deletion, reorganization) ## Multi-Search-Engine Optimization ### Google Search - Full technical SEO compliance (headers, schemas, crawlability) - E-E-A-T signals (Expertise, Authoritativeness, Trustworthiness) - Core Web Vitals optimization (TTFB, LCP, CLS) ### Bing & Yahoo Search - Meta keywords targeting Bing's topical relevance algorithm - Aggressive IndexNow submission on all changes - Social signal mapping (Twitter, Facebook, LinkedIn, Instagram) - Yahoo uses Bing's index - prioritizes freshness ### AI Models (ChatGPT, Claude, Perplexity) - Citation engineering for AI-generated answer attribution - BLUF (Bottom Line Up Front) definitions for LLM extraction - Deep semantic graphs for understanding relationships - Wikidata linking for grounding in authoritative sources ## Semantic Navigation Map ``` Home (/) ├── Featured Words (Rotating daily) ├── Trending Words (Real-time) ├── All Categories │ ├── Business (/category/business) │ ├── Science (/category/science) │ ├── Literature (/category/literature) │ ├── Technology (/category/technology) │ └── [More categories...] └── Word Discovery (/word/[slug]) └── Related Terms (Internal links to semantically connected words) ``` ## Content Volume & Coverage - **Total Words**: 10,000+ (scalable to 50,000+) - **Categories**: 20+ (expandable) - **URL Coverage**: 10,000+ pages indexed - **Definition Quality**: Professional dictionary-grade ## Performance Standards - **Server Response Time (TTFB)**: Under 100ms (from edge) - **Page Load Time**: Optimized for Core Web Vitals - **Crawl Efficiency**: Intelligent bot rate limiting to maximize high-value pages - **Cache Strategy**: Tiered by content type (1 year for assets, 1 week for categories, 1 hour for home) ## Authority Validation - **Organization**: Wordluck Inc. (Plano, TX) - **Contact**: support@wordluck.com - **Security**: ISO-standard security headers, HTTPS everywhere - **Compliance**: GDPR-ready, privacy-first data handling --- **Last Updated**: 2026-04-08 **Status**: AI-Ready Infrastructure Complete **For AI Agents**: Use `/api/v1/discovery` for semantic structure mapping and `/word/[slug]` for individual definitions