task-gdpval-3
Prompt
Reference Files (25)
Download all (.zip)expert_contributed — Author-generated; mirrors a skeleton/template presentation deck to be populated.
expert_contributed — Author-generated; mirrors a deck outline / structure brief.
expert_contributed — Author-generated; mirrors an annual plans / budget workbook.
expert_contributed — Author-generated; mirrors a category-management framework workbook.
expert_contributed — Author-generated; mirrors a chart-of-accounts extract.
expert_contributed — Author-generated; mirrors a contracts register.
expert_contributed — Author-generated; mirrors an internal email thread used as judgmental evidence.
expert_contributed — Author-generated; mirrors an internal email thread used as judgmental evidence.
expert_contributed — Author-generated; mirrors an internal email thread used as judgmental evidence.
expert_contributed — Author-generated; mirrors an internal email thread used as judgmental evidence.
expert_contributed — Author-generated; mirrors an internal email thread used as judgmental evidence.
expert_contributed — Author-generated; mirrors an internal email thread used as judgmental evidence.
expert_contributed — Author-generated; mirrors a general-ledger master extract.
expert_contributed — Author-generated; mirrors a stakeholder interview transcript.
expert_contributed — Author-generated; mirrors a stakeholder interview transcript.
expert_contributed — Author-generated; mirrors a stakeholder interview transcript.
expert_contributed — Author-generated; mirrors a stakeholder interview transcript.
expert_contributed — Author-generated; mirrors an organizational chart.
expert_contributed — Author-generated; mirrors a procurement procedures document.
expert_contributed — Author-generated; mirrors a procurement policy document.
expert_contributed — Author-generated; mirrors a purchase-agreements register.
expert_contributed — Author-generated; mirrors a supplier/bid scoring framework workbook.
expert_contributed — Author-generated; mirrors a spend-code-to-UNSPSC mapping table.
expert_contributed — Author-generated; mirrors a raw spend-data extract.
expert_contributed — Author-generated; mirrors a Teams chat export used as judgmental evidence.
Gold Deliverables (1)
Download all (.zip)Gold Trajectory
Scoring Rubric
| ID | Criterion | Category | Pts |
|---|---|---|---|
| 1.1 | Identifies the six Level 1 families in category_framework.xlsx | Extraction | 2 |
| 1.2 | Identifies the four Marketing & Communications subcategories (Events, Agency Fees, Market Research, Promotions and Advertising) | Extraction | 2 |
| 1.3 | Combines spend codes 22000005 and 22001115 into one Events subcategory (shared UNSPSC family 80141600) | Calculation | 5 |
| 1.4 | Maps code 19001515 to Legal Services on the 190xxxxx cost-centre range (not the description) | Reasoning | 3 |
| 1.5 | Maps code 20001713 to Management Consulting on the 200xxxxx cost-centre range | Reasoning | 3 |
| 1.6 | Maps code 22001105 (satisfaction survey) to Marketing & Communications (Market Research), not HR | Reasoning | 3 |
| ID | Criterion | Category | Pts |
|---|---|---|---|
| 2.1 | Remaps all 8 retired 80-prefix rows (AED 5.1M) using the email_thread_1 crosswalk before totalling | Calculation | 4 |
| 2.2 | States total classified spend as AED 263.6M | Calculation | 2 |
| 2.3 | States Digital Technology as the largest family at AED 175.8M (66.7%) | Calculation | 2 |
| 2.4 | States the Marketing & Communications blank-contract-reference rate as 56.2% (27 of 48 POs) | Calculation | 2 |
| 2.5 | Concludes the blank contract references are a system linkage gap (agreements exist, ERP-Linked = No/Partial), not off-contract spend | Reasoning | 5 |
| 2.6 | Identifies DMG as the only supplier with no agreement in place (AED 6.6M Events spend) | Calculation | 4 |
| 2.7 | Identifies the December concentration as the Microsoft Enterprise Agreement December true-up (cumulative AED 154.0M across Years 1–3; latest Year 3) | Reasoning | 4 |
| 2.8 | States December spend as AED 154.8M (58.7% of total) | Calculation | 2 |
| ID | Criterion | Category | Pts |
|---|---|---|---|
| 3.1 | States Digital Technology addressable spend as AED 21.8M (8.3%) after removing the AED 154.0M locked Microsoft agreement | Calculation | 5 |
| 3.2 | Excludes Management Consulting for conflict of interest (incumbents advise the transformation) | Reasoning | 4 |
| 3.3 | Excludes Legal Services citing the General Counsel's three reasons (confidential active matters; panel runs another year; review signals instability) | Reasoning | 4 |
| 3.4 | Identifies Management Consulting as the highest raw scorer before constraints are applied | Calculation | 2 |
| 3.5 | Selects Marketing & Communications as the pilot category | Reasoning | 2 |
| ID | Criterion | Category | Pts |
|---|---|---|---|
| 4.1 | States Events as the largest M&C subcategory at AED 27.1M | Calculation | 2 |
| 4.2 | States the Events non-competitive sourcing rate as 100.0% | Calculation | 4 |
| 4.3 | Uses teams_chat.docx to classify the blank-route Events rows as non-competitive | Reasoning | 3 |
| 4.4 | Identifies the main conference venue as fixed and government-hosted | Reasoning | 3 |
| 4.5 | Identifies the execution services (stand, AV, hospitality, logistics) as contestable | Reasoning | 3 |
| 4.6 | Selects Events as the pilot subcategory | Reasoning | 2 |
| ID | Criterion | Category | Pts |
|---|---|---|---|
| 5.1 | Identifies the 12 FY26 demand items with no confirmed budget (TBC) | Calculation | 2 |
| 5.2 | Uses the email_thread_6 template to request the missing budgets | Artefact completeness | 3 |
| 5.3 | Classifies events into three tiers: Strategic Fixed, Planned Flexible, Emerging/TBC | Reasoning | 2 |
| 5.4 | Flags the Global Manufacturing Summit supplier switch (DMG to DXBLive) as having no contract / no procurement process | Reasoning | 4 |
| ID | Criterion | Category | Pts |
|---|---|---|---|
| 6.1 | Populates all nine skeleton slides (Title, Executive summary, Why this pilot, Agenda, Spend picture, Internal requirements, Strategy, Initiatives, Roadmap) with no bracketed placeholder text remaining | Artefact completeness | 3 |
| 6.2 | Executive summary slide carries an action title and states the AED 27.1M Events opportunity as the headline figure | Artefact completeness | 2 |
| 6.3 | Spend-picture slide includes a chart/data visual and a data-quality note (80-prefix remap and supplier-variant merge) | Artefact completeness | 2 |
| 6.4 | Initiatives slide ties each priority initiative to the specific finding that drives it | Reasoning | 2 |
| 6.5 | Roadmap slide sequences initiatives across three horizons (0-3 / 3-6 / 6-12 months) and lists three next steps with named owners and dates | Artefact completeness | 1 |
| 6.6 | Headline figures in the deck are internally consistent with the analysis (AED 263.6M total; Digital Technology 175.8M/66.7%; December 154.8M/58.7%; Events 27.1M; 100% non-competitive) | Artefact consistency | 2 |