AI Tool Evaluation Scorecard
Systematically compare 2-4 AI tools for a specific business need — weighted scoring across cost, capability, ease of use, integration, and risk. Eliminates gut-feel tool decisions and produces a defensible recommendation.
Run this prompt once per major tool decision. Paste in pricing pages and feature lists from each vendor for the most accurate comparison.
How to use this prompt
- Pick your AI model. Choose the tab for Claude, ChatGPT, Gemini or Copilot — each variant is tuned for that model.
- Copy the full prompt. Click Copy Full Prompt to copy the text to your clipboard.
- Paste into your AI tool. Open your chosen model and paste the prompt into a new chat.
- Replace the [placeholders]. Swap each bracketed field for your own details, such as the business need, the tools to compare, your budget and any deal-breakers.
- Run and refine. Review the output. If anything is off, ask the AI to tighten tone, length or format.
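If you reuse the prompt often, the placeholder step can be scripted. The snippet below is a minimal sketch, not part of the prompt itself: it assumes every placeholder is a pair of square brackets with no brackets nested inside, and the function names `fill_placeholders` and `unfilled` and the sample values are made up for illustration. It fills the fields you supply and lists any you forgot.

```python
import re

# A shortened template for illustration; paste the full prompt text here instead.
TEMPLATE = """Problem: [WHAT PROBLEM]
Tools: [TOOL_1], [TOOL_2], [TOOL_3]
Budget: [MONTHLY]"""

# Illustrative values only. Keys are the exact text between the brackets.
values = {
    "WHAT PROBLEM": "automate customer email responses",
    "TOOL_1": "Tool A",
    "TOOL_2": "Tool B",
    "MONTHLY": "$200",
}

# Assumption: a placeholder is any [ ... ] span with no nested brackets.
PLACEHOLDER = re.compile(r"\[([^\[\]]+)\]")

def fill_placeholders(template, values):
    """Replace bracketed fields that have a value; leave the rest untouched."""
    def swap(match):
        return values.get(match.group(1), match.group(0))
    return PLACEHOLDER.sub(swap, template)

def unfilled(text):
    """Return the contents of any bracketed fields still present."""
    return PLACEHOLDER.findall(text)

filled = fill_placeholders(TEMPLATE, values)
print(filled)
print("Still to fill:", unfilled(filled))
```

Checking for leftover brackets before you paste helps avoid sending the model a literal `[TOOL_3]` to evaluate.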
Prompt Variants by Model
You are a technology advisor helping a small business owner evaluate AI tools.
<evaluation_request>
Business need: [WHAT PROBLEM ARE YOU SOLVING — e.g., "automate customer email responses"]
Tools to compare: [TOOL_1], [TOOL_2], [TOOL_3]
Budget: [MONTHLY_BUDGET]
Team technical skill: [non-technical / somewhat technical / technical]
Must-have integrations: [e.g., "Slack, Gmail, Shopify"]
Deal-breakers: [e.g., "no per-seat pricing", "must have API", "GDPR compliant"]
</evaluation_request>
Build a weighted evaluation scorecard:
1. **Criteria definition** — define 8-10 evaluation criteria grouped into: Capability (does it solve the problem?), Usability (can my team actually use it?), Cost (total cost of ownership over 12 months), Integration (works with my stack?), Risk (vendor stability, data privacy, lock-in).
2. **Weight assignment** — assign weights (must total 100%) based on my stated priorities and deal-breakers.
3. **Scoring matrix** — score each tool 1-5 on every criterion. Show your reasoning for each score in one sentence.
4. **Total scores** — weighted total for each tool, ranked.
5. **Recommendation** — which tool to choose, with one paragraph explaining why. Include caveats and what would change the recommendation.
6. **Trial plan** — a 7-day test plan for the top pick, with specific tasks to validate each day.
Act as a technology consultant who helps small businesses choose the right AI tools. You are objective and never favor any vendor.
I need to compare these AI tools for my business:
Problem I am solving: [WHAT PROBLEM]
Tools to compare: [TOOL_1], [TOOL_2], [TOOL_3]
Monthly budget: [AMOUNT]
My team's technical level: [non-technical / somewhat technical / technical]
Must integrate with: [e.g., Slack, Gmail, Shopify]
Deal-breakers: [e.g., "no per-seat pricing", "needs API"]
Create a weighted evaluation scorecard:
1. Define 8-10 criteria in these groups: Capability, Usability, Cost, Integration, Risk
2. Assign weights totaling 100% based on my priorities
3. Score each tool 1-5 on every criterion with one-sentence reasoning
4. Calculate weighted totals, rank the tools
5. Give me a clear recommendation with caveats
6. Design a 7-day trial plan for the winner — specific tasks each day to validate
Be honest about weaknesses. I want a real comparison, not a sales pitch.
I need to evaluate AI tools for my business. Help me make a data-driven decision.
**Problem to solve:** [WHAT PROBLEM]
**Tools to compare:** [TOOL_1], [TOOL_2], [TOOL_3]
**Monthly budget:** [AMOUNT]
**Team technical skill:** [non-technical / somewhat technical / technical]
**Must integrate with:** [e.g., Slack, Gmail, Shopify]
**Deal-breakers:** [list any]
Research the current pricing, features, and recent reviews for each tool, then:
1. Define 8-10 evaluation criteria (Capability, Usability, Cost, Integration, Risk)
2. Weight them based on my priorities (total = 100%)
3. Score each tool 1-5 per criterion with reasoning
4. Calculate weighted totals and rank
5. Recommend the best option with caveats
6. Create a 7-day trial plan for the top pick
Use current data, not outdated information. Be objective.
Help me compare AI tools for my business.
Problem: [WHAT PROBLEM]
Tools: [TOOL_1], [TOOL_2], [TOOL_3]
Budget: [MONTHLY]
Team skill: [non-technical / somewhat technical / technical]
Must work with: [existing tools]
Deal-breakers: [list]
Build a scorecard:
1. 8-10 evaluation criteria across Capability, Usability, Cost, Integration, Risk
2. Weight each criterion (total 100%)
3. Score each tool 1-5 with brief reasoning
4. Weighted totals and ranking
5. Clear recommendation with caveats
6. 7-day trial plan for the top pick
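The weighted totals these prompts ask for are simple arithmetic, so it can be worth re-computing them yourself once the model returns its scorecard. The sketch below assumes weights expressed as percentages that total 100 and scores on the 1-5 scale; the tool names, weights and scores are invented for illustration, and it uses the five groups as criteria for brevity where a real scorecard would list 8-10.

```python
# Hypothetical weights in percent; they must total 100.
weights = {
    "Capability": 30,
    "Usability": 20,
    "Cost": 20,
    "Integration": 15,
    "Risk": 15,
}

# Hypothetical 1-5 scores, as you would copy them from the model's scoring matrix.
scores = {
    "Tool A": {"Capability": 4, "Usability": 3, "Cost": 5, "Integration": 4, "Risk": 3},
    "Tool B": {"Capability": 5, "Usability": 4, "Cost": 2, "Integration": 3, "Risk": 4},
    "Tool C": {"Capability": 3, "Usability": 5, "Cost": 4, "Integration": 2, "Risk": 4},
}

def weighted_total(tool_scores, weights):
    """Weighted average on the 1-5 scale: sum(weight * score) / 100."""
    if sum(weights.values()) != 100:
        raise ValueError("weights must total 100")
    for criterion in weights:
        if not 1 <= tool_scores[criterion] <= 5:
            raise ValueError(f"score for {criterion} must be between 1 and 5")
    return sum(weights[c] * tool_scores[c] for c in weights) / 100

totals = {tool: weighted_total(s, weights) for tool, s in scores.items()}

# Rank from highest to lowest weighted total.
ranking = sorted(totals.items(), key=lambda item: item[1], reverse=True)
for rank, (tool, total) in enumerate(ranking, start=1):
    print(f"{rank}. {tool}: {total:.2f}")
```

With these made-up numbers, Tool A comes out at 3.85, Tool B at 3.75 and Tool C at 3.60. If your own totals differ from the model's, ask it to show the calculation for the criterion where they diverge.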
Frequently Asked Questions
What does the AI Tool Evaluation Scorecard prompt do?
It systematically compares 2-4 AI tools for a specific business need, using weighted scoring across cost, capability, ease of use, integration, and risk. The result replaces gut-feel tool decisions with a defensible recommendation.
Which AI models is this prompt tested on?
This prompt is field-tested on Claude, ChatGPT, Gemini and Copilot. Each model has its own optimized variant above.
Do I need a paid AI account to use this prompt?
No. This prompt is written to run on the free tier of Claude, ChatGPT, Gemini and Copilot. Paid tiers simply give you longer context windows and faster responses.
Can I customize this prompt for my business?
Yes. Any text inside square brackets is a placeholder you replace with your own business details, such as the problem you are solving, the tools to compare, your budget or your deal-breakers. You can also ask the AI to adjust format, length or style after the first output.
When was this prompt last verified?
Each model variant above shows its own freshness stamp. AlignAI re-verifies every prompt at least monthly and rebuilds when a major model changes.