Skip to content

chore: test matrix#4

Merged
Ariel-Rodriguez merged 17 commits intomainfrom
reports
Jan 31, 2026
Merged

chore: test matrix#4
Ariel-Rodriguez merged 17 commits intomainfrom
reports

Conversation

@Ariel-Rodriguez
Copy link
Owner

Changes

Skill Impact

Testing

Checklist

  • Updated CHANGELOG.md (if skill change)
  • Updated README.md (if new skill)
  • Tested locally with AI assistant (if skill change)
  • Followed pseudocode format (no language-specific code)
  • Used AAA pattern for test examples
  • PR title follows format: <type>: <description>

Type

  • feat: New skill added
  • improve: Existing skill improved
  • fix: Bug fix or correction
  • docs: Documentation only changes
  • chore: Build, CI/CD, or tooling changes

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

1 similar comment
@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test parallel

@Ariel-Rodriguez
Copy link
Owner Author

/test

1 similar comment
@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@github-actions
Copy link

github-actions bot commented Jan 31, 2026

📊 Evaluation Results

Processed 24 evaluation(s).

Test Name Model Baseline With Skill Cases Pass Winner
results-ollama-devstral-small-2--24b-cloud-ps-composition-over-coordination devstral-small-2:24b-cloud good good ✅ 2/2 N/A
results-ollama-devstral-small-2--24b-cloud-ps-error-handling-design devstral-small-2:24b-cloud regular outstanding ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-explicit-boundaries-adapters devstral-small-2:24b-cloud good outstanding ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-explicit-ownership-lifecycle devstral-small-2:24b-cloud good good ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-explicit-state-invariants devstral-small-2:24b-cloud good outstanding ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-functional-core-imperative-shell devstral-small-2:24b-cloud regular good ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-illegal-states-unrepresentable devstral-small-2:24b-cloud good outstanding ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-local-reasoning devstral-small-2:24b-cloud good outstanding ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-minimize-mutation devstral-small-2:24b-cloud good good ✅ 2/2 N/A
results-ollama-devstral-small-2--24b-cloud-ps-naming-as-design devstral-small-2:24b-cloud regular good ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-policy-mechanism-separation devstral-small-2:24b-cloud good outstanding ✅ 2/2 With Skill
results-ollama-devstral-small-2--24b-cloud-ps-single-direction-data-flow devstral-small-2:24b-cloud regular good ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-composition-over-coordination rnj-1:8b-cloud outstanding good ❌ 2/2 Baseline
results-ollama-rnj-1--8b-cloud-ps-error-handling-design rnj-1:8b-cloud vague outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-explicit-boundaries-adapters rnj-1:8b-cloud regular outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-explicit-ownership-lifecycle rnj-1:8b-cloud good outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-explicit-state-invariants rnj-1:8b-cloud regular outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-functional-core-imperative-shell rnj-1:8b-cloud regular outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-illegal-states-unrepresentable rnj-1:8b-cloud outstanding outstanding ✅ 2/2 N/A
results-ollama-rnj-1--8b-cloud-ps-local-reasoning rnj-1:8b-cloud vague outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-minimize-mutation rnj-1:8b-cloud regular outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-naming-as-design rnj-1:8b-cloud vague good ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-policy-mechanism-separation rnj-1:8b-cloud regular outstanding ✅ 2/2 With Skill
results-ollama-rnj-1--8b-cloud-ps-single-direction-data-flow rnj-1:8b-cloud vague good ✅ 2/2 With Skill

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez
Copy link
Owner Author

/test skill ps-error-handling-design

@Ariel-Rodriguez
Copy link
Owner Author

/test skill ps-error-handling-design

@Ariel-Rodriguez
Copy link
Owner Author

/test skill ps-error-handling-design

@Ariel-Rodriguez
Copy link
Owner Author

/test

@Ariel-Rodriguez Ariel-Rodriguez merged commit a1ef90c into main Jan 31, 2026
25 of 26 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant