Files
aitbc/.windsurf/skills/ollama-gpu-testing-skill.md
aitbc f36fd45d28
Some checks failed
Blockchain Synchronization Verification / sync-verification (push) Successful in 4s
Documentation Validation / validate-docs (push) Successful in 12s
Documentation Validation / validate-policies-strict (push) Successful in 3s
Integration Tests / test-service-integration (push) Failing after 12s
Multi-Node Blockchain Health Monitoring / health-check (push) Successful in 3s
P2P Network Verification / p2p-verification (push) Successful in 2s
Python Tests / test-python (push) Successful in 10s
Security Scanning / security-scan (push) Successful in 31s
Implement RECEIPT_CLAIM transaction type
- Add status fields to Receipt model (status, claimed_at, claimed_by)
- Add RECEIPT_CLAIM handling to state_transition.py with validation and reward minting
- Add type field to Transaction model for reliable transaction type storage
- Update router to use TransactionRequest model to preserve type field
- Update poa.py to extract type from mempool transaction content and store only original payload
- Add RECEIPT_CLAIM to GasType enum with gas schedule
2026-04-22 13:35:31 +02:00

199 lines
6.5 KiB
Markdown

---
description: Atomic Ollama GPU inference testing with deterministic performance validation and benchmarking
title: ollama-gpu-testing-skill
version: 1.1
---
# Ollama GPU Testing Skill
## Purpose
Test and validate Ollama GPU inference performance, GPU provider integration, payment processing, and blockchain recording with deterministic benchmarking metrics.
## Activation
Trigger when user requests Ollama GPU testing: inference performance validation, GPU provider testing, payment processing validation, or end-to-end workflow testing.
## Input
```json
{
"operation": "test-gpu-inference|test-payment-processing|test-blockchain-recording|test-end-to-end|comprehensive",
"model_name": "string (optional, default: llama2)",
"test_prompt": "string (optional for inference testing)",
"test_wallet": "string (optional, default: test-client)",
"payment_amount": "number (optional, default: 100)",
"gpu_provider": "string (optional, default: aitbc-host-gpu-miner)",
"benchmark_duration": "number (optional, default: 30 seconds)",
"inference_count": "number (optional, default: 5)"
}
```
## Output
```json
{
"summary": "Ollama GPU testing completed successfully",
"operation": "test-gpu-inference|test-payment-processing|test-blockchain-recording|test-end-to-end|comprehensive",
"test_results": {
"gpu_inference": "boolean",
"payment_processing": "boolean",
"blockchain_recording": "boolean",
"end_to_end_workflow": "boolean"
},
"inference_metrics": {
"model_name": "string",
"inference_time": "number",
"tokens_per_second": "number",
"gpu_utilization": "number",
"memory_usage": "number",
"inference_success_rate": "number"
},
"payment_details": {
"wallet_balance_before": "number",
"payment_amount": "number",
"payment_status": "success|failed",
"transaction_id": "string",
"miner_payout": "number"
},
"blockchain_details": {
"transaction_recorded": "boolean",
"block_height": "number",
"confirmations": "number",
"recording_time": "number"
},
"gpu_provider_status": {
"provider_online": "boolean",
"gpu_available": "boolean",
"provider_response_time": "number",
"service_health": "boolean"
},
"issues": [],
"recommendations": [],
"confidence": 1.0,
"execution_time": "number",
"validation_status": "success|partial|failed"
}
```
## Process
### 1. Analyze
- Validate GPU testing parameters and operation type
- Check Ollama service availability and GPU status
- Verify wallet balance for payment processing
- Assess GPU provider availability and health
### 2. Plan
- Prepare GPU inference testing scenarios
- Define payment processing validation criteria
- Set blockchain recording verification strategy
- Configure end-to-end workflow testing
### 3. Execute
- Test Ollama GPU inference performance and benchmarks
- Validate payment processing and wallet transactions
- Verify blockchain recording and transaction confirmation
- Test complete end-to-end workflow integration
### 4. Validate
- Verify GPU inference performance metrics
- Check payment processing success and miner payouts
- Validate blockchain recording and transaction confirmation
- Confirm end-to-end workflow integration and performance
## Constraints
- **MUST NOT** submit inference jobs without sufficient wallet balance
- **MUST** validate Ollama service availability before testing
- **MUST** monitor GPU utilization during inference testing
- **MUST** handle payment processing failures gracefully
- **MUST** verify blockchain recording completion
- **MUST** provide deterministic performance benchmarks
## Environment Assumptions
- Ollama service running on port 11434
- GPU provider service operational (aitbc-host-gpu-miner)
- AITBC CLI accessible for payment and blockchain operations
- Test wallets configured with sufficient balance
- GPU resources available for inference testing
## Error Handling
- Ollama service unavailable → Return service status and restart recommendations
- GPU provider offline → Return provider status and troubleshooting steps
- Payment processing failures → Return payment diagnostics and wallet status
- Blockchain recording failures → Return blockchain status and verification steps
## Example Usage Prompt
```
Run comprehensive Ollama GPU testing including inference performance, payment processing, blockchain recording, and end-to-end workflow validation
```
## Expected Output Example
```json
{
"summary": "Comprehensive Ollama GPU testing completed with optimal performance metrics",
"operation": "comprehensive",
"test_results": {
"gpu_inference": true,
"payment_processing": true,
"blockchain_recording": true,
"end_to_end_workflow": true
},
"inference_metrics": {
"model_name": "llama2",
"inference_time": 2.3,
"tokens_per_second": 45.2,
"gpu_utilization": 78.5,
"memory_usage": 4.2,
"inference_success_rate": 100.0
},
"payment_details": {
"wallet_balance_before": 1000.0,
"payment_amount": 100.0,
"payment_status": "success",
"transaction_id": "tx_7f8a9b2c3d4e5f6",
"miner_payout": 95.0
},
"blockchain_details": {
"transaction_recorded": true,
"block_height": 12345,
"confirmations": 1,
"recording_time": 5.2
},
"gpu_provider_status": {
"provider_online": true,
"gpu_available": true,
"provider_response_time": 1.2,
"service_health": true
},
"issues": [],
"recommendations": ["GPU inference optimal", "Payment processing efficient", "Blockchain recording reliable"],
"confidence": 1.0,
"execution_time": 67.8,
"validation_status": "success"
}
```
## Model Routing Suggestion
**Fast Model** (Claude Haiku, GPT-3.5-turbo)
- Basic GPU availability checking
- Simple inference performance testing
- Quick service health validation
**Reasoning Model** (Claude Sonnet, GPT-4)
- Comprehensive GPU benchmarking and performance analysis
- Payment processing validation and troubleshooting
- End-to-end workflow integration testing
- Complex GPU optimization recommendations
**Coding Model** (Claude Sonnet, GPT-4)
- GPU performance optimization algorithms
- Inference parameter tuning
- Benchmark analysis and improvement strategies
## Performance Notes
- **Execution Time**: 10-30 seconds for basic tests, 60-120 seconds for comprehensive testing
- **Memory Usage**: <300MB for GPU testing operations
- **Network Requirements**: Ollama service, GPU provider, blockchain RPC connectivity
- **Concurrency**: Safe for multiple simultaneous GPU tests with different models
- **Benchmarking**: Real-time performance metrics and optimization recommendations