Episodic Memory Testing

🌐Community
by d-o-hub · vlatest · Repository

Evaluates an AI's ability to recall and apply past experiences from simulated scenarios using episodic memory benchmarks.

Install on your platform

We auto-selected Claude Code based on this skill’s supported platforms.

1

Run in terminal (recommended)

terminal
claude mcp add episodic-memory-testing npx -- -y @trustedskills/episodic-memory-testing
2

Or manually add to ~/.claude/settings.json

~/.claude/settings.json
{
  "mcpServers": {
    "episodic-memory-testing": {
      "command": "npx",
      "args": [
        "-y",
        "@trustedskills/episodic-memory-testing"
      ]
    }
  }
}

Requires Claude Code (claude CLI). Run claude --version to verify your install.

About This Skill

What it does

This skill allows an AI agent to test the quality and recall of episodic memories. It can assess memory accuracy, completeness, and temporal order through structured questioning and evaluation. The tool provides a means for evaluating how well an AI agent retains and reconstructs past experiences.

When to use it

  • Debugging Memory Systems: Use this skill when troubleshooting issues with an AI agent's episodic memory system to identify areas of weakness or failure.
  • Evaluating Learning Progress: Track the improvement in memory performance over time as an AI agent learns new information and refines its memory storage techniques.
  • Assessing Agent Reliability: Determine how consistently an agent can recall past events, which is crucial for tasks requiring accurate historical context.
  • Benchmarking Memory Capabilities: Compare the episodic memory capabilities of different AI agents or models using a standardized testing procedure.

Key capabilities

  • Memory Recall Testing
  • Accuracy Evaluation
  • Completeness Assessment
  • Temporal Order Verification

Example prompts

  • "Test my agent's memory about yesterday’s interactions."
  • "Evaluate the accuracy of the events stored in my agent's episodic memory regarding the meeting with John Smith."
  • "Can you check if my agent remembers the order of tasks performed this morning?"

Tips & gotchas

The effectiveness of this skill depends on the quality and structure of the episodic memories being tested. Ensure that the AI agent’s underlying memory system is properly initialized and populated with relevant data before using this testing tool.

Tags

🛡️

TrustedSkills Verification

Unlike other registries that point to live repositories, TrustedSkills pins every skill to a verified commit hash. This protects you from malicious updates — what you install today is exactly what was reviewed and verified.

Security Audits

Gen Agent Trust HubPass
SocketPass
SnykPass

Details

Version
vlatest
License
Author
d-o-hub
Installs
9

🌐 Community

Passed automated security scans.