refactor(locomo): use clack prompts, add full context, partial run #51
Code Review
This pull request refactors the LoCoMo benchmark tool, introducing an interactive CLI powered by @clack/prompts, a checkpointing system for resuming interrupted runs, and a "Full Context" baseline comparison mode. The orchestration logic has been moved to a new runner.ts, and state persistence is handled via checkpoint.ts. Feedback suggests enhancing type safety by utilizing Zod for JSON parsing in checkpoint.ts and cli.ts, restoring progress reporting during the ingestion phase, and deduplicating the runWithConcurrency utility across the codebase.
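For reference, a deduplicated `runWithConcurrency` could look roughly like the sketch below. The signature, the `onProgress` callback, and the worker-lane approach are assumptions for illustration, not the code in this PR. The optional callback is one way the ingestion phase could report progress again.

```typescript
// Hypothetical shared helper; signature and names are assumptions, not the PR's code.
async function runWithConcurrency<T, R>(
  items: readonly T[],
  limit: number,
  worker: (item: T, index: number) => Promise<R>,
  onProgress?: (done: number, total: number) => void,
): Promise<R[]> {
  if (!Number.isInteger(limit) || limit < 1) {
    throw new RangeError("limit must be a positive integer");
  }

  const results = new Array<R>(items.length);
  let next = 0; // index of the next unclaimed item
  let done = 0; // number of items finished so far

  // Each lane claims the next index until none remain.
  const lane = async (): Promise<void> => {
    while (next < items.length) {
      const index = next++;
      results[index] = await worker(items[index] as T, index);
      done++;
      onProgress?.(done, items.length);
    }
  };

  // Start at most `limit` lanes; results keep the input order.
  const lanes = Array.from({ length: Math.min(limit, items.length) }, () => lane());
  await Promise.all(lanes);
  return results;
}
```

Keeping a single copy of a helper like this in a shared module would let the other call sites import it instead of redefining it.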
/gemini review
Code Review
This pull request refactors the LoCoMo benchmark into an interactive CLI tool using @clack/prompts, replacing the previous argument-based execution. Key enhancements include a new checkpointing system for resuming interrupted runs, the introduction of a 'Full Context' baseline for comparison against plast-mem, and a modularized architecture that separates benchmark orchestration, statistics calculation, and schema validation. The README has been updated to document the new interactive options and the revised results JSON structure. Feedback is provided regarding the README's JSON example, suggesting a more descriptive representation of the nested statistics to improve clarity.
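As a rough illustration of validated checkpoint loading and resume, the sketch below parses a checkpoint string and works out which tasks are still pending. The `Checkpoint` shape, its field names, and the function names are hypothetical. The hand-written type guard stands in for a schema library such as Zod so the example has no dependencies.

```typescript
// Hypothetical checkpoint shape; field names are illustrative assumptions.
interface Checkpoint {
  version: 1;
  completed: string[]; // ids of tasks that already finished
}

// Hand-written guard standing in for a schema library such as Zod.
function isCheckpoint(value: unknown): value is Checkpoint {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Record<string, unknown>;
  const completed = v["completed"];
  return (
    v["version"] === 1 &&
    Array.isArray(completed) &&
    completed.every((id) => typeof id === "string")
  );
}

// Returns null for malformed JSON or an unexpected shape, so the caller
// can fall back to a fresh run instead of trusting unchecked data.
function parseCheckpoint(raw: string): Checkpoint | null {
  try {
    const parsed: unknown = JSON.parse(raw);
    return isCheckpoint(parsed) ? parsed : null;
  } catch {
    return null;
  }
}

// Tasks that still need to run after resuming from a checkpoint.
function pendingTasks(all: readonly string[], checkpoint: Checkpoint | null): string[] {
  const done = new Set(checkpoint?.completed ?? []);
  return all.filter((id) => !done.has(id));
}
```

For example, `pendingTasks(["a", "b"], parseCheckpoint('{"version":1,"completed":["a"]}'))` yields `["b"]`, while an unreadable checkpoint leaves every task pending.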
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
/gemini review
Code Review
This pull request refactors the LoCoMo benchmark into an interactive CLI using @clack/prompts, adding a checkpointing system for resuming runs and a 'Full Context' baseline comparison. The logic is modularized into specialized files for runner orchestration, checkpointing, and Zod-based schema validation. Feedback suggests removing an unused function and adopting a more idiomatic Array.from syntax for task initialization.
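The `Array.from` suggestion might look like the sketch below. The `Task` shape and the `createTasks` name are hypothetical and only show the pattern.

```typescript
// Hypothetical task shape, for illustration only.
interface Task {
  index: number;
  status: "pending" | "done";
}

// Array.from with a mapping function builds the array in one pass,
// avoiding the `new Array(n).fill(...).map(...)` idiom.
function createTasks(count: number): Task[] {
  return Array.from({ length: count }, (_, index): Task => ({ index, status: "pending" }));
}
```

The `{ length: count }` argument is an array-like object, so `Array.from` calls the mapper once per index without creating an intermediate array.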
/gemini summary
Summary of Changes
This pull request significantly refactors the
Highlights
Changelog
Activity
No description provided.