momentic-test

Create, run, and maintain Momentic E2E tests and modules, which are serialized to disk as *.test.yaml and *.module.yaml files. Momentic uses fast, accurate AI agents to automate browser interactions for the purpose of testing web applications.

Skill file

Preview skill file
---
name: momentic-test
description: Create, run, and maintain Momentic E2E tests and modules, which are serialized to disk as *.test.yaml and *.module.yaml files. Momentic uses fast, accurate AI agents to automate browser interactions for the purpose of testing web applications.
---

# Momentic background

## Execution model

Momentic is an end-to-end testing framework. Tests are ordered lists of
structured steps executed with Playwright and CDP.

- Interactive steps such as clicks and types use AI to resolve natural-language
  targets into concrete browser actions.
- Assertion steps can use multimodal models to evaluate page state.
- Goal-based AI actions can perform broader tasks such as "checkout with credit card".

## Cache and memory

Momentic caches resolved step metadata such as selectors, XPaths, visible text,
and coordinates so most runs avoid repeated AI calls. This is critical for speed, but
stale cache is a real debugging possibility: a step may hit the wrong element.
AI assertions may also use past-result memory to stay consistent across runs;
bad memory can explain repeated borderline failures.

Caches are scoped by git metadata, including branch. Cache writes are skipped on
protected branches, including the configured main branch, unless cache saving is
forced with `--save-cache` or the `CI` environment variable is set. Cache reads
can still happen on protected branches. Use `--disable-cache` to bypass cache
entirely.

Ways to force fresh behavior:

- Change the step description/assertion when the intent has changed; this
  changes the step identity used for cache matching. In v1, splicing a changed
  step also creates fresh internal UUIDs.
- Use `--disable-cache` for dynamic targets that should resolve fresh every run, e.g "the calendar cell for today's date".
- Preserve good previewed cache by carrying `CacheId` into splice with
  `--cache-id <CacheId>`.
- Change assertion wording and add disambiguation when previous AI memory is now misleading.

## Timing

Momentic uses "smart waiting" before targeting steps. It waits up to the
configured Smart waiting timeout, which defaults to 5 seconds, for page state to settle or the desired element to appear. Within that window, do not add manual sleeps or waits. For slower or more semantic readiness, use `waitForUrl`, text/element checks, or an AI assertion.

## Settings precedence

`momentic.config.yaml` sets project defaults, but many settings can be
overridden at the test level: browser type, viewport, locale/timezone,
geolocation, page-load timeout, smart-waiting timeout, proxy, headers, auth, extensions, etc. Always check the test's own metadata before assuming the project default applies.

## Test context

Each run has a test-scoped `env` context that persists across steps, including
modules. Later steps can read values written by earlier steps.

- v2: use `saveAs` on steps with return values.
- v1/MCP CLI strings: use `--env-key`.
- JavaScript: prefer `return` plus `saveAs` / `--env-key`; use
  `setVariable(name, value)` when setting multiple variables.
- Use `env.NAME` in JavaScript and module input expressions.
- Use `{{ env.NAME }}` in string fields. `{{ ... }}` can evaluate JavaScript,
  but do not use it inside JavaScript step source because `env` is already in
  scope there.

Module inputs are JavaScript fragments as strings. Quote literal strings and use
`env.X` for variables; they are not `{{ }}` templates.

## JavaScript context

JavaScript steps can run in `NODE` or `BROWSER`.

- `NODE`: default for API calls, data prep, OTP/email/SMS, DB queries, and
  variable writes. It has Momentic globals and preloaded libraries. `env`, `setVariable`, `email`, `sms`, `axios`, `assert`, `faker`, `moment`, `pg`, `OTPAuth`, `child_process`, etc. Check the JavaScript command docs or the Step Authoring Guide for more details
- `BROWSER`: runs in the page with `window` / `document` and page globals. It
  does not have Node-only Momentic helpers.

Keep short one-off JavaScript inline. In v2 YAML, reusable utilities and long
scripts can live in a project script file, following existing project
conventions. Prefer locations such as
`./scripts/page-utilities/auth-loader.js`. Read nearby scripts first and match their
module style, helper naming, env usage, and error style.

# Project state on disk

Tests are `*.test.yaml` files. Modules are reusable step collections stored as
`*.module.yaml` files. Test IDs are authoritative and live on the test file's
`id` field.

There are two major file formats:

- `fileType: momentic/test/v2` or `fileType: momentic/module/v2` -> **v2**.
  Direct YAML editing is preferred for high-confidence changes as it is faster.
- Missing `fileType` or any other value -> **v1**. Never edit v1 YAML
  directly; persist changes only through `momentic_test_splice_steps`.

`momentic.config.yaml` is the project root config. It stores project defaults
for agents, AI features, browser options, recording, timeouts, browser type,
file globs, and environments. See
https://momentic.ai/docs/configuration/momentic-config.md.

v2 steps can reference local files by relative path:

- Module invocations: `path: ./modules/login.module.yaml`
- JavaScript steps: `javascript: ./scripts/setup.js`
- Auth state: `authLoad: ./auth-state.json`, `authSave: ./auth-state.json`

Relative paths resolve from the YAML file containing the step, not from the
project root or importing test. Use `./...` or `../...`; do not use absolute
paths or `~`. If you move, rename, or delete a referenced file, grep for the old
path and update every reference.

v1 YAML should still be edited only through MCP. Do not use `codeFile` in v1
YAML or MCP CLI step strings; v1 JavaScript steps should carry executable code
in `code` / `--code`. Use JavaScript file references only in v2 YAML.

Do not add internal or auto-generated fields to v2 YAML.

# Before you edit

Gather only what you need:

- Test goal and user-visible success criteria.
- Start point: `baseUrl` or named environment.
- Auth requirements and required env vars.
- Risky actions that must not run twice: submit, purchase, delete, send, create.

For long tasks, inspect nearby tests and modules before authoring. Reusing an
existing module is usually better than rebuilding a common flow inline.

Ask before long-running checks, starting over from scratch, destructive actions,
or editing a shared module.

# Choose the workflow

If the user requests a specific workflow, respect it unless it is unsafe or
impossible. Otherwise, use direct v2 YAML editing when the file is v2, the
change is localized, the step sequence is known, and live UI discovery is not
required. Good examples: scaffold from a nearby pattern, reword an assertion,
adjust a target, update an env key, fix a file reference, or insert a small
known step.

Use the MCP browser-validation workflow when the file is v1 or unknown, UI state
must be discovered, locator timing is flaky, the flow is multi-step and unclear,
or the user asks to build/validate interactively ("headful mode").

Use `momentic_test_create` for new tests; search for the tool if it is not
visible. It requires `name` and either `baseUrl` or `environment`. Only pass
destination fields such as `pathSegments` when requested.
`momentic_session_start` requires an existing `testId`; it does not create
tests.

# Universal authoring rules

- Prefer natural-language element descriptions. Use selectors or
  coordinates only as a last resort for cases the AI cannot see, such as SVG
  internals, canvas, or a user-requested selector-level target.
- Prefer native Momentic steps over JavaScript. Use JS only when no native step
  expresses the behavior. JS steps can run in either the browser (client-side)
  or Node (server-side).
- Do not add navigation at the start. The session starts on the test's base URL.
- Keep assertions minimal and user-driven. Add readiness checks only when they
  are needed to make the next dependent action reliable.
- After a click/action that should navigate or materially change state, add an
  immediate validation before dependent actions. Prefer `waitForUrl` for URL
  contracts, `checkPageContains` / `checkElement...` for stable text or
  elements, and `assert` for semantic visual state.
- Do not use AI actions (`act`, `AI_ACTION`, `AI_ACTION_DYNAMIC`) unless the
  user asks or the existing test already uses one.
- Do not add optional/default fields unless needed for correctness.
- Keep the delta small. Preserve unrelated params, request bodies, env keys,
  literal values, quoting, comments, ordering, and step style.
- Do not work around real app failures. If the app is broken, data is missing,
  or a backend is down, report the failure instead of weakening the test.
- Do not reorganize `before` / `steps` / `after` or setup / main / teardown
  unless the test intent requires it.

# Working with v2 YAML

v2 is the human-editable format. Steps are compact: each step has one top-level
command key, such as `click: Submit`, or a detailed map under that key. Tests
use `before` / `steps` / `after`; modules use `steps`. Durations are always
milliseconds. No visible step/command IDs.

Direct-edit loop:

1. For a new test, create it with `momentic_test_create`, then edit the YAML in one batch instead of adding known steps one-by-one through MCP.
2. Make the smallest YAML edit. Preserve style. If you are unsure of syntax, fetch and read what is needed from https://static.momentic.ai/v2-format-reference.md.
3. Validate only if the user asked, risk warrants it, or the change needs live
   confirmation.

Common mistakes:
- Putting options beside the command instead of nesting under it.
- Using the wrong target-field name for a command.

`npx momentic lint` validates v2 schemas and file references. Lint runs
automatically before `momentic app` and `momentic run`; run it manually if you
are unsure of syntax after edits or after moving/renaming referenced files.

State refresh after disk edits:

- No active MCP session: start a fresh session when ready to validate.
- Active session plus `momentic_test_reload`: reload before `momentic_run_step`.
- Active session and no reload tool: terminate and restart the session.
- `momentic_test_get` inspects persisted state; it does not refresh an active
  session unless the tool explicitly says so.

# MCP browser-validation workflow

Use this for every v1 edit and for v2 work that needs live discovery. The tool
surface is shared; persistence differs: v1 uses splice, while v2 can use splice
or direct YAML edit plus reload.

## Discovery

- `momentic_get_artifacts()`: project context, config path, cwd, and artifact
  files for tests, modules, environments, etc. Read only what you need.
- `momentic_test_get({ testId | testPath })`: inspect persisted test state.
  Before a session, this is useful. During an active session after splicing,
  prefer the splice response or `returnTest: true`.
- `momentic_module_recommend({ userRequest })`: find reusable flows.
- `momentic_module_get({ selector })`: inspect module params, defaults, enums,
  and steps. Selector is exactly one of `{ id }`, `{ name }`, or `{ path }`.

## Sessions

- `momentic_session_start({ testId, ... })`: start browser session. It returns
  metadata, the Step Authoring Guide artifact, and Test Content with
  active-session step IDs. Required: `testId`. Call it by itself, not in
  parallel with other MCP tools. Options include env/config/project overrides,
  headful mode, and video.
- Read the Step Authoring Guide before constructing CLI-style steps.
- `momentic_run_step({ sessionId, fromStep, toStep?, targetSection?,
  resetSession? })`: run existing active-session steps. Use step IDs from Test
  Content or splice responses, never raw YAML. Use `parentStepIdChain: []` for
  top-level steps.
- If state drifts, restart with `momentic_run_step` and `resetSession: true` on
  the same `sessionId`; do not reset between every micro-edit.
- `momentic_session_terminate({ sessionId })`: terminate when done. If started
  with `video: true`, the response includes the video directory.

## Test authoring loop

Author MCP steps in checkpoint-sized chunks. Preview forward until a logical
section works, then splice that checkpoint. Good checkpoints are natural flow
boundaries such as login complete, form submitted, page deleted, etc. Avoid splicing one step at a time, unless it is highly risky and non-idempotent.

- `momentic_preview_step({ sessionId, step })`: execute one step in the browser without
  persisting. If it returns `CacheId`, include
  `--cache-id <CacheId>` when splicing that step. The response screenshot shows
  the page state after the step.
- If a preview screenshot is not enough to target, call
  `momentic_get_session_state` with `returnBrowserState: true`, then inspect the
  artifact for stable names, roles, visible text, and prominent structure to
  help craft a reliable description. Use sparingly as browser state is large.
- When the next several steps are obvious and low-risk, such as filling known
  form fields, splice them together and run that saved range instead of
  previewing each field one by one.
- `momentic_test_splice_steps({ sessionId, startIndex, deleteCount, steps,
  targetSection?, parentStepIdChain?, returnTest? })`: insert, replace, or
  delete steps and persist.
- After splicing, read the response immediately; it is the source of truth for
  inserted/deleted refs and active-session step IDs.
- If `returnTest: true`, verify the returned structure before continuing.
- If downstream steps remain, run the immediate next step to confirm the flow
  still connects.
- For non-idempotent actions such as submit, purchase, delete, send, etc.,
  avoid repeated previews. Preview the setup steps, splice the checkpoint before
  the risky action, then execute the risky saved step once only when validation
  requires it.
- For obvious adjacent low-risk steps, batch them; do not checkpoint after every
  field unless locator or page state is uncertain.
- When the requested edit is complete, ask whether to validate from the start by
  running the relevant saved range with `momentic_run_step`.

## Reading tool output

Sessions are live browser processes. Screenshots and browser states are instant snapshots. If the screenshot does not show expected state after an action, call
`momentic_get_session_state` once more; the page may still be loading.

MCP tools may return artifact links under `.momentic-mcp/...`. Read linked files
only when needed:

- UI-state text: refine targeting or debug structure.
- Screenshots: usually already returned inline as images.
- Environment files: validate `envKey`, JavaScript/API outputs, or dependent
  env values.
- `momentic_get_session_state` returns serialized UI state only with
  `returnBrowserState: true`; screenshots are returned by default.

## CLI-style step strings

`preview_step` and `splice_steps` use CLI-style strings:
`--step-type <TYPE> [options]`, e.g. `CLICK`, `TYPE`, `NAVIGATE`,
`AI_ASSERTION`, `MODULE`, `WAIT_FOR_URL`.

Examples:

- Navigate: `--step-type NAVIGATE --url "https://example.com"`
- Click: `--step-type CLICK --description "the Sign in button"`
- Type:
  `--step-type TYPE --description "Search input" --value "hello" --press-enter`
- AI assertion:
  `--step-type AI_ASSERTION --assertion "the page shows a Sign in button" --timeout-seconds 10`
- Module:
  `--step-type MODULE --module-id <id> --inputs email=env.USER_EMAIL --inputs password=env.USER_PASSWORD`

Splice example:

```json
{
  "sessionId": "SESSION_ID",
  "startIndex": 0,
  "deleteCount": 0,
  "steps": [
    "--step-type NAVIGATE --url \"https://example.com\"",
    "--step-type AI_ASSERTION --assertion \"the page shows a Sign in button\" --timeout-seconds 10 --cache-id UUID_FROM_PREVIEW"
  ],
  "targetSection": "main"
}
```

For conditionals, create the `CONDITIONAL` step with `--assertion-type` and the
matching assertion fields, then splice nested steps with
`parentStepIdChain: [conditionalStepId]`.

# Modules

Default to module-first for logical flows of 4+ steps such as login,
navigation, setup, or checkout. Call `momentic_module_recommend`, inspect strong
candidates with `momentic_module_get`, then decide module vs inline.

Modules cannot contain modules. Splicing a `MODULE` step inside a module fails.

Editing a shared module requires user confirmation. To modify a module through
MCP, replace the module step with a `MODULE` step carrying the needed metadata
flags: `--parameters`, `--parameter-enum`, `--default-parameter`,
`--module-display-name`, `--module-description`, `--module-enabled`. Keys in
`defaultParameters` and `parameterEnums` must exist in `parameters`.

Module `inputs` values are JavaScript fragments as strings. Quote string
literals, reference env as `env.X`, and respect enum constraints exactly.

# Validation strategy

- Direct v2 edit with no live validation requested: summarize changes and ask
  whether to run.
- Direct v2 edit with active session: reload if available, otherwise restart
  the session, then run the edited range.
- MCP-authored edit: preview forward, splice at logical checkpoints, then run the
  next downstream saved step or range.
- Long full-test runs and risky actions require confirmation.
- Terminate MCP sessions when done.

# Troubleshooting

## Page state and timing

- Wrong page/UI: read latest UI state or call `momentic_get_session_state`.
- Screenshot not updated: call `momentic_get_session_state` once more.
- Flaky timing: prefer `AI_ASSERTION`, `checkPageContains`, `checkElement...`,
  or `waitForUrl` over generic `WAIT`.
- Long backend job/import/upload: use an assertion or URL/text/element wait with
  an appropriate timeout instead of sleeps.
- Weird session state: use `momentic_run_step` with `resetSession: true`.

## Targeting and cache

- Element not found: inspect screenshot/UI state. If visible, improve the
  description using visible text, role, and nearby context; if absent, debug the
  prerequisite step.
- Wrong element was hit quickly or without any AI: suspect stale cache, especially when the page structure is similar to a prior run.
- Dynamic target: change the description to be stable, or disable cache for that
  step.
- Playwright stability failure: a target can be found but still not be
  actionable because it is hidden, detached, outside the viewport, covered, or
  animating. Prefer fixing page state when possible. Use `--force` only for that
  step when bypassing actionability is acceptable, or enable project-wide visual
  actions if coordinate-based interaction is the right tradeoff.
- Visual actions are enabled in project browser settings with
  `visualActions: true`. Momentic interacts by X/Y coordinates and tries to
  preserve element identity best-effort instead of hard-failing like Playwright
  actionability checks.
- Quoted text: quoted substrings in descriptions are treated literally by
  Momentic AI. Use quotes only when that exact text must appear on screen or in
  the element's accessible name; omit quotes for semantic matching.

## AI assertion performance

- Ambiguous assertion: make the expected visual/text condition concrete. Include
  the relevant page region, object, count, or state.
- Literal text mismatch: quoted strings are treated as text that must appear on
  screen. Remove quotes when semantic matching is intended. Describe the purpose
  of elements when possible rather than specific text or labels.
- Out-of-viewport: visual conditions like color or shape can only be evaluated
  if the element is in the viewport. Use scroll/hover setup.
- Visually subtle condition: `AI_ASSERTION` supports VISION_ONLY mode which has
  better visual reasoning.
- Repeated bad verdict: reword the assertion so the intended condition is
  clearer and old memory no longer applies.
- Transient conditions: AI checks retry multiple times over the configured
  timeout but each attempt is an instantaneous snapshot. As such, extremely fast
  changes like toasts that appear and vanish in 1 second can be missed. Also,
  there is no ability to evaluate changes over time ("the screen is darker than
  before"). Prefer crafting assertions for stable final state; if necessary
  client-side JavaScript observers can be used.

## Format and data

- v2 load/run failure: run `npx momentic lint <path>`; broken relative file
  references are common after moves.
- Module failure: re-check required params, defaults, enums, and JS-fragment
  input syntax with `momentic_module_get`.
- Env value missing: confirm the producing step used `saveAs` / `--env-key` or
  `setVariable`, and that the consuming syntax is `env.X` vs `{{ env.X }}`.
- JavaScript failure: confirm the environment. Browser JS cannot use Node
  helpers; Node JS cannot read live DOM globals.

After about three attempts on the same problem, stop and ask the user for a
direction.

# Decision cheat sheet

- Known v2 sequence: direct YAML edit.
- v1 or unknown UI state: MCP preview, splice, validate.
- Need the right step index: use Test Content, splice refs, or `returnTest`;
  never raw YAML step IDs.
- Need the browser at step N: run once from start/setup to N-1, then keep using
  the same session.
- Single new step idea: `momentic_preview_step`.
- Persist validated MCP steps: `momentic_test_splice_steps`.
- Clean restart: `momentic_run_step` with `resetSession: true`.
- Validate direct v2 edit: `momentic_test_reload` if active, else fresh session.

Source

Creator's repository · momentic-ai/skills

View on GitHub

Security

Verified — safe to install
Passed all 3 independent security checks
What this skill can do
Reads your filesConnects to the internetRuns code on your machine
Checked by 3 independent security firms
Does it try to trick the AI?NoSAFE · Gen Agent Trust Hub
Does it sneak in hidden code?NoNo alerts · Socket
Does it have known bugs?NoMed risk · Snyk