If you only fix one thing about your Playwright suite before letting an agent touch it, fix how you locate elements.

I say this because the locator is the contact surface between your test and your UI—and everything downstream of that (flakiness, waiting, maintenance cost, the agent’s ability to write new tests without breaking them) depends on that surface being solid. A brittle locator poisons the whole loop. An accessible locator makes the rest of Playwright feel almost easy.

The hierarchy

Playwright ships with a bunch of locator APIs. They are not equivalent, and the order you reach for them matters. Here’s the order I want in your head and in your agent rules by the end of this lesson:

page.getByRole('button', { name: 'Add book' })—by semantic role and accessible name
page.getByLabel('Book title')—by form label
page.getByPlaceholder('Search books...')—by placeholder text
page.getByText('Added to your shelf')—by visible text
page.getByTestId('add-book-button')—by data-testid attribute
page.locator('.btn-primary')—raw CSS

Rules one through four are what I want the agent reaching for first. Rule five is the escape hatch when the UI genuinely doesn’t have an accessible name you can match on. Rule six is an anti-pattern—it’s in the list so I can tell you not to use it.

Why this ordering, and not some other ordering

Because the ordering is aligned with what a user sees.

A screen reader user navigating Shelf doesn’t see .btn-primary or .css-3f7g8h. They see “Add book, button.” If your test targets the same thing the screen reader targets, two things happen automatically. One, your test keeps working across refactors because roles and accessible names are stable in a way that CSS classes are not. Two, you get a rough accessibility audit for free—if you can’t find the element by its role, it doesn’t have a role, and that’s a real bug, not a Playwright problem.

This is the single best argument for locator discipline: the refactor-proof test and the accessible component are the same component. You cannot write a getByRole test against an inaccessible button. The locator forces you to fix the component, and the fixed component helps real users. Free wins don’t get much freer.

But, do not overread that claim. A good getByRole suite gives you upstream pressure toward accessible markup. It does not give you a dedicated accessibility gate. We make that distinction explicit in Accessibility as a Quality Gate, because “probably accessible” is not a quality bar.

If you want reps instead of theory, Shelf ships a dedicated playground at src/routes/playground/+page.svelte, and the companion lab has you create tests/playground.spec.ts. Run the app locally while you work, and keep npm run typecheck and npm run build nearby too. The playground intentionally includes a few accessibility warnings so you can see where the locator hierarchy stops helping and why the fallback section exists.

What the agent does by default, and why it’s wrong

Left to its own devices, an agent writing a Playwright test does this:

await page.locator('.book-card button.primary').click();

I get why. The agent looked at the rendered DOM, saw a button inside a book card, and pulled the CSS selector that was there. It’s technically correct—the test passes on the day it was written. Then tomorrow someone renames .book-card to .shelf-entry because that’s what it’s actually called in the data model now, and the test explodes, and the agent blames the “flaky test suite.”

The test wasn’t flaky. The locator was coupled to an implementation detail that had no business being part of a test.

The version the agent should write is closer to this:

await page
  .getByRole('article', { name: /Station Eleven/ })
  .getByRole('button', { name: 'Rate this book' })
  .click();

Longer. Less clever-looking. Immune to the class rename, the stylesheet rewrite, the migration from CSS-in-JS to Tailwind. It targets what the user sees: a book with a specific name, and a button inside it with a specific action.

Scoping with locators, not selectors

Notice the chained getByRole calls in the good version. That’s the other habit I want to burn in: use locator chaining to scope your search, not string concatenation or complex CSS.

// Don't do this
await page.locator('.shelf-entry[data-title="Station Eleven"] button.rate').click();

// Do this
const book = page.getByRole('article', { name: /Station Eleven/ });
await book.getByRole('button', { name: 'Rate this book' }).click();

The chained version reads in the same order a person thinks: find the book, then find the rate button inside it. And book is a reusable Locator—you can assert on it, hover it, re-scope off it, pass it to another helper. The CSS version is a single string that does one thing and then you throw it away.

Composition beats `nth()`

The newer Locator API gives you better composition tools than “well, I guess I’ll use .nth(2) and hope nobody reorders the DOM.”

Three worth memorizing:

locator.and(...) intersects two locators. Use it when the target has two meaningful identities and you want both of them in the selector.
locator.filter({ has, hasText, hasNot }) narrows a collection by child content or visible text. This is the workhorse for repeated cards, rows, and list items.
locator.or(...) expresses “either this UI or that UI.” In practice, add .first() when both branches might exist at once, because strict locators are not in the business of guessing which one you meant.

const composeButton = page.getByRole('button').and(page.getByTitle('Compose'));

const securityDialog = page.getByText('Confirm security settings');

await expect(composeButton.or(securityDialog).first()).toBeVisible();

if (await securityDialog.isVisible()) {
  await page.getByRole('button', { name: 'Dismiss' }).click();
}

await composeButton.click();

The same pattern is excellent for repeated content:

const stationEleven = page
  .getByRole('article')
  .filter({ has: page.getByRole('heading', { name: 'Station Eleven' }) })
  .describe('Station Eleven shelf card');

await stationEleven.getByRole('button', { name: 'Rate this book' }).click();

This is what Playwright’s best-practice locator guidance means by chaining and filtering. Use the meaning already in the UI. Stop indexing into DOM order unless you truly have nothing else.

Name the locator when it matters

locator.describe() is one of those tiny APIs that pays rent every time the suite fails in CI. It gives the locator a human-readable label in traces and reports.

That is not decorative. If your trace says “waiting for Station Eleven shelf card” instead of “waiting for getByRole('article'),” the agent spends its time diagnosing the failure instead of reverse-engineering your selector chain.

Use it on locators that:

show up in multiple assertions
represent an important UI object
would otherwise read like anonymous plumbing in a trace

`all()` is not a wait

locator.all() looks friendly and behaves like a trap on dynamic UIs. It returns whatever matches right now. No waiting, no retry, no mercy.

That makes it fine for already-stable static content and a bad default for rendering lists, search results, and delayed UI. If what you really want is “collect the text from the matching elements once the list is ready,” expect(locator).toHaveCount(...) plus locator.evaluateAll(...) is usually sharper:

const results = page.getByRole('listitem');

await expect(results).toHaveCount(3);

const titles = await results.evaluateAll((nodes) =>
  nodes.map((node) => node.textContent?.trim() ?? ''),
);

One hop into page context beats bouncing across the protocol for every row.

Iframes are strict too

Frames are where teams suddenly forget every locator rule they had five minutes ago. The modern pattern is still “start with a normal locator, narrow it until it is unique, then step into the frame.”

const analyticsFrame = page.getByTitle('Quarterly dashboard').contentFrame();

await analyticsFrame.getByRole('button', { name: 'Refresh' }).click();

The FrameLocator docs call out the strictness here for a reason: if multiple frames match, Playwright throws. Good. That is better than clicking inside whichever dashboard happened to render first.

When `data-testid` is fine

I don’t want to tell you data-testid is always wrong. Sometimes it’s the right answer. Specifically:

You have three buttons with the exact same accessible name on the page for real product reasons.
You’re targeting a wrapper element with no semantic role (a layout div that the test needs to check visibility on).
You’re working around a component library that doesn’t expose an accessible name and the fix is not in your repo.

In those cases, add a data-testid and move on. But the rule in your agent instructions should be that data-testid is the third-choice answer, not the first, and the agent should have to write a sentence in the commit message explaining why role and label didn’t work. (That sentence doesn’t need to be enforced mechanically—it needs to exist as a speed bump so the agent doesn’t reach for data-testid by default.)

The agent rules

Drop this into the instructions file, or something like it:

## Playwright locators

Order of preference when locating elements:

1. `page.getByRole(role, { name })`—try this first. Always.
2. `page.getByLabel(labelText)`—for form inputs with visible labels.
3. `page.getByPlaceholder(text)`—for inputs without labels (and fix the missing label if you can).
4. `page.getByText(text)`—for static visible text and confirmation messages.
5. `page.getByTestId(id)`—only when 1–4 genuinely do not work. If you use this, add a line to the commit message explaining why.
6. `page.locator(cssSelector)`—never. If you find yourself here, the component needs an accessible name.

For nested elements, scope with chained locators, `filter({ has, hasText })`,
`and()`, and `or().first()` before reaching for `nth()`.
Use `locator.describe()` on important reusable locators so traces read like
English instead of plumbing.

That’s nine lines. It’s the most valuable nine lines in your instructions file for the next month. It will prevent more flaky tests than any retry configuration you could possibly write.

Wiring it into the loop

Two pieces of feedback hook into locator discipline directly, and both show up again later today.

ESLint rule for page.locator. Later, in Lint and Types as Guardrails, we’re going to set up an ESLint rule that warns (or errors) whenever page.locator appears in a file under tests/. The agent gets a red squiggle the moment it reaches for the escape hatch, which is the fastest possible feedback.

Playwright’s built-in accessibility debugging. When a getByRole query fails, Playwright’s error message prints the accessibility tree of the page at the point of failure. That tree is gold for the agent—it shows exactly what roles and names are available, so the agent can correct its query without guessing. We’ll lean on this when we talk about failure dossiers.

One corollary: dynamic content is still not a license to reach for page.waitForTimeout. If a panel expands, a list loads, or a dialog opens, wait on the user-visible signal you expect to change and keep the locator scoped to that element.

The one thing to remember

Locate by role and accessible name first. Everything else is an escape hatch, and escape hatches should feel slightly uncomfortable to use. If your agent is reaching for CSS selectors, your instructions file isn’t doing its job yet.

Steve Kinney

Locators and the Accessibility Hierarchy

The hierarchy

Why this ordering, and not some other ordering

What the agent does by default, and why it’s wrong

Scoping with locators, not selectors

Composition beats `nth()`

Name the locator when it matters

`all()` is not a wait

Iframes are strict too

When `data-testid` is fine

The agent rules

Wiring it into the loop

The one thing to remember

Additional Reading

Locators and the Accessibility Hierarchy

The hierarchy

Why this ordering, and not some other ordering

What the agent does by default, and why it’s wrong

Scoping with locators, not selectors

Composition beats nth()

Name the locator when it matters

all() is not a wait

Iframes are strict too

When data-testid is fine

The agent rules

Wiring it into the loop

The one thing to remember

Additional Reading

Composition beats `nth()`

`all()` is not a wait

When `data-testid` is fine