feat(agent): add configurable retry_strategy for model calls #1424

zastrowm · 2026-01-05T22:03:47Z

Description

The current retry logic for handling ModelThrottledException is hardcoded in event_loop.py with fixed values (6 attempts, exponential backoff starting at 4s). This makes it impossible for users to customize retry behavior for their specific use cases.

This PR refactors the hardcoded retry logic into a ModelRetryStrategy class so that folks can customize the parameters.

Not Included:

The PR does not introduce a RetryStrategy base class. I started to do so, but am deferring it because:

- It requires some additional design work to accommodate the tool-retries, which I anticipate should be accounted for in the design
- It simplifies this review which refactors how the default retries work internally
- ModelRetryStrategy provides enough benefit to allow folks to customize the agent loop without blocking on a more extensible design

Public API Changes

Added a new retry_strategy parameter to Agent.__init__():

from strands import Agent, ModelRetryStrategy

agent = Agent(
    model="anthropic.claude-3-sonnet",
    retry_strategy=ModelRetryStrategy(
        max_attempts=3,
        initial_delay=2,
        max_delay=60
    )
)
# Retries up to 2 times with 2s-60s exponential backoff

The retry_strategy parameter accepts only ModelRetryStrategy instances, not derived classes. We've been discussing a more abstract RetryStrategy base class that supports additional exception types, but I'm punting on that because it requires additional design work, whereas this provides immediate benefit to callers attempting to customize the current agent-loop retry behavior.

For now, alternative retry strategies can be implemented by creating a hook provider that sets event.retry = True on the AfterModelCallEvent when a retry should occur.
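A minimal sketch of that hook-based approach, using stand-in classes so it runs on its own. The shapes of AfterModelCallEvent and ModelThrottledException below are assumptions (only the `retry` attribute comes from this description), and registering the callback with the agent is left out because it depends on the SDK's hook registry:

```python
from dataclasses import dataclass
from typing import Optional


class ModelThrottledException(Exception):
    """Stand-in for the SDK's throttling exception (hypothetical definition)."""


@dataclass
class AfterModelCallEvent:
    """Stand-in event with only the fields this sketch needs (assumed shape)."""

    exception: Optional[Exception] = None
    retry: bool = False


class FixedCountRetryHook:
    """Hypothetical hook: asks for a retry on throttling, up to max_retries times."""

    def __init__(self, max_retries: int = 2) -> None:
        self.max_retries = max_retries
        self.retries_used = 0

    def on_after_model_call(self, event: AfterModelCallEvent) -> None:
        throttled = isinstance(event.exception, ModelThrottledException)
        if throttled and self.retries_used < self.max_retries:
            self.retries_used += 1
            # Signal the loop that the model call should be attempted again.
            event.retry = True


# Usage: the first throttled call is retried, the second is not.
hook = FixedCountRetryHook(max_retries=1)
first = AfterModelCallEvent(exception=ModelThrottledException("slow down"))
second = AfterModelCallEvent(exception=ModelThrottledException("slow down"))
hook.on_after_model_call(first)
hook.on_after_model_call(second)
print(first.retry, second.retry)
```

This sketch has no delay between attempts; a real strategy would likely sleep with backoff before setting the flag.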

Backwards Compatibility

Retry delay

The general default behavior is unchanged: agents still retry up to 5 times (6 attempts in total) with the same exponential backoff. EventLoopThrottleEvent and ForceStopEvent are still emitted during retries, maintaining backwards compatibility with existing hooks that listen for these events.

The exact delay times have changed. Because of a bug in the original logic, the initial delay was doubled before its first use (see test_agent_events.py for the test changes to accommodate this). Prior to these changes, the delays were:

8s, 16s, 32s, 64s, 128s

After these changes, the delays are:

4s, 8s, 16s, 32s, 64s
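The two sequences can be reproduced with a small sketch. This is not the code from event_loop.py; `backoff_delays` is a hypothetical helper, and the 240s cap is an assumption that does not affect either sequence:

```python
from typing import List


def backoff_delays(
    initial_delay: int,
    max_attempts: int,
    max_delay: int,
    double_before_first_sleep: bool = False,
) -> List[int]:
    """Return the sleep durations between attempts (one fewer than max_attempts)."""
    delays = []
    delay = initial_delay
    for _ in range(max_attempts - 1):
        if double_before_first_sleep:
            # Old behavior: the delay is doubled before it is ever used.
            delay = min(delay * 2, max_delay)
            delays.append(delay)
        else:
            # New behavior: sleep for the current delay, then double it.
            delays.append(delay)
            delay = min(delay * 2, max_delay)
    return delays


old = backoff_delays(4, 6, 240, double_before_first_sleep=True)
new = backoff_delays(4, 6, 240)
print(old)  # [8, 16, 32, 64, 128]
print(new)  # [4, 8, 16, 32, 64]
```

Under these assumptions the total worst-case wait drops from 248s to 124s.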

I think these are okay changes to make, however.

Default retry behavior

The default retry behavior also reads from event_loop.MAX_ATTEMPTS etc., so anyone who was previously modifying those constants will still see their changes take effect.
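One way such a fallback could work is to look the constants up when the strategy is constructed rather than binding them as default argument values. The sketch below is an illustration, not the PR's implementation: `event_loop` is a stand-in namespace, `SketchRetryStrategy` is hypothetical, and the names INITIAL_DELAY and MAX_DELAY and the value 240 are assumptions (only MAX_ATTEMPTS is named above):

```python
from types import SimpleNamespace
from typing import Optional

# Stand-in for the event_loop module and its constants (assumed names/values).
event_loop = SimpleNamespace(MAX_ATTEMPTS=6, INITIAL_DELAY=4, MAX_DELAY=240)


class SketchRetryStrategy:
    """Hypothetical strategy whose defaults follow the module-level constants."""

    def __init__(
        self,
        max_attempts: Optional[int] = None,
        initial_delay: Optional[int] = None,
        max_delay: Optional[int] = None,
    ) -> None:
        # Resolve at call time so later edits to the constants are picked up.
        self.max_attempts = event_loop.MAX_ATTEMPTS if max_attempts is None else max_attempts
        self.initial_delay = event_loop.INITIAL_DELAY if initial_delay is None else initial_delay
        self.max_delay = event_loop.MAX_DELAY if max_delay is None else max_delay


# A caller who patches the constant still influences the default strategy.
event_loop.MAX_ATTEMPTS = 3
patched = SketchRetryStrategy()
explicit = SketchRetryStrategy(max_attempts=10)
print(patched.max_attempts, explicit.max_attempts)  # 3 10
```

Writing `max_attempts=event_loop.MAX_ATTEMPTS` directly in the signature would instead freeze the value when the class is defined, so later patches would be ignored.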

Implementation Decisions

We preserve backwards compatibility by emitting EventLoopThrottleEvent and ForceStopEvent events as we used to.
- Backwards compatibility forces us to.
We do emit ForceStopEvent whenever an exception bubbles out of the model invocation
- This seems to be the convention
Naming
- We name it retry_strategy so that as hooks are expanded to allow retrying tools, we can also enable tool retry strategies
- We name it ModelRetryStrategy since it's only focused on model retries - in the future we might vend other strategies, but we can add a new strategy rather than attempting to fit it all into this one.

Related Issues

[FEATURE] make event loop settings configurable #283

Documentation PR

TODO once we align on this approach

Type of Change

New feature

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

I ran hatch run prepare

Checklist

I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.


codecov · 2026-01-05T22:05:54Z

Codecov Report

✅ All modified and coverable lines are covered by tests.



…igh-Level constructs (#420)

In doing api bar raising for strands-agents/sdk-python/pull/1424, we determined that HookProvider is a too-low-level interface for exposing directly to integrators. This captures that decision & reasoning in log format and sets us up to record future decisions in a similar way going forward. See DECISIONS.md on the decision & the format.

Co-authored-by: Mackenzie Zastrow

strands-agent

The type check for retry_strategy parameter is too restrictive and will break for subclasses:

if retry_strategy and type(retry_strategy) is not ModelRetryStrategy:
    raise ValueError("retry_strategy must be an instance of ModelRetryStrategy")

This uses type() with is not, which fails for subclasses. Consider:

class MyCustomRetry(ModelRetryStrategy):
    # Custom retry logic
    pass

agent = Agent(retry_strategy=MyCustomRetry())  # ❌ Raises ValueError!

Recommendation: Use isinstance() check instead:

if retry_strategy is not None and not isinstance(retry_strategy, HookProvider):
    raise TypeError(f"retry_strategy must implement HookProvider, got {type(retry_strategy).__name__}")

This allows:

Subclasses of ModelRetryStrategy ✅
Custom HookProvider implementations ✅
Better error message with actual type ✅
Type safety with proper inheritance check ✅
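The difference between the two checks can be seen in isolation with stand-in classes. This is a sketch, not the Agent code: `ModelRetryStrategy` here is an empty placeholder and the two helper functions are hypothetical. Note that commit d154c66 in this PR describes the exact-type check as intentional:

```python
class ModelRetryStrategy:
    """Empty placeholder for the real class."""


class MyCustomRetry(ModelRetryStrategy):
    """A subclass, as in the example above."""


def accepts_exact_type(candidate: object) -> bool:
    # Mirrors the type(...) is check: subclasses are rejected.
    return candidate is None or type(candidate) is ModelRetryStrategy


def accepts_subclasses(candidate: object) -> bool:
    # isinstance follows inheritance: subclasses are accepted.
    return candidate is None or isinstance(candidate, ModelRetryStrategy)


print(accepts_exact_type(ModelRetryStrategy()))  # True
print(accepts_exact_type(MyCustomRetry()))       # False
print(accepts_subclasses(MyCustomRetry()))       # True
```

Which behavior is preferable depends on whether subclassing is meant to be a supported extension point.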

🤖 This is an experimental AI agent response from the Strands team, powered by Strands Agents. We're exploring how AI agents can help with community support and development. Your feedback helps us improve! If you'd prefer human assistance, please let us know.

strands-agent and others added 4 commits January 5, 2026 13:41

feat(agent): add configurable retry_strategy for model calls

fdac712

Refactored hardcoded retry logic in event_loop into a flexible, hook-based retry system that allows users to customize retry behavior.

Yield backwards compatible events + update initial delay to be correct

ef83070

Always emit a ForceStopEvent when an exception bubbles

9caab19

Condense the doc strings down

fb78e58

github-actions bot added the size/l label Jan 5, 2026

zastrowm had a problem deploying to auto-approve January 5, 2026 22:03 — with GitHub Actions Failure

Remove NoopRetryStrategy

bfc71ab

zastrowm mentioned this pull request Jan 6, 2026

feat: Add decision record for HookProviders being too low level for High-Level constructs #1427

Closed

zastrowm mentioned this pull request Jan 8, 2026

feat: Add decision record for HookProviders being too low level for High-Level constructs strands-agents/docs#420

Merged

4 tasks

zastrowm added 2 commits January 14, 2026 13:36

Merges upstream main into configure_agent_retry branch

4162947

Adds strict type validation for Agent retry strategy

d154c66

Enforces that Agent only accepts ModelRetryStrategy instances (not subclasses) for the retry_strategy parameter to prevent API confusion before a base RetryStrategy class is introduced.

github-actions bot added size/l and removed size/l labels Jan 14, 2026

zastrowm temporarily deployed to auto-approve January 14, 2026 19:11 — with GitHub Actions Inactive

zastrowm marked this pull request as ready for review January 14, 2026 19:17

pgrayy reviewed Jan 15, 2026

src/strands/agent/agent.py (outdated, resolved)

src/strands/agent/agent.py (resolved)

pgrayy reviewed Jan 15, 2026

tests/strands/event_loop/test_event_loop.py (outdated, resolved)

fix: move imports to top

6ea2fe2

zastrowm temporarily deployed to auto-approve January 15, 2026 17:44 — with GitHub Actions Inactive

github-actions bot added size/l and removed size/l labels Jan 15, 2026

zastrowm added 2 commits January 15, 2026 12:45

Merge remote-tracking branch 'upstream/main' into configure_agent_retry

1dc7cad

# Conflicts: # src/strands/agent/agent.py

Tweaks after merge

14dc7e8

zastrowm temporarily deployed to auto-approve January 15, 2026 18:32 — with GitHub Actions Inactive

github-actions bot added size/l and removed size/l labels Jan 15, 2026

Unshure reviewed Jan 15, 2026

src/strands/event_loop/event_loop.py (outdated, resolved)

Move retry functionality into event_loop though still exposed at top level

21d688f

github-actions bot removed the size/l label Jan 15, 2026

zastrowm had a problem deploying to auto-approve January 15, 2026 20:58 — with GitHub Actions Failure

github-actions bot added the size/l label Jan 15, 2026

fix: linting error

b122ec2

github-actions bot added size/l and removed size/l labels Jan 15, 2026

zastrowm had a problem deploying to auto-approve January 15, 2026 21:14 — with GitHub Actions Failure

Unshure previously approved these changes Jan 15, 2026


Rename retry file to include underscore

c71078c

zastrowm dismissed Unshure’s stale review via c71078c January 15, 2026 22:36

github-actions bot added size/l and removed size/l labels Jan 15, 2026

zastrowm had a problem deploying to auto-approve January 15, 2026 22:36 — with GitHub Actions Failure

zastrowm mentioned this pull request Jan 16, 2026

feat(event_loop): make event loop settings configurable #288

Closed

7 tasks

zastrowm linked an issue Jan 16, 2026 that may be closed by this pull request

[FEATURE] make event loop settings configurable #283

Open

fix: local imports

083be7e

github-actions bot added size/l and removed size/l labels Jan 16, 2026

zastrowm temporarily deployed to auto-approve January 16, 2026 20:45 — with GitHub Actions Inactive

Remove one line

71c5109

zastrowm temporarily deployed to auto-approve January 16, 2026 21:28 — with GitHub Actions Inactive

github-actions bot added size/l and removed size/l labels Jan 16, 2026

strands-agent reviewed Jan 19, 2026
