Always cache the client rather than creating one per request; reuse preserves rate limit state. Respect the provided attribution guidelines when rendering results.