<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Agent on Taras&#39; Blog on AI, Perf, Hacks</title>
    <link>/tags/agent/</link>
    <description>Recent content in Agent on Taras&#39; Blog on AI, Perf, Hacks</description>
    <image>
      <title>Taras&#39; Blog on AI, Perf, Hacks</title>
      <url>/images/papermod-cover.png</url>
      <link>/images/papermod-cover.png</link>
    </image>
    <generator>Hugo -- 0.147.0</generator>
    <language>en</language>
    <copyright>Taras Glek</copyright>
    <lastBuildDate>Thu, 30 Mar 2023 15:09:04 +0200</lastBuildDate>
    <atom:link href="/tags/agent/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>GPT-ARIA: Integrating GPT-3 with Browser Using Accessibility API</title>
      <link>/posts/gpt-aria-experiment/</link>
      <pubDate>Thu, 30 Mar 2023 15:09:04 +0200</pubDate>
      <guid>/posts/gpt-aria-experiment/</guid>
      <description>&lt;p&gt;&lt;a href=&#34;https://github.com/thegpvc/gpt-aria&#34;&gt;https://github.com/thegpvc/gpt-aria&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;This was the most interesting prototype I’ve explored in my career so far, and I want to thank &lt;a href=&#34;https://www.thegp.com/&#34;&gt;TheGP&lt;/a&gt; for sponsoring this project. Also thanks to &lt;a href=&#34;https://github.com/chugai&#34;&gt;Oleksandr Chugai&lt;/a&gt; who I partnered with on implementation and &lt;a href=&#34;https://twitter.com/bencmejla&#34;&gt;Ben Cmejla&lt;/a&gt; who helped with prompting.&lt;/p&gt;
&lt;h2 id=&#34;inspiration&#34;&gt;Inspiration&lt;/h2&gt;
&lt;p&gt;I’ve been thinking about browser automation since the early days of my career when I worked at Firefox, where I got to work on performance problems and built things like Firefox Telemetry (Hi, HN haters!). With the rise of large language models (LLMs)—and OpenAI’s recent launch of ChatGPT plugins — things are getting exciting around user agents again. My favorite demo so far is &lt;a href=&#34;https://github.com/nat/natbot&#34;&gt;natbot&lt;/a&gt;, Nat Friedman’s GPT-3-powered browser agent that he launched last October.&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
