pure.md changelog
2025-04-18
- You can now set the
x-puremd-link-mode
request header toretain
to keep anchor tag hrefs inline. By default, hrefs are removed from output. - Better support for reddit comment threads.
2025-04-17
- Requests now follow redirects declared in
http-equiv="refresh"
meta tags. - Added better resilience for sites that omit
<html>
and/or<body>
tags. - Fixed an issue where some responses from the Internet Archive were returning a null content-type.
2025-04-16
- User dashboard now displays an ASCII usage chart of requests made per day.
- Fixed issue where YouTube transcripts failed to load in some cases.
2025-04-14
- Dashboard now shows aggregate request count of the past 30 days.
2025-04-11
- Preferred fetch mechanism is now cached per hostname.
- Better formatting of Hacker News URLs (pages, stories, and comments).
- Better formatting of HTML tables into markdown.
- Fixed a bug where accessing a resource that was already in markdown would fail.
2025-04-04
- Logs of recent requests are now displayed in a table on the user dashboard.
- Fixed a bug where requesting a password reset would fail.
2025-04-03
- Public Loom videos with transcripts are now supported.
- Decreased latency of API token CRUD operations in the dashboard.
2025-04-02
- YouTube URLs are now supported. The markdown output contains video metadata (title, description, number of views/comments, etc), plus the transcript of the entire video. Prefix
youtube.com
oryoutu.be
URLs withpure.md/
to try it out. - Fixed a bug where elements that had a
role="region"
attribute were inadvertently removed.
2025-04-01
- Released pure.md MCP server, which teaches MCP clients like Cursor, Windsurf, and Claude Desktop how to reliably fetch web content when they otherwise might be blocked.
- Upgraded virtual capacity of JavaScript-rendering headless Chrome instances.
- Removal of duplicate lines in output is now O(1).
2025-03-26
- Made improvements to search endpoint concurrency.
- Added AbortController to add a timeout to hanging outbound requests.
2025-03-24
- Fixed a UTF-8 encoding issue that appeared on iOS.
2025-03-21
- The following image content types are now supported:
.jpg
,.jpeg
,.png
,.webp
,.svg
. Object detection and summarization models are used to interpret images into markdown. - PDF-to-markdown conversion no longer runs through inference, and is now significantly faster.
2025-03-19
- Fixed a bug on websites that use Tailwind where elements with the
overflow-hidden
CSS property would inadvertently get removed from the output.
2025-03-09
- PDF file types are now supported.
2025-03-04
- Made significant speedups to the HTML-to-markdown conversion using HTMLRewriter.
2025-03-03
- Rate-limiting is now implemented and enforced based on subscription plan.
2025-02-27
- Users can now self-serve sign up with username and password.
- Added the ability to create and delete API tokens in the dashboard.
2025-02-25
- Added a POST endpoint that extracts data using generative AI. A handful of open source inference models are supported, including Meta's Llama and DeepSeek's R1. Data can be summarized as plain text, or extracted as JSON that conforms to a given schema.
2025-02-21
- Added a status page that pings core services at consistent intervals.
2025-02-20
- Deployed proxy network utilizing custom TLS client.
- Single page applications (SPAs) are now rendered with headless Chrome. This enables DOM hydration of JavaScript-heavy websites.
2025-02-12
- Hello world! pure.md is live in production.