Show HN: Firecrawl-Simple – Stable fork of Firecrawl optimized for self-hosting
github.comFirecrawl Simple is a stripped down and stable version of firecrawl optimized for self-hosting and ease of contribution.
The upstream firecrawl repo contains the following blurb:
>This repository is in development, and we're still integrating custom modules into the mono repo. It's not fully ready for self-hosted deployment yet, but you can run it locally.
Firecrawl's API surface and general functionality were ideal for our Trieve sitesearch product, but we needed a version ready for self-hosting that was easy to contribute to and scale on Kubernetes. Therefore, we decided to fork and begin maintaining a stripped down, stable version.
Fire-engine, Firecrawl's solution for anti-bot pages, being closed source is the biggest deal breaker requiring us to maintain this fork. Further, our purposes not requiring the SaaS and AI dependencies also pushes our use-case far enough away from Firecrawl's current mission that it doesn't seem like merging into the upstream is viable at this time.
Cool project, a mild pet peeve with this type of thing - I have to read 75% of the README before I find out what it even does. The first bits make a huge assumption about what the reader knows.
It's probably safe to assume that if you're looking for a fork of Firecrawl, you already know what Firecrawl does.
until the fork becomes more or at least similarly popular.
Good point, agreed. I assumed that most people looking at the repo would already be familiar with Firecrawl, but there should be at least a sentence or two explaining what it does regardless.
Finally, people starting to realize that AGPL means you can just fork and remove everything you don’t like (including branding)
The future[0] continues to arrive.
[0]: https://vadosware.io/post/the-future-of-free-and-open-source...