ee78e7613f
cURL is a powerful library specifically designed to connect to many different types of servers with different types of protocols. For more detailed information refer to the PHP cURL manual: - http://php.net/manual/en/book.curl.php Due to this change some parameters for the getContents function were necessary (also applies to getSimpleHTMLDOM and getSimpleHTMLDOMCached): > $use_include_path removed This parameter has never been used and doesn't even make sense in this context; If set to true file_get_contents would also search for files in the include_path (specified in php.ini). > $context replaced by $header and $opts The $context parameter allowed for customization of the request in order to change how file_get_contents would acquire the data (i.e. using POST instead of GET, sending custom header, etc...) cURL also provides facilities to specify custom headers and change how it communicates to severs. cURL, however, is much more advanced. - $header is an optional parameter (empty by default). It receives an array of strings to send in the HTTP request header. See 'CURLOPT_HTTPHEADER': "An array of HTTP header fields to set, in the format array('Content-type: text/plain', 'Content-length: 100')" - php.net/manual/en/function.curl-setopt.php - $opts is an optional parameter (empty by default). It receives an array of options, where each option is a key-value-pair of a cURL option (CURLOPT_*) and it's associated parameter. This parameter accepts any of the CURLOPT_* settings. Example (sending POST instead of GET): $opts = array( CURLOPT_POST => 1, CURLOPT_POSTFIELDS => '&action=none' ); $html = getContents($url, array(), $opts); Refer to the cURL setopt manual for more information: - php.net/manual/en/function.curl-setopt.php > $offset and $maxlen removed These options were supported by file_get_contents, but there doesn't seem to be an equivalent in cURL. Since no caller uses them they are safe to remove. Compressed data / Encoding By using cURL instead of file_get_contents RSS-Bridge no longer has to handle compressed data manually. See 'CURLOPT_ENCODING': "[...] Supported encodings are "identity", "deflate", and "gzip". If an empty string, "", is set, a header containing all supported encoding types is sent." - http://php.net/manual/en/function.curl-setopt.php Notice: By default all encoding types are accepted (""). This can be changed by setting a custom option via $opts. Example: $opts = array(CURLOPT_ENCODING => 'gzip'); $html = getContents($url, array(), $opts); Proxy The proxy implementation should still work, but there doesn't seem to be an equivalent for 'request_fulluri = true'. To my understanding this isn't an issue because cURL knows how to handle proxy communication. |
||
---|---|---|
bridges | ||
cache | ||
caches | ||
formats | ||
lib | ||
static | ||
vendor/simplehtmldom | ||
.gitattributes | ||
.gitignore | ||
.travis.yml | ||
CHANGELOG.md | ||
CONTRIBUTING.md | ||
index.php | ||
phpcs.xml | ||
README.md | ||
scalingo.json | ||
UNLICENSE |
rss-bridge
rss-bridge is a PHP project capable of generating ATOM feeds for websites which don't have one.
Supported sites/pages (main)
Bandcamp
: Returns last release from bandcamp for a tagCryptome
: Returns the most recent documents from Cryptome.orgDansTonChat
: Most recent quotes from danstonchat.comDuckDuckGo
: Most recent results from DuckDuckGo.comFacebook
: Returns the latest posts on a page or profile on FacebookFlickrExplore
: Latest interesting images from FlickrGooglePlus
: Most recent posts of user timelineGoogleSearch
: Most recent results from Google SearchIdenti.ca
: Identica user timeline (Should be compatible with other Pump.io instances)Instagram
: Most recent photos from an Instagram userOpenClassrooms
: Lastest tutorials from fr.openclassrooms.comPinterest
: Most recent photos from user or searchScmbBridge
: Newest stories from secouchermoinsbete.frThePirateBay
: Returns the newest indexed torrents from The Pirate Bay with keywordsTwitter
: Return keyword/hashtag search or user timelineWikipedia
: highlighted articles from Wikipedia in English, German, French or EsperantoYouTube
: YouTube user channel, playlist or search
Plus many other bridges to enable, thanks to the community
Output format
Output format can take several forms:
Atom
: ATOM Feed, for use in RSS/Feed readersHtml
: Simple html page.Json
: Json, for consumption by other applications.Mrss
: MRSS Feed, for use in RSS/Feed readersPlaintext
: raw text (php object, as returned by print_r)
Screenshot
Welcome screen:
RSS-Bridge hashtag (#rss-bridge) search on Twitter, in ATOM format (as displayed by Firefox):
Requirements
- PHP 5.6, e.g.
AddHandler application/x-httpd-php56 .php
in.htaccess
openssl
extension enabled in PHP config (php.ini
)allow_url_fopen=1
inphp.ini
Enabling/Disabling bridges
By default, the script creates whitelist.txt
and adds the main bridges (see above). whitelist.txt
is ignored by git, you can edit it:
- to enable extra bridges (one bridge per line)
- to disable main bridges (remove the line)
- to enable all bridges (just one wildcard
*
as file content)
New bridges are disabled by default, so make sure to check regularly what's new and whitelist what you want!
Deploy
Authors
We are RSS Bridge Community, a group of developers continuing the project initiated by sebsauvage, webmaster of sebsauvage.net, author of Shaarli and ZeroBin.
Patch/contributors :
- Yves ASTIER (Draeli) : PHP optimizations, fixes, dynamic brigde/format list with all stuff behind and extend cache system. Mail : contact /at\ yves-astier.com
- Mitsukarenai : Initial inspiration, collaborator
- ArthurHoaro
- BoboTiG
- Astalaseven
- qwertygc
- Djuuu
- Anadrark
- Grummfy
- Polopollo
- 16mhz
- kranack
- logmanoriginal
- polo2ro
- Riduidel
- superbaillot.net
- vinzv
- teromene
- nel50n
- nyutag
- ORelio
- Pitchoule
- pit-fgfjiudghdf
- aledeg
- alexAubin
- cnlpete
- corenting
- Daiyousei
- erwang
- gsurrel
- kraoc
- lagaisse
- az5he6ch
- niawag
- JeremyRand
- mro
Licenses
Code is Public Domain.
Including PHP Simple HTML DOM Parser
under the MIT License
Technical notes
- There is a cache so that source services won't ban you even if you hammer the rss-bridge with requests. Each bridge can have a different duration for the cache. The
cache
subdirectory will be automatically created and cached objects older than 24 hours get purged. - To implement a new Bridge, follow the specifications and take a look at existing Bridges for examples.
- To enable debug mode (disabling cache and enabling error reporting), create an empty file named
DEBUG
in the root directory (next toindex.php
). - For more information refer to the Wiki
Rant
Dear so-called "social" websites.
Your catchword is "share", but you don't want us to share. You want to keep us within your walled gardens. That's why you've been removing RSS links from webpages, hiding them deep on your website, or removed feeds entirely, replacing it with crippled or demented proprietary API. FUCK YOU.
You're not social when you hamper sharing by removing feeds. You're happy to have customers creating content for your ecosystem, but you don't want this content out - a content you do not even own. Google Takeout is just a gimmick. We want our data to flow, we want RSS or ATOM feeds.
We want to share with friends, using open protocols: RSS, ATOM, XMPP, whatever. Because no one wants to have your service with your applications using your API force-feeding them. Friends must be free to choose whatever software and service they want.
We are rebuilding bridges you have wilfully destroyed.
Get your shit together: Put RSS/ATOM back in.