Commit graph

295 commits

Author SHA1 Message Date
Andrew Dolgov
59e83455f1 add some more bayes stuff 2015-06-17 15:15:04 +03:00
Andrew Dolgov
35c12dc40a add workaround for mysql only accepting subset of unicode characters 2015-06-16 08:17:49 +03:00
Andrew Dolgov
6de3a1be7b allow plugins modify article score 2015-06-15 20:01:07 +03:00
Andrew Dolgov
168cf351e2 add housekeeping task for orphaned counter cache entries 2015-03-06 14:14:51 +03:00
Andrew Dolgov
a29fe12195 add auto_assign_labels plugin; allow article filter plugins to add labels to articles 2015-03-04 00:26:52 +03:00
Andrew Dolgov
0c6f7b314a remove image.php; put cached image endpoint to public.php 2015-02-27 14:51:26 +03:00
Andrew Dolgov
e6c826fa2f remove AUTO_ASSIGN_LABELS (confusing, idiots keep enabling it and complain about the results) 2015-02-21 13:08:50 +03:00
Andrew Dolgov
c9299c28c6 comment out old-school test trgm code 2015-01-19 12:51:59 +03:00
Andrew Dolgov
e44280a0bd fix typo (refs #818) 2014-11-02 20:08:58 +03:00
Andrew Dolgov
3cc02a5b88 Merge branch 'master' of github.com:gothfox/Tiny-Tiny-RSS 2014-11-02 19:37:27 +03:00
Andrew Dolgov
6383c9c53a when registering feed title, consider its maximum length (closes #818) 2014-11-02 19:36:58 +03:00
Grant Pannell
da7e3e91bf Fix allow duplicate posts feature with optimizations from 1.14 2014-11-02 16:40:10 +10:30
Andrew Dolgov
522e8b3500 update debugging: add force_rehash 2014-08-21 13:20:41 +04:00
Jonathon Padfield
5d2e74a95e Minor refactor for php v5.3, which doesn't support immediate dereferencing of array entries 2014-08-20 01:16:52 +00:00
Andrew Dolgov
02b0348a8a mention that language passed is read only 2014-08-19 14:05:06 +04:00
Andrew Dolgov
73f9378d26 update: include detected language to article structure passed to filters 2014-08-19 14:02:28 +04:00
Andrew Dolgov
2d9c568439 add some timing information to update_daemon_common 2014-08-19 10:32:01 +04:00
Andrew Dolgov
d5e490952a fix language detection; remove f_guid debugging 2014-08-19 10:18:32 +04:00
Andrew Dolgov
eb16bd9f01 pass correct instance of pluginhost to calculate_article_hash 2014-08-18 23:11:29 +04:00
Andrew Dolgov
af244f927e tweak article hash function 2014-08-18 21:55:48 +04:00
Andrew Dolgov
b1840673cd simplify article filter chaining mechanism, do not try to process already stored articles 2014-08-18 18:32:16 +04:00
Andrew Dolgov
0084f0d1ba update stored article after filter has finished processing to prevent plugins from reverting changes of earlier invoked plugins (experimental) 2014-08-12 19:36:29 +04:00
Felix Eckhofer
523bd90baf Store size of enclosure to database 2014-07-15 16:23:46 +02:00
Dave Zaikos
a3d5018490 Updated cache_images() to use _MIN_CACHE_IMAGE_SIZE constant when checking file size. New constant is defined upon including rssfuncs.php if not already defined. 2014-06-26 17:22:59 -04:00
wltb
01465325b4 Add subscribe hook, give more information to fetch_feed hook 2014-05-19 14:06:52 +02:00
Andrew Dolgov
15d0528a9f rssfuncs: mention local cache filename for easier debugging 2014-04-30 08:49:46 +00:00
Roland Angerer
f77c72038f deactivated FeedParser re-use as it is not multiuser safe
different users may have different fetch feed and feed fetched hooks
2014-03-14 08:36:44 +01:00
Roland Angerer
95beaa1471 fixed feed caching logic for multiuser scenario
only vanilla feed data may be cached as every user may have different
fetched feed hooks to apply
2014-03-14 08:35:39 +01:00
Andrew Dolgov
e95bb179a8 fix for article language breaking sql insert 2014-03-06 09:02:14 +04:00
Andrew Dolgov
5ba1ddd409 rssfuncs: remove unused variables 2014-02-19 15:17:29 +04:00
Andrew Dolgov
537eca4000 rssfuncs: fix typo 2013-12-18 14:22:24 +04:00
Andrew Dolgov
f08426e3d3 when checking feeds to update, respect only default update setting stored in default profile 2013-12-18 13:55:50 +04:00
Andrew Dolgov
5a450b8760 add workaround for languagedetect idiotic shit of some kind 2013-11-13 20:36:15 +04:00
Andrew Dolgov
d8179cb9d9 pubsubhubbub: use atom rel=self link (when available) when subscribing to push-enabled feeds 2013-11-11 22:52:15 +04:00
Andrew Dolgov
4ad04ee227 report all libxml errors in updater debug output
force utf8 encoding if devforceupdate is on
parser: try to convert non-unicode feeds with specified encoding to utf8
before trying to remove dangling utf8 characters in case of utf8-related
libxml errors because doing so produces garbage content
2013-10-29 12:15:26 +04:00
Andrew Dolgov
88edaa9344 daemon: cache parser object while processing a batch of feeds with same url on first success 2013-10-25 14:42:43 +04:00
Andrew Dolgov
5ddd2705ca make language detection optional (closes #779) 2013-09-27 13:45:21 +04:00
Andrew Dolgov
1357a263be include title when detecting article language 2013-09-17 12:28:31 +04:00
Andrew Dolgov
4f71d7431c replace suppress debugging kludge with a more flexible function (fixes logging with update.php --feeds being stopped after first feed) 2013-09-15 23:02:21 +04:00
Andrew Dolgov
a33558a61e pass logfile to child tasks if locking is possible, lock logfile before writing, add kludge to prevent update_rss_feed unneeded debugging go into master logfile 2013-09-02 12:33:59 +04:00
Andrew Dolgov
f73e03e000 pass feed information to article filters 2013-09-02 10:03:04 +04:00
Andrew Dolgov
5c54e68388 support media:description for media: enclosures 2013-08-05 12:26:09 +04:00
Andrew Dolgov
edba269b6f fix entries not inserted properly when no languages are detected 2013-08-02 16:03:13 +04:00
Andrew Dolgov
00f22824d7 rss: force language to 2 characters; run house keeping hooks properly 2013-08-02 14:47:34 +04:00
Andrew Dolgov
8e47022036 add hook_house_keeping 2013-08-02 14:06:18 +04:00
Andrew Dolgov
2fc4d981d1 remove unused old-style image rewriting 2013-08-02 14:04:00 +04:00
Andrew Dolgov
6b4617970f add text_languagedetect to guess article language for better hyphenation (bump schema) 2013-07-31 10:30:17 +04:00
Andrew Dolgov
0997c2bd62 Revert "add temporary hack to store original unhashed guid into cached_content for debugging"
This reverts commit 8096e309a5.
2013-07-14 21:48:14 +04:00
Andrew Dolgov
8096e309a5 add temporary hack to store original unhashed guid into cached_content for debugging 2013-07-11 21:40:26 +04:00
Andrew Dolgov
c052e25a8b remove unused cached_content 2013-07-10 16:55:55 +04:00
Andrew Dolgov
420940fa90 do not catchup newly subscribed feeds 2013-06-25 10:01:41 +04:00
Andrew Dolgov
6791af0cfd pass feed id to feed_fetched and fetch_feed hooks 2013-05-20 15:28:56 +04:00
Andrew Dolgov
ee65bef405 add HOOK_FETCH_FEED 2013-05-20 15:26:53 +04:00
Andrew Dolgov
0ad2013bd2 update_rss_feed: remove unused override_url parameter 2013-05-20 15:20:14 +04:00
Andrew Dolgov
47673e6611 add fetch_url and owner_uid to HOOK_FEED_FETCHED 2013-05-18 09:22:06 +04:00
Andrew Dolgov
d1f3fa9791 try to force-convert feed data to utf8 2013-05-17 20:09:43 +04:00
Andrew Dolgov
fd687300bf Revert "subscribe_to_feed: use already fetched data when updating initially"
This reverts commit 23923fb29b.
2013-05-08 19:22:33 +04:00
Andrew Dolgov
23923fb29b subscribe_to_feed: use already fetched data when updating initially 2013-05-07 15:34:20 +04:00
Rasmus Lerdorf
6f7798b643 Fixing bugs found by static analysis 2013-05-07 00:35:10 -07:00
Andrew Dolgov
f4ae0f053b update: remove debugging block 2013-05-02 10:26:32 +04:00
Andrew Dolgov
566417c4e7 restore updstart threshold 2013-05-02 02:31:32 +04:00
Andrew Dolgov
5d3e5a1bb2 simplify feed cache age handling (reduce caching to sequential updates) 2013-05-02 02:30:53 +04:00
Andrew Dolgov
5de4010487 disable marking for the time being 2013-05-02 02:21:11 +04:00
Andrew Dolgov
5ef8409700 move the precautionary timestamp bumping 2013-05-02 02:20:34 +04:00
Andrew Dolgov
5d0d3887af add _DISABLE_HTTP_304 2013-05-02 02:11:11 +04:00
Andrew Dolgov
15c762beda updater: show owner_uid for checked feeds 2013-05-02 02:02:49 +04:00
Andrew Dolgov
52637d3b30 remove cache valid bailout clause 2013-05-02 01:36:17 +04:00
Andrew Dolgov
865a3ed6a0 change feed cache file extension 2013-05-02 01:33:02 +04:00
Andrew Dolgov
d4992d6b48 add support for dc:subject and slash:comments 2013-05-01 20:55:08 +04:00
Andrew Dolgov
ee78f81ccd update: better tag-related debugging info 2013-05-01 20:33:59 +04:00
Andrew Dolgov
852d4ac890 support RDF-XML feeds 2013-05-01 20:30:52 +04:00
Andrew Dolgov
fd0daa9b55 remove simplepie 2013-05-01 19:14:48 +04:00
Andrew Dolgov
431e27851b actually save feed xml in the cache 2013-05-01 18:10:27 +04:00
Andrew Dolgov
99429e57e4 remove simplepie entity decode hacks 2013-05-01 18:07:05 +04:00
Andrew Dolgov
b8f316dc28 change caching to save xml data, remove RDF init section 2013-05-01 17:56:21 +04:00
Andrew Dolgov
04d2f9c831 add basic rss support 2013-05-01 17:38:16 +04:00
Andrew Dolgov
cd07592c29 add basic tinyparser/atom 2013-05-01 17:04:57 +04:00
Andrew Dolgov
65c8d5e76d update: set last_updated to now() when process starts 2013-05-01 15:43:56 +04:00
Andrew Dolgov
39ede9862f experimental: decode numerical utf entities on import in entry title 2013-04-29 16:59:36 +04:00
Andrew Dolgov
efe46a3b53 Merge pull request #167 from KonishchevDmitry/pr-allow-slash-in-filter-regex
Allow slashes in filter regular expressions
2013-04-27 02:10:27 -07:00
Dmitry Konishchev
7b80b5e160 Match each tag separately against user filter regular expression
Each article's tag should be matched against the user's filter regular
expression separately. The current matching is confusing when you want to
match an exact tag: you would expect to write "^tag$", but now have to write
"(^|,)tag(,|$)", which is very inconvenient and requires knowledge of
how the matching is processed.
2013-04-26 16:30:25 +04:00
Dmitry Konishchev
ffa1bd7b19 Allow slashes in filter regular expressions
User's regular expressions need escaping before passing them to
preg_match()
2013-04-26 15:46:48 +04:00
Andrew Dolgov
90e5f4f1de base if-modified-since on last received article, not feed update timestamp 2013-04-25 18:42:48 +04:00
Andrew Dolgov
23283f11a3 fetch: better checking for 1970- date 2013-04-25 16:12:49 +04:00
Andrew Dolgov
aafd55bab6 do not needlessly mark icons as failed 2013-04-24 19:01:51 +04:00
Andrew Dolgov
84ceb6bd2e fix floicon failed message not respecting debugging setting 2013-04-24 16:48:47 +04:00
Andrew Dolgov
e2cf81e214 unify housekeeping stuff, increase spawn interval 2013-04-24 14:54:59 +04:00
Andrew Dolgov
654650857f reduce daemon chattiness a bit 2013-04-24 14:26:05 +04:00
Andrew Dolgov
8cabc200d5 increase updstart locking interval to prevent concurrent updates; decrease batch size for the daemon 2013-04-24 14:23:38 +04:00
Andrew Dolgov
fce451a4f7 fix updating for pgsql
increase default batch size
remove random check between feedbrowser or update task executing
2013-04-24 13:43:40 +04:00
Andrew Dolgov
81c2066345 updating: do not use cache timestamp if it is unusable for current feed; sort chronologically instead of random 2013-04-24 13:21:40 +04:00
Andrew Dolgov
6ee0d4b05c do not recalculate avg color repeatedly 2013-04-23 22:37:06 +04:00
Andrew Dolgov
560cbd8c0d attempt to recalculate favicon avg color if it changes 2013-04-23 22:33:23 +04:00
Andrew Dolgov
36490f11a3 add a hack so that if floicon fails on some icon, it won't be processed again 2013-04-23 22:30:06 +04:00
Daniel Andersson
1134497129 Fix doubly applied basepaths to lockfiles. 2013-04-19 21:25:26 +02:00
Andrew Dolgov
68cccafcad remove debug_enabled global 2013-04-19 13:42:40 +04:00
Andrew Dolgov
9ec103525d update_rss_feed: simplify debug reporting 2013-04-19 13:26:10 +04:00
Andrew Dolgov
ebec81a6fb subscribe: verify XML before adding to the database; fetch: try to work around entity problems if initial parsing fails 2013-04-19 13:17:28 +04:00
Andrew Dolgov
1367bc3f5e trim received feed data on fetch 2013-04-19 12:35:30 +04:00
Andrew Dolgov
96db9a14d9 do not try to calculate icon avg color if GD is not present 2013-04-19 08:42:35 +04:00