2612 commits

Author SHA1 Message Date
Andrew Cantino
085473263f Remove additional nitrous files (#1791) 2016-11-13 15:21:30 -05:00
Andrew Cantino
fca06d6ec2 Nitrous.io is shutting down (#1789) 2016-11-13 14:35:25 -05:00
Akinori MUSHA
b6c1e908c8 Update nokogiri to 1.6.8.1 2016-11-11 15:35:24 +09:00
Akinori MUSHA
74077b0ad4 Auto-focus on Agent Type when creating an agent 2016-11-09 23:25:55 +09:00
Akinori MUSHA
486246e63c Add "image" to Event Description 2016-11-07 12:47:13 +09:00
Akinori MUSHA
1e70b31e7f Merge pull request #1770 from cantino/revert-1071
Revert the special treatment for CDATA introduced in #1071
2016-11-03 12:14:42 +09:00
Akinori MUSHA
50123dca53 Fix event_description broken in full JSON mode or without a template 2016-11-02 21:47:01 +09:00
Akinori MUSHA
898e3d8edb Revert the special treatment for CDATA introduced in #1071 2016-11-02 19:35:16 +09:00
Akinori MUSHA
07effe5eb4 Merge pull request #1766 from cantino/fix_website_agent_url_handling
Fix `url` handling of WebsiteAgent
2016-11-02 13:36:38 +09:00
Akinori MUSHA
e50b8e0d5c Document that url in a created event is automatically resolved 2016-11-02 11:14:02 +09:00
Akinori MUSHA
cc16e854b3 Fix a problem in resolving the url key of a created event
The `url` parameter of handle_data() could hold a string or nil when
invoked from handle_event_data(), in which case resolving `url` in a
created event would fail with a type error.  Moreover, the logic did not
have any guard for URI errors.  This commit should fix them.

Fixes #1765.
2016-11-02 11:14:02 +09:00
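The kind of guard this commit message describes can be sketched as follows. This is a minimal illustration using only Ruby's standard `uri` library, not the actual WebsiteAgent code; the helper name `resolve_url` and its fallback behavior (returning the input unchanged on failure) are assumptions made for the example.

```ruby
require 'uri'

# Hypothetical helper (not Huginn's implementation): resolve `url` against
# `base`, where `base` may be a URI object, a String, or nil.
def resolve_url(base, url)
  # Nothing to resolve against, or nothing to resolve.
  return url if base.nil? || url.nil?

  # to_s lets the same code path accept both String and URI bases.
  URI.join(base.to_s, url.to_s).to_s
rescue URI::Error
  # Malformed input (e.g. a space in the URL) or a relative base:
  # fall back to the unresolved value instead of raising.
  url
end
```

Rescuing `URI::Error` covers both `URI::InvalidURIError` and `URI::BadURIError`, so a bad value in one event does not abort the whole run.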
Akinori MUSHA
4fe35b2a1f Reproduce #1765 2016-11-01 22:29:56 +09:00
Akinori MUSHA
91f096b16f Merge pull request #1743 from cantino/website_agent_can_interpolate_after_extraction
WebsiteAgent can interpolate after extraction

Incorporating feedback from @cantino and @dsander.
2016-11-01 20:20:37 +09:00
Dominik Sander
e3f1429a37 Merge pull request #1764 from strugee/http-to-https
Fix Stubhub test failures
2016-11-01 09:43:22 +01:00
Alex Jordan
651eb50729 Fix another Stubhub HTTP URL 2016-10-31 20:58:06 -07:00
Alex Jordan
77da54ea0c Convert a bunch of HTTP links to HTTPS (#1757) 2016-10-31 19:21:03 -04:00
Akinori MUSHA
58fabb885c Add a new Liquid filter rebase_hrefs 2016-10-29 20:40:52 +09:00
Akinori MUSHA
8b897f5da3 Add Liquid variables _response_.url and _url_ to WebsiteAgent 2016-10-29 20:40:51 +09:00
Akinori MUSHA
fe35df8752 Add a new option template to WebsiteAgent
If given, it is used as a Liquid template for each event created by the
Agent, instead of directly emitting the results of extraction as events.

An existing spec needs to be fixed because WebsiteAgent now has the
`template` option, which may not be a hash of hashes.
2016-10-29 20:40:51 +09:00
Andrew Cantino
9a3290ef40 Language changes 2016-10-28 19:05:46 -04:00
Akinori MUSHA
faa2789a0c Fix the order of receivers in the DotHelper specs
This should fix occasional build failure on CI.
2016-10-27 16:31:24 +09:00
Akinori MUSHA
4f93db60e7 Merge pull request #1754 from cantino/ignore_empty_author
Ignore empty author and link entries in RssAgent.

Fixes #1753.
2016-10-27 13:07:21 +09:00
Akinori MUSHA
1e14358648 Merge pull request #1751 from cantino/encoding_detection
Improve encoding detection in WebsiteAgent
2016-10-27 13:00:56 +09:00
Akinori MUSHA
50b5833a3f Improve encoding detection in WebsiteAgent
Previously, WebsiteAgent always assumed that content with no charset
specified in the Content-Type header was encoded in UTF-8.  This
enhancement makes use of the encoding detector implemented in
Nokogiri for HTML/XML documents, instead of blindly falling back to
UTF-8.

When the document `type` is `html` or `xml`, WebsiteAgent tries to
detect the encoding of a fetched document from the presence of a BOM,
XML declaration, or HTML `meta` tag.

This fixes #1742.
2016-10-27 13:00:37 +09:00
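The detection order named in this message (BOM, then XML declaration, then HTML `meta` tag) can be sketched in plain Ruby. This is a simplified stand-in written for illustration; it is not Nokogiri's detector and not the WebsiteAgent code. The method name `detect_encoding`, the 1024-byte window, and the regular expressions are all assumptions.

```ruby
# Hypothetical sketch: guess the encoding of a fetched document from its
# leading bytes, falling back to `fallback` only when nothing is declared.
def detect_encoding(bytes, fallback = 'UTF-8')
  # Work on a binary copy of the first bytes so invalid sequences are harmless.
  head = bytes.byteslice(0, 1024).b

  # 1. Byte order mark
  return 'UTF-8'    if head.start_with?("\xEF\xBB\xBF".b)
  return 'UTF-16BE' if head.start_with?("\xFE\xFF".b)
  return 'UTF-16LE' if head.start_with?("\xFF\xFE".b)

  # 2. XML declaration: <?xml version="1.0" encoding="..."?>
  if (m = head.match(/<\?xml[^>]*\bencoding\s*=\s*["']([\w.:-]+)["']/i))
    return m[1]
  end

  # 3. HTML meta tag: <meta charset="..."> or
  #    <meta http-equiv="Content-Type" content="text/html; charset=...">
  if (m = head.match(/<meta[^>]*\bcharset\s*=\s*["']?([\w.:-]+)/i))
    return m[1]
  end

  fallback
end
```

For example, `detect_encoding('<meta charset="euc-jp">')` returns `"euc-jp"`, while a document with no BOM and no declaration yields the fallback.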
Akinori MUSHA
4d10132709 Fix a bug where an empty <link> is wrongly parsed
Due to a problem in sax-machine's internals, an empty `<link/>` in RSS
would be parsed to JSON as `{ "href": "no_buffer" }`.  Now empty
`<link/>` elements in RSS and ATOM are simply ignored just like other
collection elements like `<category>`.
2016-10-27 09:17:46 +09:00
Akinori MUSHA
2bb97b53bc Add failing specs for empty <link> elements 2016-10-27 09:17:40 +09:00
Akinori MUSHA
5f5e246552 Use Struct#each_pair 2016-10-27 08:13:43 +09:00
Akinori MUSHA
e5c938aa85 Exclude empty entries from authors 2016-10-27 08:12:48 +09:00
Akinori MUSHA
852f39d480 Rescue error from Mail::Address#name and #address
`Mail::Address.new('')` does not raise any error but calling `name` on
the created instance does.
2016-10-27 08:11:10 +09:00
Akinori MUSHA
445665ee3a Add a failing test for #1753 2016-10-27 08:09:57 +09:00
Akinori MUSHA
cb0e8f68f9 Rename onethingwell.atom to .rss because it is actually an RSS file 2016-10-27 07:31:19 +09:00
Akinori MUSHA
950975d7a8 Merge pull request #1744 from cantino/xpath-string
Use the XPath expression `string(.)` instead of `.//text()`
2016-10-23 09:57:06 +09:00
Akinori MUSHA
7ed40a6901 Use the XPath expression string(.) instead of .//text()
That is the correct way to extract a raw string with all text nodes
concatenated without entity escaping.
2016-10-21 00:23:00 +09:00
Akinori MUSHA
0fcd8e285e Normalize URL in to_uri and uri_expand liquid filters 2016-10-17 15:02:57 +09:00
Akinori MUSHA
22645a9441 Merge remote-tracking branch 'origin/data_output_agent_limits_events_after_ordering' 2016-10-17 14:38:52 +09:00
Akinori MUSHA
308f5ec888 Reset memory of all DataOutputAgents for the last fix to take effect 2016-10-17 14:37:13 +09:00
Dominik Sander
04e0436540 Merge pull request #1736 from dsander/delayed_job_active_record
Fix delayed_job_active_record overriding defaults
2016-10-15 16:37:44 +02:00
Dominik Sander
bf35a49f6c Fix delayed_job_active_record overriding defaults
Because `delayed_job_active_record` was loading active_record too early, it used to override defaults set in our
initializers.
2016-10-15 12:45:23 +02:00
Akinori MUSHA
00fe9d890d Merge pull request #1729 from cantino/retire_ar_mysql_column_charset
Retire ar_mysql_column_charset
2016-10-14 21:35:44 +09:00
Dominik Sander
005f01a4ad Merge pull request #1716 from dsander/liquid-as-object
Add as_object Liquid filter
2016-10-14 12:53:24 +02:00
Dominik Sander
d2cbd04ac8 Add as_object Liquid filter
The `as_object` filter returns the received data/object as is, without casting it to a string as Liquid normally does. It
can be used as a JSONPath replacement or to emit the result of a Liquid filter chain as an array.

`catch` and `throw` need to be used to break out of the Liquid render chain. Liquid aggregates the output of every
expression in an array and [joins](https://github.com/Shopify/liquid/blob/v3.0.6/lib/liquid/block.rb#L147) it together; that
join makes it impossible to get anything other than a string out of a Liquid template.
2016-10-14 12:33:31 +02:00
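The `catch`/`throw` escape described in this message can be illustrated without Liquid itself. The sketch below uses a toy renderer in place of the real template engine; `render`, `render_as_object`, and the `:as_object` tag are hypothetical names chosen for the example and are not Liquid or Huginn APIs.

```ruby
# Toy stand-in for a template engine: every expression's output is
# stringified and joined, so a plain render can only ever yield a String.
def render(expressions)
  expressions.map { |expr| expr.call.to_s }.join
end

# Hypothetical filter: abandon the render chain, carrying the raw object out.
def as_object(value)
  throw :as_object, value
end

# Wrap the render in `catch`; if a filter throws, its raw value is returned
# instead of the joined string.
def render_as_object(expressions)
  catch(:as_object) { render(expressions) }
end
```

With this sketch, `render_as_object([-> { "a" }, -> { 1 }])` returns the string `"a1"`, whereas `render_as_object([-> { as_object([1, 2]) }])` returns the array `[1, 2]` untouched, because `throw` unwinds past the `join`.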
YURA BUYLOV
864b0708a7 Agent form: ace-editor highlighting and theme (#1727) 2016-10-11 19:25:12 -04:00
Akinori MUSHA
ba0d2ee0de Retire ar_mysql_column_charset
Rails 5 has native support for charset/collate.
2016-10-11 22:25:54 +09:00
Enfop
8efe50ad99 manual event agent validate JSON field before form submit (#1728) 2016-10-11 07:53:56 -04:00
Andrew Cantino
005be58d30 Update forecast_io gem and language (#1722) 2016-10-09 10:49:12 -04:00
Dominik Sander
3d709088f1 Merge pull request #1725 from dsander/ruby-version-docs
Update documentation
2016-10-09 13:54:29 +02:00
Dominik Sander
b0ae9581d2 Update documentation
Ruby > 2.2 is now required
Manual installation guide works on Ubuntu 16.04
Remove obsolete `string-scrub` gem
2016-10-09 13:18:07 +02:00
Akinori MUSHA
12534837da Update CHANGES.md 2016-10-07 20:36:39 +09:00
Akinori MUSHA
8a4b10c268 Use rb-kqueue on BSD systems
See: https://github.com/guard/listen#on-bsd
2016-10-07 20:06:28 +09:00
Akinori MUSHA
4bd7822454 Merge branch 'rss_agent-feedjira' 2016-10-07 20:06:04 +09:00