Scraper for XML, CSV, HTML, PDF, Websites
Yashodhan
Support for scraping documents and website out of box with control and filtering.
------------------------------------
BEST PRACTICE: See Text parser, section Data Scraping: https://support.integromat.com/hc/en-us/articles/360006249413#data-scraping
Log In
Eyal Gershon
You can use a module I made under my app that converts an XML string to JSON object:
Alexander Voll
Eyal Gershon: How would I do that?
Eyal Gershon
Alexander Voll: add the app and use the XML to json module.
Alexander Voll
Eyal Gershon: How would I use that to scrape a web page e.g.?
Eyal Gershon
Alexander Voll: it all depends on what you are doing, if you have a url that returns an XML you can use http module to get the content and then my module to parse the XML.
If it's more than that then you need something like parsehub/octoparse/apify.
Thomas
https://simplescraper.io is one of the most simple, reliable and widely compatible of the basic web scrapers I've tried. It lacks some features offered by other basic scrapers, but works well overall. Easy to use with Integromat (call API via HTTP module)
Bert Schoofs
Thomas: Thanks works great. Very easy to configure and use. Unfortunately only 50 "scrapes" in the free plan. scales immediately to 35$ for more. Anyway you saved me a lot of time. tx
Thomas
Bert Schoofs: Yeah, I agree it's a really steep price, since many users (myself included) probably just need some hundreds or low thousands of scrapes, not the minimum 5000. Would be much better if they had plans starting around $15, with options for extra credits if needed. I reckon they'd get more paying customers with that entry point, but I guess the competition isn't there to push them. If I can find a scraper that works as well and as easy, with lower entry plan, I would switch, but I haven't found one yet.
Marcus Quinn
Also maybe of interest: https://ui.vision/
M k
Hi
This capability would be great!
Mary Kay
P
Patrik Šimek
Have you tried Apify? It's an excellent tool for crawling websites, and we already have it integrated.
Nicolas Lapierre
Patrik Šimek: very hard to setup and not user friendly. I would hardly prefer Octoparse!! Integromat even give a link to octoparse here: https://support.integromat.com/hc/en-us/articles/360006249413#data-scraping why not integrate it yet :-D