Artificial intelligence (AI) and machine learning (ML) seem to have piqued the interest of automated data collection providers. While web scraping has been around for some time, AI/ML implementations have appeared in the line of sight of providers only recently.
Aleksandras Šulženko, Product Owner at Oxylabs.io, who has been working with these solutions for several years, shares his insights on the importance of artificial intelligence, machine learning, and web scraping.
BN: How has the implementation of AI/ML solutions changed the way you approach development?
AS: AI/ML has an interesting work-payoff ratio. Good models can sometimes take months to write and develop. Until then, you don't really have anything. Dedicated scrapers or parsers, on the other hand, can take up to a day or two. When you have an ML model, however, maintaining it takes a lot less time for the amount of work it covers.
So, there's always a choice. You can build dedicated scrapers and parsers, which will take significant amounts of time and effort to maintain once they start stacking up. The other choice is to have "nothing" for a significant amount of time, but a brilliant solution later on, which will save you tons of time and effort.
There's some theoretical point where developing custom solutions is no longer worth it. Unfortunately, there's no mathematical formula to arrive at the correct answer. You have to make a decision when all the repetitive tasks are just too much of a hog on resources.
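Although no formula gives the correct answer, a back-of-the-envelope model can make the trade-off concrete. The sketch below is only an illustration: every figure in it (days to build a dedicated parser, monthly upkeep per site, months of ML development expressed in days) is a hypothetical placeholder and does not come from the interview.

```python
def cumulative_cost(upfront_days, upkeep_days_per_month, months):
    """Total effort in developer-days after the given number of months."""
    return upfront_days + upkeep_days_per_month * months


def break_even_month(n_sites, horizon_months=60):
    """First month in which the ML approach is no more expensive overall.

    All constants are hypothetical assumptions used for illustration only.
    Returns None if the ML approach does not pay off within the horizon.
    """
    build_days_per_parser = 2       # assumed: "a day or two" per dedicated parser
    upkeep_days_per_parser = 0.5    # assumed monthly upkeep per dedicated parser
    ml_build_days = 120             # assumed: months of model development
    ml_upkeep_days = 2              # assumed monthly upkeep for the one model

    for month in range(1, horizon_months + 1):
        dedicated = cumulative_cost(
            build_days_per_parser * n_sites,
            upkeep_days_per_parser * n_sites,
            month,
        )
        ml = cumulative_cost(ml_build_days, ml_upkeep_days, month)
        if ml <= dedicated:
            return month
    return None


if __name__ == "__main__":
    for sites in (3, 10, 50):
        print(sites, "sites -> break-even month:", break_even_month(sites))
```

Under these made-up numbers, a handful of sites never justifies the model, while a large portfolio does so almost immediately. In practice the inputs are hard to estimate, which is why the decision remains a judgment call.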
BN: Have these solutions had a visible impact on the deliverability and overall viability of the project?
AS: Getting started with machine learning is tough, though. It's still, comparatively speaking, a niche specialization. In other words, you won't find many developers that dabble in ML, and knowing how hard it can be to find one for any discipline, it's definitely a tough river to cross.
Yet, if the business approach to scraping is based on a long-term vision, ML will definitely come in handy sometime down the road. Every good vision has scaling in it, and with scaling come repetitive tasks. These are best handled with machine learning.
Our achievement that we call Adaptive Parser is a great example. It was once almost unthinkable that a machine learning model could be of such high benefit. Now the solution can deliver parsed results from a multitude of e-commerce product pages, irrespective of the differences between them or any changes that happen over time. Such a solution is completely irreplaceable.
BN: In a previous interview, you mentioned the importance of making things more user-friendly for web scraping solutions. Is there any particular reason you would recommend moving development towards no-code implementations?
AS: Even companies that have large IT departments may have issues with integration. Developers are almost always busy. Taking time out of their schedules for integration purposes is tough. Most end-users of the data Scraper APIs deliver, after all, aren't tech-savvy.
Additionally, the departments that would need scraping the most, such as marketing and data analytics, might not have enough sway in deciding the roadmaps of developers. As such, even relatively small hurdles can become impactful enough. Scrapers should now be developed with a non-tech user in mind.
There should be plenty of visuals that allow for a simplified construction of workflows, with a dashboard that's used to deliver information clearly. Scraping is becoming something done by everyone.
BN: What do you think lies in the future of scraping? Will websites become increasingly protective of their data, or will they eventually forego most anti-scraping sentiment?
AS: There are two answers I can give. One is "more of the same". Surely, a boring one, but it's inevitable. Delving deeper into scaling and proliferation of web scraping isn't as fun as the next question -- the legal context.
Currently, it seems as if our position in the industry isn't perfectly decided. Case law forms the basis of how we think and approach web scraping. Yet, it all might change on a whim. We're closely monitoring the developments due to the inherent fragility of the situation.
There's a possibility that companies will realize the value of their data and start selling it on third-party marketplaces. It would reduce the value of web scraping as a whole as you could simply acquire what you need for a small price. Most businesses, after all, need the data and the insights, not web scraping. It's a means to an end.
There's a lot of potential in the grand vision of Web 3.0 -- the initiative to make the whole Web interconnected and machine-readable. If this vision came to life, the whole data gathering landscape would be vastly transformed: the Web would become much easier to explore and organize, parsing would become a thing of the past, and webmasters would get used to the idea of their data being consumed by non-human actors.
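One step in this direction that already exists is structured data embedded in pages, for example schema.org markup in JSON-LD script tags. The sketch below is an illustration that is not taken from the interview: it reads product fields straight from such a block using only the Python standard library, without any layout-specific selectors. The sample page and the `JsonLdExtractor` and `extract_json_ld` names are hypothetical.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collects the parsed contents of <script type="application/ld+json"> tags."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks instead of failing the whole page


def extract_json_ld(html):
    """Return every JSON-LD object found in the given HTML string."""
    parser = JsonLdExtractor()
    parser.feed(html)
    return parser.items


# Hypothetical product page used only for demonstration.
SAMPLE_PAGE = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Product",
 "name": "Example Kettle",
 "offers": {"@type": "Offer", "price": "24.99", "priceCurrency": "EUR"}}
</script>
</head><body><h1>Example Kettle</h1></body></html>
"""

if __name__ == "__main__":
    for item in extract_json_ld(SAMPLE_PAGE):
        print(item["name"], item["offers"]["price"], item["offers"]["priceCurrency"])
```

When a page publishes its data this way, the extraction logic does not depend on the visual layout, which hints at why a fully machine-readable Web would make much of today's parsing work unnecessary.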
Finally, I think user-friendliness will be the focus in the future. I don't mean just the no-code part of scraping. A large part of getting data is exploration -- finding where and how it's stored and getting to it. Customers will often formulate an abstract request and developers will follow up with methods to acquire what is needed.
In the future, I expect, the exploration phase will be much simpler. Maybe we'll be able to take the abstract requests and turn them into something actionable through an interface. In the end, web scraping is breaking away from its shell of being something code-ridden or hard to understand and evolving into a daily activity for everyone.
Photo Credit: Photon photo/Shutterstock
See more here:
Tying Artificial intelligence and web scraping together [Q&A] - BetaNews