What to use for parsing websites in PHP?

Hello!
Just want to stop holivar about the fact that php is for parsing ill-suited technical job is that you need to write in PHP using Yii2 and it's not negotiable by the customer. Moreover, the server has another version of php 5.6.
You need to parse from multiple sites and bring it all into the database - the volume of information large enough.
Not a problem for me to solve it all, even with the use of curl/wget to parse with regexps, and create demons to run continuously in the background. But I would not like to reinvent the wheel, especially do not want to parse html with regular expressions.
Because actually the question to those who have already created something similar in PHP. What is now there are the most suitable tools for such tasks? You may already have something "all in one"? Or at least the parts to assemble everything in one system?
June 10th 19 at 16:53
3 answers
June 10th 19 at 16:55
Solution
Thank you, see! Not all problems are solved, but there is some reserve. And even rasp will not have under Yii2. In General it will be necessary to touch. - Roderick92 commented on June 10th 19 at 16:58
June 10th 19 at 16:57
Solution
Direct all-in-one will not prompt.

To parse is a great thing: Zend\Dom\Query, and the regular season are not good for parsing html.

To actually download Guzzle - a bit more comfortable than curl.

If you need to pump a lot, you farm demons can have on gearman. php.net/manual/ru/book.gearman.php

The question remains - what does Yii2. To make some kind of muzzle report?
Thanks for the reply!
Zend\Dom\Query like the description, there is support for XPath and selectors like jquery, it looks like what you need. I just don't know how he copes with invalid html code. Type
<b><i>text</i></b>
or
<span tag="value" "></span>

It will be necessary to touch.

Guzzle too much. I had my own add-on curl, but it is outdated, even for php 5.3 writing. Again, the ability to install Guzzle from github in my case a big plus.

The question remains - what does Yii2. To make some kind of muzzle report?

On Yii2 already done at the customer many things, including the website and the admin panel. Actually for this adminki and have to do additional module for automatic replenishment and tracking of relevance of the database. - Roderick92 commented on June 10th 19 at 17:00

Have you worked with Zend\Dom\Query ?
Can help in this issue? - Roderick92 commented on June 10th 19 at 17:03
June 10th 19 at 16:59
Do the parsing using xpath.
In General, yii2 will help you more in storage.

Solutions for parsing there. You choose it based on your needs.
To learn the parsing — where to start?
Thanks for the reply.
For me, parsing is not new. Already had to do some serious systems, including in the Chinese sites translation of data and use of proxies, bypass captchas, etc. But I usually used either Perl or Python, and then suddenly the harsh conditions to do with PHP.
Poked in the Internet, but roofing felts not looking for felts there is something worthy in this language for such purposes. - Roderick92 commented on June 10th 19 at 17:02

Find more questions by tags ParsingYii