Questions tagged [Parsing] (1687)

1
answer

How to find the value that is generated in js?

There is a website opened by the Manager - Network tab, look a download action. First loads the HTML, then see the downloaded files like this: Link This js code as I understand it generates a <script>, because after load the HTML, add different data, probably generated js. In my case I need to get the value: 4924598...
emerald_Lehner asked April 19th 20 at 12:45
1
answer

Parsing Yandex.Market, the problem with the change of page?

Hi! I need a large list of AsRock, the source decided to take Yandex.Market. Faced with a ban + a very strange thing, after 7 pages inclusive issued for 33 cards goods, although in the settings it is still "Show 48". When switching to 7 page and further does not change the Page parameter for the page, thus in a file .csv co...
ron_Kilba asked April 19th 20 at 12:44
1
answer

How to put number of followers in Instagram?

I need to put the number of podpraporschik in histograme, file_get_contents to the page the user is banned after 200 queries. how STE make....
Alexane.Kuhic asked April 19th 20 at 12:36
0
answer

How to change your Python project "ProxyBroker" to parse and check the proxy and make it work indefinitely?

Launched the server on Ubuntu 16.04 Server installing and running this Python project on the check and the parsing of the proxy: https://github.com/constverum/ProxyBroker screenshot Tell me how to change these settings this Python project? - specify which site cecati, - as to leave only the https and socks5 proxies ...
stevie5 asked April 19th 20 at 12:35
1
answer

How to parse with python requests?

Everything is great you know the main library for parsing python's requests. So the question arose, do I need to go to the site every 5-7 seconds, it is obvious that so frequent treatment of the site will perceive as ddos and limit the access to the site. Is there for example a possibility as once open the site and continuo...
Lilla.Ebe asked April 19th 20 at 12:12
3
answers

Bypass the hash on the website during authorization in python?

Trying to set up automatic authorization on the site(the python module requests). Tried on different sites on the website md.samdu.uz/login/index.php in the POST method on the site in addition to go anchor:, username: and password: is the value logintoken: after drive password and try to log logintoken issues every time a n...
Emilia49 asked April 19th 20 at 12:08
2
answers

How to put hhru?

Goal: https://stats.hh.ru/ufa The whole website picked, can't find where the data is loaded. Or rather, how they are processed. The data on the page match the data from here: https://stats.hh.ru/_api/data/data.json?u9lg7r But there just numbers,numbers,numbers and all. Not a single word which can identify a specific block...
Shana_Cormier asked April 19th 20 at 12:07
1
answer

When parsing, all fit without gaps?

import requests from bs4 import BeautifulSoup url = 'https://koronavirusa.site/ru' page = requests.get(url) soup = BeautifulSoup(page.text, "html.parser") news = soup.find('div', class_='sppb-container-inner') print(news.getText()) problem: 1,990,097ЗАРАЗИЛИСЬ125,859УМЕРЛО466,948ВЫЗДОРОВЕЛО all without spaces, how can th...
Noah asked April 19th 20 at 12:02
0
answer

How to put text on the links?

Hello need to put the text on the links. Tried Content downloader, but ran into a problem that it parses everything, and I have only text with a certain minimum number of characters. For example, the text of which is the number of characters 6K+ Tell me how to do it.
kitty86 asked April 19th 20 at 11:58
1
answer

How to avoid hashing on the website during authorization in python?

Trying to set up automatic authorization on the site(the python module requests). Tried on different sites on the website md.samdu.uz/login/index.php in the POST method on the site in addition to go anchor:, username: and password: is the value logintoken: after drive password and try to log logintoken issues every time a n...
ludwig_Hal asked April 19th 20 at 11:54