Why Beautiful Soup parses incorrectly this unit?
Here's a simplified code, it bug is also reproduced:
soupIndex = BeautifulSoup("'<div class="vk-comment">
The name of the author
The text of the comment
17 minutes ago
template = soupIndex.select_one('.vk-comment')
This variation in the output there are two extra div, but... If the length of the review to increase several times, then begins to copy the block vk-comment-date. I understand the longer a character representation of the block, the greater the number of symbols is duplicated at the end.
UPD: as the parser is the default html5lib, OS - Windows 7. Tried html parser, there is generally some nonsense going on, the img tag, for example, adds a closing tag.