Issue #2
The author of the page is Ирина Гумыркина.
- Арман
- The author name is part of the article text. It has no clear selector, such as an id or class, to target. It was added by the author of the text, so it may be present, absent, or in some other format.
While extracting it with regex pattern matching is doable, it would be error-prone and not 100% accurate. There are even cases where it appears mid-paragraph, in the middle of the text: http://kaztag.kz/interview/detail.php?ID=261387
After all, even though the author name is not inside the "Date, Author" box, it is not missing from the IV. Rather than fiddling with picking it out of the article text, I think it is better to leave it as part of the text.
- Declined by admin
- It seems that it is not possible to consistently parse author names from articles on this website. In such cases, it is much better to omit author names than to risk generating wrong ones.
- Type of issue
- IV page is missing essential content
- Reported
- Jun 3, 2017
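
To illustrate why the regex approach mentioned in the reply is fragile, here is a minimal Python sketch. It assumes, purely for illustration, that a byline is a final paragraph made of exactly two capitalized Cyrillic words; the actual format on the site is not known here, and `BYLINE` and `guess_author` are hypothetical names. The sample paragraphs other than the author name from this issue are invented.

```python
import re

# Assumed (hypothetical) byline shape: a paragraph that is nothing but
# two capitalized Cyrillic words, e.g. "Name Surname".
BYLINE = re.compile(r"^\s*([А-ЯЁ][а-яё]+ [А-ЯЁ][а-яё]+)\s*$")


def guess_author(paragraphs):
    """Return the last paragraph if it looks like a bare byline, else None."""
    if not paragraphs:
        return None
    match = BYLINE.match(paragraphs[-1])
    return match.group(1) if match else None


# Works when the byline is a clean final paragraph.
clean = ["Текст статьи.", "Ирина Гумыркина"]

# Misses a name embedded in the middle of a paragraph.
embedded = ["Беседовала Ирина Гумыркина, затем текст продолжается.",
            "Последний абзац статьи."]

# Wrongly accepts any two capitalized words that end the article.
false_positive = ["Текст статьи.", "Новый Год"]

for sample in (clean, embedded, false_positive):
    print(guess_author(sample))
```

Under these assumptions the heuristic finds the clean byline, returns nothing for the embedded name, and reports a non-name as the author in the last case, which is the kind of wrong output the admin's decision avoids.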