I am trying to retrieve regular (126,37€) and reduced (101,10€) price information from this website.
Simplified HTML code looks like this:
<div class="vw-productFeatures ">
<ul class="feature-list -price-container">
<li class="feature -price">
<span class="value">126,37</span>
</li>
</ul>
<ul class="feature-list vw-productVoucher">
<li class="voucher-information">Mit Code
<span class="voucher-reduced-price">101,10</span>
</li>
</ul>
</div>
So, I basically go step by step (div class -> ul class -> li class -> span class) and get the innerText at the end.
I am able to get the regular price, however, spanclass.innerText
of reduced price returns empty.
This is the code I am working with:
Function getHTMLelemFromCol(HTMLColIn As MSHTML.IHTMLElementCollection, tagNameIn As String, classNameIn As String) As MSHTML.IHTMLElement
Dim HTMLitem As MSHTML.IHTMLElement
For Each HTMLitem In HTMLColIn
If (HTMLitem.tagName = tagNameIn) Then
If (HTMLitem.className = classNameIn) Then
Set getHTMLelemFromCol = HTMLitem
Exit For
End If
End If
Next HTMLitem
End Function
Function getPrice(webSite As String, divClass As String, ulClass As String, liClass As String, spanClass As String) As String
Dim XMLPage As New msxml2.XMLHTTP60
Dim HTMLDoc As New MSHTML.HTMLDocument
Dim HTMLitem As MSHTML.IHTMLElement
Dim HTMLObjCol As MSHTML.IHTMLElementCollection
XMLPage.Open "GET", webSite, False
XMLPage.send
HTMLDoc.body.innerHTML = XMLPage.responseText
Set HTMLObjCol = HTMLDoc.getElementsByClassName(divClass)
Set HTMLitem = getHTMLelemFromCol(HTMLObjCol, "DIV", divClass) ' Find the div class we are interested in first
Set HTMLitem = getHTMLelemFromCol(HTMLitem.Children, "UL", ulClass) ' Find the ul class we are interested in
Set HTMLitem = getHTMLelemFromCol(HTMLitem.Children, "LI", liClass) ' Find the li class we are interested in
Set HTMLitem = getHTMLelemFromCol(HTMLitem.Children, "SPAN", spanClass) ' Find the span class we are interested in
getPrice = HTMLitem.innerText
End Function
Sub Run()
Dim webSite As String, divClass As String, ulClass As String, liClass As String, spanClass As String, regularPrice As String, reducedPrice As String
webSite = "https://www.rakuten.de/produkt/msi-b450-tomahawk-max-atx-mainboard-4x-ddr4-max-64gb-1x-dvi-d-1x-hdmi-14-1x-usb-c-31-2843843890"
divClass = "vw-productFeatures "' Get the regular price
ulClass = "feature-list -price-container"
liClass = "feature -price"
spanClass = "value"
regularPrice = getPrice(webSite, divClass, ulClass, liClass, spanClass)
' Get the reduced price
ulClass = "feature-list vw-productVoucher -hide"
liClass = "voucher-information"
spanClass = "voucher-reduced-price"
reducedPrice = getPrice(webSite, divClass, ulClass, liClass, spanClass)
Debug.Print "Regular price: "& regularPrice
Debug.Print "Reduced price: "& reducedPrice
End Sub
The output I am getting:
Regular price: 126,37
Reduced price:
Debugger shows that it is able to find the correct span class, but it does not have any attribute (including innerText) that has the price information.
How can I get the reduced price information?