When I was trying to scrape a JavaScript-heavy website with my Raspberry Pi using Python, I ran into some interesting issues that needed to be solved.
I found that modules like requests, requests_html and urllib did not deliver the complete content of JavaScript websites containing a shadow DOM (#shadow-root). When searching for a solution I found some suggestions, like the use of PhantomJS or other discontinued modules.
The solution I found was using ChromeDriver in headless mode. But the version I got my hands on kept throwing errors about the version of the browser.
After extensive searching I found the solution:
1. Download the latest chromedriver from:
https://github.com/electron/electron/releases
(get the armv7 version)
2. Install this using the instructions I found on:
https://www.raspberrypi.org/forums/viewtopic.php?t=194176
- mkdir /tmp
- wget <url latest version arm7>
- unzip <zip file>
- sudo mv chromedriver /usr/local/bin
- sudo chmod +x /usr/local/bin/chromedriver
- sudo apt-get install libminizip1
- sudo apt-get install libwebpmux2
- sudo apt-get install libgtk-3-0
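After these steps you can confirm from Python that the driver actually ended up on your PATH; a small stdlib sketch (the helper name is mine, and it assumes the install location used above):

```python
import shutil

def find_driver(name="chromedriver", path=None):
    """Return the full path of an executable, or None if it is not found.

    `path` defaults to the current PATH, so after the install steps above
    find_driver() should return /usr/local/bin/chromedriver.
    """
    return shutil.which(name, path=path)

if __name__ == "__main__":
    print(find_driver() or "chromedriver not on PATH")
```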
In your code, add these two arguments when you start the driver:
- --headless
- --disable-gpu
3. Update the Chromium browser
When trying to execute the script I still got the error about the Chromium version. I was able to solve that using:
- sudo apt-get install -y chromium-browser
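The underlying problem is that chromedriver only works with a browser whose major version matches its own; a quick way to express that check in Python (the version strings below are made-up examples):

```python
def major_version(version: str) -> int:
    """Extract the major version from a string like '90.0.4430.212'."""
    return int(version.split(".")[0])

def versions_match(driver_version: str, browser_version: str) -> bool:
    """Chromedriver and Chromium must share the same major version."""
    return major_version(driver_version) == major_version(browser_version)

# Example: a v90 driver works with a v90 browser, but not a v89 one.
print(versions_match("90.0.4430.212", "90.0.4430.85"))   # True
print(versions_match("90.0.4430.212", "89.0.4389.114"))  # False
```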
IT WORKS!
Now the script finally ran.
The Python Script to get the page content
from selenium import webdriver
import time
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.chrome.options import Options

# Define the site to be opened
site = "http://…."

# Set Chrome options
chrome_options = Options()
chrome_options.add_argument("--headless")
chrome_options.add_argument("--disable-gpu")

# Open Chrome headless
driver = webdriver.Chrome(options=chrome_options)
driver.set_page_load_timeout(20)
driver.get(site)
4. Analyze the content of the page
With the content of the page in driver it is possible to decompose the page further.

content1 = driver.find_element(By.TAG_NAME, '…..')
shadow_content1 = expand_shadow_element(content1)
To get access to the shadow element, the function below needs to be used:

# Function to expand a shadow element into usable content
def expand_shadow_element(element):
    shadow_root = driver.execute_script('return arguments[0].shadowRoot', element)
    return shadow_root
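Once the shadow content is reachable, its markup (for example taken from an element's innerHTML) can be picked apart with Python's standard library; a minimal sketch, assuming you already have the HTML in a string (the sample fragment here is invented for illustration):

```python
from html.parser import HTMLParser

class TextCollector(HTMLParser):
    """Collect the non-empty text nodes of an HTML fragment."""
    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append(text)

def extract_text(html: str) -> list:
    """Return the visible text pieces of an HTML string, in order."""
    parser = TextCollector()
    parser.feed(html)
    return parser.chunks

# Invented sample of what a shadow root's innerHTML might look like:
sample = "<div><h1>Title</h1><p>Some shadow content</p></div>"
print(extract_text(sample))  # ['Title', 'Some shadow content']
```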