Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Article Extraction - Wordpress
#1
Hi everyone!
Please be warned, I am a doctoral candidate who realized that there is no way around learning how to use Python, but I have come accross numerous roadblocks where I hope you may be able to help?

I have managed to scrape/crawl Twitter feeds of selected users, but now I am looking to extract all articles from wordpress pages (excl. images), including primarily the following:
- Title
- Article Link
- Time & Date
- Text
Optimally as an output within CSV / Excel.

I have come accross the following website:
https://indianpythonista.wordpress.com/2...iful-soup/
https://www.digitalocean.com/community/t...d-python-3
https://zach-adams.com/2015/04/python-sc...wordpress/

But truly am struggling to get any of these codes, in all of its variants to work. (Scrapy wont install on my PyCharm, so I resorted to BeautifulSoup.)

A sample of websites I want to scrape (particularly subsections may include infite scrolling):
1) https://electrek.co/guides/tesla/
2) https://www.teslarati.com/tag/tesla/

Is there one of you out there who would be able to give a hand to amend on of the beaoutiful-soup scripts to the above 2 sample pages? I would take it from there and use it on any other wordpress blogs, but I guess I need a starting hand!

Appreciate your time! Have a great weekend and stay safe.
Reply


Messages In This Thread
Article Extraction - Wordpress - by svzekio - Jun-07-2020, 12:33 PM
RE: Article Extraction - Wordpress - by snippsat - Jun-07-2020, 03:02 PM
RE: Article Extraction - Wordpress - by svzekio - Jun-08-2020, 08:04 AM
RE: Article Extraction - Wordpress - by svzekio - Jun-08-2020, 11:21 AM
RE: Article Extraction - Wordpress - by snippsat - Jun-08-2020, 01:19 PM
RE: Article Extraction - Wordpress - by snippsat - Jul-10-2020, 12:49 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Python, Salesforce and WordPress arthurk88 1 788 Nov-21-2023, 10:13 AM
Last Post: Larz60+
  Python API for Wordpress Simlock 4 3,803 May-23-2022, 06:47 PM
Last Post: LaverneDejardin
Question Scraping Wikipedia Article (Name in 1 column & URL in 2nd column) ->CSV! Anyone? BrandonKastning 4 2,101 Jan-27-2022, 04:36 AM
Last Post: Larz60+
  how to run a python script in the background on my wordpress website rockie12us 3 2,785 Aug-13-2021, 05:39 PM
Last Post: ndc85430
  Python Scrapy Date Extraction Issue tr8585 1 3,421 Aug-05-2020, 04:32 AM
Last Post: tr8585
  If I use a php script, like WordPress and Elgg, can I program an plugin by Python? Abdulaziz 0 1,648 Jun-23-2020, 06:54 PM
Last Post: Abdulaziz
  Follow Up: Web Calendar based Extraction AgileAVS 0 1,549 Feb-23-2020, 05:39 AM
Last Post: AgileAVS
  Post comments to Wordpress Blog SergeyLV 1 2,530 Aug-01-2019, 01:38 AM
Last Post: Larz60+
  Download article without photo caption Helene_python 2 2,498 Feb-14-2019, 01:13 PM
Last Post: snippsat
  fb data extraction error periraviteja 1 2,241 Jan-05-2019, 01:07 AM
Last Post: stullis

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020