PyData Edinburgh: A beginner’s guide to web scraping in Python

Welcome to our latest virtual PyData Edinburgh! As in the case of the last one, we're keeping this into the 1-hour format, and we will have a main talk plus a Q&A session at the end. We will meet at 7pm, use the Zoom link which will be provided in the registration email once you RSVP.

As always, let us know if you would like to give a talk, whether a main or a lightning one, we're always looking out for speakers (as you know!).

TALK: A beginner’s guide to web scraping in Python
==================================================

Caterina Constantinescu

In this talk I’ll be sharing my recent experiences with Scrapy, a tool for webscraping in Python. This introductory talk will briefly discuss some alternative tools within the webscraping landscape, then cover a few examples of increasing complexity (i.e., pages to scrape) - all accompanied by a discussion of the relevant Scrapy code generating the results, as well as some minimal examples of data wrangling used on the information extracted. In the interest of full disclosure, I will be discussing this topic from the perspective of a regular R user, but a Python neophyte - so feedback and advice will be welcome.

Bio: Dr Caterina Constantinescu is a data scientist working at Tesco Bank, whose past work ranges across areas such as research methods, national health data, occupational therapy, transport and data for good. Her academic background prior to this involved researching if various emotion-generating stimuli used in lab settings could approximate emotional states occurring in daily life. For several years she have also organised meetings for the R user group in Edinburgh (EdinbR), focusing on statistical programming.

----------------------------------------------------------------------------------------------------

LOGISTICS
===========
1855: Zoom Waiting room will open
1900: Meetup will start - welcome & community announcements followed by our main speaker, and then Q&A
We'll aim to be finished by 2000

REMOTE MEETINGS
===================
Please remember the Code of Conduct applies to a remote meeting as well.

When you join the meeting, your microphone will be muted, and your camera switched on, we miss you all, it's nice to see you! Once the talk starts, feel free to switch your camera off.

We'll use the chat channel in Zoom to deal with any issues, and we'll advise on how we will run the Q&A at the start of the session.

SPONSORS
=============================
While we might not be making such full use of our sponsors as we normally do, we still appreciate all they do for us to allow us to run this group - so thanks to Cathcart Associates, Wood Mackenzie, Solarwinds, Lloyds Banking, Canon Medical and Effini.

CODE OF CONDUCT
====================

Though virtual, our Code of Conduct still applies to all attendees, organisers and speakers. Please take the time to read through if you haven't before, we really appreciate your help to maintain a welcoming and friendly PyData community.

The PyData Code of Conduct (pydata.org) governs this meetup. To discuss any issues or concerns relating to the code of conduct or the behavior of anyone at a PyData meetup, please contact the local group organizers (message us on the meetup page). Please also submit a report of any potential Code of Conduct violation directly to NumFOCUS using the link found at numfocus.org.

to (Europe/London time)

More details and tickets: www.meetup.com

Imported From: www.meetup.com

More Information

We don't know any more about PyData Edinburgh.