For further technical information, see my GitHub repository.

This project aims to collect data about American soccer teams and players by scraping applicable websites. A few example output files are available in the schedule subdirectory.

Data collection

The project will scrape schedules, rosters, injury reports and news articles using tools like lxml, Scrapy and BeautifulSoup.
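As a rough illustration of the schedule-scraping step, the sketch below turns an HTML table into a list of dictionaries. It is not the project's actual code: the markup in SAMPLE_HTML, the team names, and the names TableParser and parse_schedule are all hypothetical, and it uses Python's standard-library html.parser as a dependency-free stand-in for lxml or BeautifulSoup.

```python
from html.parser import HTMLParser

# Hypothetical markup for illustration only; real sites will differ.
SAMPLE_HTML = """
<table id="schedule">
  <tr><th>Date</th><th>Home</th><th>Away</th></tr>
  <tr><td>2015-03-07</td><td>Team A</td><td>Team B</td></tr>
  <tr><td>2015-03-14</td><td>Team C</td><td>Team A</td></tr>
</table>
"""


class TableParser(HTMLParser):
    """Collect the text of every <th>/<td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_schedule(html):
    """Return one dict per game, keyed by the lower-cased header cells."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    keys = [h.lower() for h in header]
    return [dict(zip(keys, row)) for row in body]


games = parse_schedule(SAMPLE_HTML)
for game in games:
    print(game["date"], game["home"], "vs", game["away"])
```

In a real scraper the HTML would come from an HTTP response rather than a string literal, and a library such as BeautifulSoup or lxml would likely handle malformed markup more gracefully than this minimal parser.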