For further technical information check my GitHub repository.
This project aims to collect data about American soccer teams and players by scraping applicable websites. A few example output files are available in the schedule subdirectory.
It will scrape schedules, rosters, injury reports and news articles using tools like lxml, Scrapy and BeatifulSoup.