Skip to content

dli1/tar_data_collection

Repository files navigation

Description

tar_data_collection is a Python script used for constructing data set for CLEF eHealth 2017 Task 2.

https://sites.google.com/site/clefehealth2017/task-2

Dan Li (d.li@uva.nl)

Requirement

Functions

  • batch_download_pid: Download pids for all the systematic reviews
  • extract_pid: Extract pids from downloaded xml and rewrite to new dir
  • batch_download_title: Download title for all the systematic reviews
  • make_release_file: Make release files: topic file or qrel file
  • download_abstract: Download abstract for all the pids
  • trec_format_abstract: Make the downloaded abstracts TRECTEXT format
  • statistics: Statistics of the released data

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages