tools for preprocessing source materials, including scanning, scraping, clipping, and cleaning of PDFs and HTML. Manipulating semi-structured data with advanced regularexpression techniques. Proficient in C/C++ with a deep understanding of recursion, delegation, and dependency injection. Expertise in string manipulation and regularmore »
tools for preprocessing source materials, including scanning, scraping, clipping, and cleaning of PDFs and HTML. Manipulating semi-structured data with advanced regularexpression techniques. Proficient in C/C++ with a deep understanding of recursion, delegation, and dependency injection. Expertise in string manipulation and regularmore »
Experience of using annotation tools e.g.Labelstudio, LightTag, Brat (A and I) Knowledge Essential Notable knowledge and experience of pattern matching languages such as regularexpressions Good working knowledge of Microsoft Office applications and other common desktop software such as antivirus and productivity suites Good working knowledge of data more »