Serrano

User-friendly and extensible web data extraction

Irena Holubová, Tomáš Novella

Description

Serrano is a new wrapping language with three design goals: (1) the ability to run in a restricted environment, such as a browser extension; (2) extensibility, to balance the trade-off between the expressiveness of the command set and safety; and (3) built-in processing capabilities that eliminate the need for additional programs to clean the extracted data. Serrano has been successfully deployed in a number of projects, where it provided competitive results.

Where to get it

Serrano is open source and freely available. Feedback from users is welcome.

Links:

Contact email:

tomasnovella<at>gmail.com

Research

Research group at the department:

XML and Web Technologies Research Group

Publications:

  • Novella T., Holubová I.: User-friendly and Extensible Web Data Extraction, in Proceedings of the 26th International Conference on Information Systems Development, Larnaca, Cyprus, AIS Electronic Library, ISBN: 978-9963-2288-3-6, pp. 1-12, 2017
This page is licensed under the Creative Commons Attribution-NonCommercial 3.0 Czech Republic license.