The CSV MetaData Editor is an user interface that allows the assisted generation of metadata for a CSV. The JSON output is compliant to the W3C standard for CSV on the Web (CSVW). The metadata can describe CSV specifics such as the delimiter, the encoding and the quotation character. It also describes the columns and […]

Read more

The CSV Profiler analyses the input CSV and provides basic informations and metrics: File encoding and delimiter of the input CSV. The header values of the table. Completeness metric: the data-field completeness (i.e. non-empty metric) for (i) all cells, (ii) the columns, and (iii) the headers. The number of distinct values per column. Simple datatype […]

Read more

Lots of the data to be published as Linked Data are in the form of tabular data. Nevertheless, in order to semantically interpret such tabular data and publish them as quality Linked Data, non-trivial effort is needed in terms of linkage of the data to existing Linked Data resources and and in terms of selecting […]

Read more

he CSV Clean Service is a tool to automatically clean-up CSV data sources. It is able to parse and automatically detect the encoding of the input CSV and to determine several types of delimiter derivations, such as tab- or semicolon-separated-value files. The cleaned file is UTF-8 encoded and an RFC 4180 compliant CSV document. RFC […]

Read more

UnifiedViews is an open source Extract-Transform-Load (ETL) framework that allows users – publishers, consumers, or analysts – to define, execute, monitor, debug, schedule, and share RDF data processing tasks. The data processing tasks may use custom plugins created by users. UnifiedViews differs from other ETL frameworks by natively supporting RDF data and ontologies. UnifiedViews has […]

Read more