Batch edit in OpenRefine -


so, have bunch of .csv files need cleaning. need go through same steps, i've extracted openrefine's operation history in order apply other ones.

i open each file 1 one in openrefine , apply extracted json history. there lot of files...

also, don't have enough memory open them @ once in openrefine (multiple selecting when opening files).

is there way edit them or automatically using json extracted openrefine?

that's created batchrefine for, readme should pretty self-explanatory. if not, let me know.

i converted 4 million csv records rdf using batchrefine, took me less 10 minutes on macbook pro.

i execute batchrefine simple shell script:

#!/bin/bash  file in ./input/*.tsv   filename=$(basename "$file")    if [ ! -f "target/"$filename"-transformed" ]       echo processing $filename...     curl -xpost -h 'accept: text/turtle' -h 'content-type:text/csv' --data-binary "@"$file -o "target/"$filename"-transformed" 'localhost:8310/?refinejson=http://localhost:8000/bar-config.json'    else     echo found "target/"$filename"-transformed", skipping $file   fi done; 

note need adjust acceptheader in script, guess want csv output again, not rdf.


Comments

Popular posts from this blog

How to understand 2 main() functions after using uftrace to profile the C++ program? -

c# - Update a combobox from a presenter (MVP) -

How to put a lock and transaction on table using spring 4 or above using jdbcTemplate and annotations like @Transactional? -