Batch edit in OpenRefine
So, I have a bunch of .csv files that need cleaning. They all need to go through the same steps, so I've extracted OpenRefine's operation history in order to apply it to the other ones.
I could open each file one by one in OpenRefine and apply the extracted JSON history, but there are a lot of files...
Also, I don't have enough memory to open them all at once in OpenRefine (by selecting multiple files when opening).
Is there a way to edit them in batch or automatically using the JSON extracted from OpenRefine?
That's what I created BatchRefine for; the README should be pretty self-explanatory. If not, let me know.
I converted 4 million CSV records to RDF using BatchRefine, and it took me less than 10 minutes on a MacBook Pro.
I execute BatchRefine with a simple shell script:
#!/bin/bash
# Send every input file to BatchRefine, skipping files already transformed.
for file in ./input/*.tsv
do
  filename=$(basename "$file")
  if [ ! -f "target/${filename}-transformed" ]
  then
    echo "processing $filename..."
    curl -X POST -H 'Accept: text/turtle' -H 'Content-Type: text/csv' --data-binary "@$file" -o "target/${filename}-transformed" 'localhost:8310/?refinejson=http://localhost:8000/bar-config.json'
  else
    echo "found target/${filename}-transformed, skipping $file"
  fi
done

Note that you need to adjust the Accept header in the script; I guess you want CSV output again, not RDF.
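If you want CSV back rather than RDF, the same loop could be wrapped in a few functions, roughly as sketched here. This is only a sketch under assumptions: the `text/csv` Accept value is my guess at what the CSV output type is called (check the BatchRefine README), the function names and the `CURL` override are made up for illustration, and the glob is `*.csv` to match the files in the question. The endpoint and config URL are the ones from the script above.

```shell
#!/bin/bash
# Sketch: run every .csv file in a directory through BatchRefine and ask for CSV back.
# Assumptions (not from the original script): the text/csv Accept value,
# the function names, and the CURL variable used to swap in a stub for dry runs.

ENDPOINT='localhost:8310/?refinejson=http://localhost:8000/bar-config.json'
ACCEPT='text/csv'        # assumed media type for CSV output
CURL="${CURL:-curl}"     # set CURL to another command to avoid real HTTP calls

# Map an input path to its output path: ./input/a.csv -> target/a.csv-transformed
output_path() {
  printf 'target/%s-transformed' "$(basename "$1")"
}

# Transform one file unless its output already exists.
transform_file() {
  local file="$1" out
  out="$(output_path "$file")"
  if [ -f "$out" ]; then
    echo "found $out, skipping $file"
    return 0
  fi
  echo "processing $file..."
  "$CURL" -X POST \
    -H "Accept: $ACCEPT" \
    -H 'Content-Type: text/csv' \
    --data-binary "@$file" \
    -o "$out" \
    "$ENDPOINT"
}

# Transform every .csv file found directly inside the given directory.
transform_dir() {
  local dir="$1" file
  mkdir -p target
  for file in "$dir"/*.csv; do
    [ -e "$file" ] || continue   # the glob matched nothing
    transform_file "$file"
  done
}
```

To use it, source the file (or paste the functions into your script) and call `transform_dir ./input` while BatchRefine is running. Because existing outputs are skipped, you can rerun it after an interruption and it picks up where it stopped.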