NOTE: It is important to reset all facets on the column you wish to work with before beginning this process.
- Go to the column you would like to split and click the arrow button on the column header.
- Select the “Edit cells” option and choose “Split multi-valued cells.”
- In the pop-up window, type in the separator (often a semi-colon, comma, or some other special character), and select “OK.”
- Cells with the values indicated as separators will be split into different rows based on those separators.
NOTE: The new rows will still be part of the same record. For more information on rows and records see the Layout section
In OpenRefine 3.3, the pop-up window allows for several options and control when splitting cells. The options are:
- “By separator” allows you to input the separator value (often a semi-colon, comma, or other special character).
- “By field lengths” allows you to input a string of integers to indicate where in the original column value you would like OpenRefine to split the values. For example:
- A column contains dates in the YYYY/MM/DD format. You wish to divide this into three columns (Year, Month, Day). Under “by field lengths,” use the list “4, 1, 2, 1, 2” indicating “year, slash, month, slash, day.” The result will be five columns, two with only slashes which you can remove, and three with the Year, Month, and Day respectively
Helpful Tips:
- This method of splitting cell values is most useful for data you wish to remain as one column, such as multiple authors, for future faceting or transformation purposes.
- There are many characters and strings that can be used as "separators," including: spaces, semi-colons, pipes, slashes, commas, dashes, and full words.