4.6.4 Reduce Duplicated Rows


Origin has a tool for removing rows in a worksheet based on duplication in one or more columns. When multiple columns are selected, all values within a row must match to be considered as duplicates. You can opt to specify a tolerance value.

To open this dialog box:

  1. Activate the workbook.
  2. Click Worksheet: Remove/Combine Duplicated Rows... and open the wdeldup dialog box which utilizes the wdeldup x-function.
Notes: To remove duplicated values in a column, use function unique.

Dialog Options

Recalculate

Controls recalculation of analysis results

  • None
  • Auto
  • Manual

For more information, see: Recalculating Analysis Results

Reference Column

The source column(s) which contain the duplicated data. If multiple columns has been selected, only the rows contains duplicate values in all selected columns will be reduced.

Merge Duplications by

Specify how to reduce worksheet with duplicated rows. Option list:

  • Remove All
    Remove all duplicated rows including first duplicated row
  • Keep First Row
    Keep first duplicated row and remove all other duplicated rows
  • Keep Last Row
    Keep last duplicated row and remove all other duplicated rows
  • Average
    Reduce other columns in worksheet by their average value according to the duplicated rows in Reference Column
  • Min
    Reduce other columns in worksheet by the minimum value according to the duplicated rows in Reference Column
  • Max
    Reduce other columns in worksheet by the maximum value according to the duplicated rows in Reference Column
  • Sum
    Reduce other columns in worksheet by their sum according to the duplicated rows in Reference Column
  • SD
    Reduce other columns in worksheet by their standard deviation according to the duplicated rows in Reference Column
Case Sensitive

Specify whether to be case sensitive in comparing two strings.

Duplicate Tolerance

Check the box and provide a tolerance value to treat close values as duplicates.

Output Worksheet

Specify output worksheet.

For help with the range controls, see: Output Results

Output Counts

Count and output the number of duplicate rows.