lisa.datautils.series_deduplicate#

lisa.datautils.series_deduplicate(series, keep, consecutives)[source]#

Remove duplicate values in a pandas.Series.

Parameters:
  • keep (str) – Keep the first occurrences if first, or the last if last.

  • consecutives (bool) –

    If True, will only remove consecutive duplicates, for example:

    s = pd.Series([1,2,2,3,4,2], index=[1,2,20,30,40,50])
    s2 = series_deduplicate(s, keep='first', consecutives=True)
    assert (s2 == [1,2,3,4,2]).all()
    
    s3 = series_deduplicate(s, keep='first', consecutives=False)
    assert (s3 == [1,2,3,4]).all()