water.exceptions.H2OParseSetupException: Column types do not match between files.

JIRA | Eric Eckstrand | 2 years ago
  1. 0

    Multi-file parse of: bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2013.csv bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2014.csv gives: /3/ParseSetup, route: /3/ParseSetup, parms: {source_keys=["nfs://Users/ece/0xdata/h2o-dev/./bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2013.csv","nfs://Users/ece/0xdata/h2o-dev/./bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2014.csv"]} onExCompletion for water.parser.ParseSetup$GuessSetupTsk@3759c15 water.exceptions.H2OParseSetupException: Column types do not match between files. at water.parser.ParseSetup$GuessSetupTsk.unifyColumnTypes(ParseSetup.java:414) at water.parser.ParseSetup$GuessSetupTsk.reduce(ParseSetup.java:345) at water.parser.ParseSetup$GuessSetupTsk.reduce(ParseSetup.java:255) at water.MRTask.reduce4(MRTask.java:527) at water.MRTask.reduce2(MRTask.java:463) at water.MRTask.onCompletion(MRTask.java:450) at jsr166y.CountedCompleter.tryComplete(CountedCompleter.java:386) at water.MRTask.compute2(MRTask.java:439) at water.H2O$H2OCountedCompleter.compute(H2O.java:682) at jsr166y.CountedCompleter.exec(CountedCompleter.java:429) --------------------------------------------------------------------------------------------------------------------------------------------------------- from spencer: guesses wrong column types: Python call: wthr1 = h2o.import_frame(path=[h2o.locate("bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2013.csv"), h2o.locate("bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2014.csv")]) Case is when one column is all NA and the other has a single factor level. Gets a numerc v. enum type mismatch

    JIRA | 2 years ago | Eric Eckstrand
    water.exceptions.H2OParseSetupException: Column types do not match between files.
  2. 0

    Multi-file parse of: bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2013.csv bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2014.csv gives: /3/ParseSetup, route: /3/ParseSetup, parms: {source_keys=["nfs://Users/ece/0xdata/h2o-dev/./bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2013.csv","nfs://Users/ece/0xdata/h2o-dev/./bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2014.csv"]} onExCompletion for water.parser.ParseSetup$GuessSetupTsk@3759c15 water.exceptions.H2OParseSetupException: Column types do not match between files. at water.parser.ParseSetup$GuessSetupTsk.unifyColumnTypes(ParseSetup.java:414) at water.parser.ParseSetup$GuessSetupTsk.reduce(ParseSetup.java:345) at water.parser.ParseSetup$GuessSetupTsk.reduce(ParseSetup.java:255) at water.MRTask.reduce4(MRTask.java:527) at water.MRTask.reduce2(MRTask.java:463) at water.MRTask.onCompletion(MRTask.java:450) at jsr166y.CountedCompleter.tryComplete(CountedCompleter.java:386) at water.MRTask.compute2(MRTask.java:439) at water.H2O$H2OCountedCompleter.compute(H2O.java:682) at jsr166y.CountedCompleter.exec(CountedCompleter.java:429) --------------------------------------------------------------------------------------------------------------------------------------------------------- from spencer: guesses wrong column types: Python call: wthr1 = h2o.import_frame(path=[h2o.locate("bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2013.csv"), h2o.locate("bigdata/laptop/citibike-nyc/31081_New_York_City__Hourly_2014.csv")]) Case is when one column is all NA and the other has a single factor level. Gets a numerc v. enum type mismatch

    JIRA | 2 years ago | Eric Eckstrand
    water.exceptions.H2OParseSetupException: Column types do not match between files.

    Root Cause Analysis

    1. water.exceptions.H2OParseSetupException

      Column types do not match between files.

      at water.parser.ParseSetup$GuessSetupTsk.unifyColumnTypes()
    2. water.parser
      ParseSetup$GuessSetupTsk.reduce
      1. water.parser.ParseSetup$GuessSetupTsk.unifyColumnTypes(ParseSetup.java:414)
      2. water.parser.ParseSetup$GuessSetupTsk.reduce(ParseSetup.java:345)
      3. water.parser.ParseSetup$GuessSetupTsk.reduce(ParseSetup.java:255)
      3 frames
    3. water
      MRTask.onCompletion
      1. water.MRTask.reduce4(MRTask.java:527)
      2. water.MRTask.reduce2(MRTask.java:463)
      3. water.MRTask.onCompletion(MRTask.java:450)
      3 frames
    4. jsr166y
      CountedCompleter.tryComplete
      1. jsr166y.CountedCompleter.tryComplete(CountedCompleter.java:386)
      1 frame
    5. water
      H2O$H2OCountedCompleter.compute
      1. water.MRTask.compute2(MRTask.java:439)
      2. water.H2O$H2OCountedCompleter.compute(H2O.java:682)
      2 frames
    6. jsr166y
      CountedCompleter.exec
      1. jsr166y.CountedCompleter.exec(CountedCompleter.java:429)
      1 frame