java.lang.IllegalArgumentException


  • from h2o stream: https://groups.google.com/forum/#!topic/h2ostream/ZrGos7sLgCE

      pFrames <- h2o.splitFrame(rawFrame, ratios = c(0.7, 0.15))
      pFrames.train <- pFrames[[1]]
      pFrames.test <- pFrames[[2]]
      pFrames.val <- pFrames[[3]]
      model.1 <- h2o.glrm(pFrames.train, k = 2, validation_frame = pFrames.val, transform = "NORMALIZE", multi_loss = "Categorical", init = "Random", seed = 123456)
      |============== | 17%
      Got exception 'class java.lang.IllegalArgumentException', with msg 'Can not make vectors of different length compatible!'
      java.lang.IllegalArgumentException: Can not make vectors of different length compatible!
        at water.fvec.Frame.makeCompatible(Frame.java:1275)
        at water.fvec.Frame.bulkAdd(Frame.java:529)
        at water.fvec.Frame.add(Frame.java:514)
        at water.fvec.Frame.add(Frame.java:560)
        at hex.glrm.GLRMModel.scoreMetricsOnly(GLRMModel.java:615)
        at hex.glrm.GLRM$GLRMDriver.compute2(GLRM.java:711)
        at water.H2O$H2OCountedCompleter.compute(H2O.java:1069)
        at jsr166y.CountedCompleter.exec(CountedCompleter.java:468)
        at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
        at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
        at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
        at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
      Error: 'Can not make vectors of different length compatible!

      h2o.clusterInfo()
      R is connected to the H2O cluster:
        H2O cluster uptime: 7 days 3 hours
        H2O cluster version: 3.7.0.3256
        H2O cluster name: H2ODemo
        H2O cluster total nodes: 3
        H2O cluster total memory: 71.91 GB
        H2O cluster total cores: 12
        H2O cluster allowed cores: 12
        H2O cluster healthy: TRUE

    by Spencer Aiello
  • This test's goal: doing a bunch of GBMs, cancelling them and then reusing the key (I think), but it never gets to where it cancels. It fails after kicking off 3 GBMs (they have different destination key names). The datasets are the synthetic manyfiles gz, just one, no header. I don't put C1 in the header, so somehow h2o is thinking there's a duplicate C1 in the header. An exec has been done on the response col (378+1) to make it binomial; I think maybe the exec is causing the problem? Actually the commands.log looks like it's kicking off 3 GBMs and then failing. The destination keys are all unique.

    Artifacts here: http://test.0xdata.com/view/Nightly%20Multi%20Local%20VM/job/h2o.multi.vm.temporary/lastSuccessfulBuild/artifact/test-reports/sandbox_test_GBM_cancel_model_reuse.zip

      #***********************
      python_test_name: /var/lib/jenkins/jobs/h2o.multi.vm.temporary/workspace/py/testdir_multi_jvm/test_GBM_cancel_model_reuse.py
      #***********************
      2014-04-27 05:01:39.521977 -- Start http://10.137.71.117:54323/ImportFiles.json?path=/home/0xdiag/home-0xdiag-datasets/manyfiles-nflx-gz
      2014-04-27 05:01:40.488633 -- Start http://10.137.71.117:54323/Parse.json?destination_key=c.hex&source_key=nfs://home/0xdiag/home-0xdiag-datasets/manyfiles-nflx-gz/file_1.dat.gz
      2014-04-27 05:01:54.665547 -- Start http://10.137.71.117:54323/2/SummaryPage2.json?source=c.hex&cols=378&max_ncols=1
      2014-04-27 05:02:02.063284 -- Start http://10.137.71.117:5432/2/GBM.json?learn_rate=0.1&destination_key=GBMBad0&classification=0&min_rows=1&ntrees=2&response=C379&ignored_cols_by_name=C4,C5,C6,C7,C8,C9,C10,C11,C12,C15,C17,C18,C19,C20,C21,C425,C426,C427,C541,C542,C379&source=c.hex&grid_parallelism=4&max_depth=8
      2014-04-27 05:02:04.336475 -- Start http://10.137.71.117:54323/2/GBM.json?learn_rate=0.1&destination_key=GBMBad1&classification=0&min_rows=1&ntrees=2&response=C379&ignored_cols_by_name=C4,C5,C6,C7,C8,C9,C10,C11,C12,C15,C17,C18,C19,C20,C21,C425,C426,C427,C541,C542,C379&source=c.hex&grid_parallelism=4&max_depth=8
      2014-04-27 05:02:04.373612 -- Start http://10.137.71.117:54323/2/GBM.json?learn_rate=0.1&destination_key=GBMBad2&classification=0&min_rows=1&ntrees=2&response=C379&ignored_cols_by_name=C4,C5,C6,C7,C8,C9,C10,C11,C12,C15,C17,C18,C19,C20,C21,C425,C426,C427,C541,C542,C379&source=c.hex&grid_parallelism=4&max_depth=8

      java.lang.IllegalArgumentException: Duplicate name 'C1' in Frame
      05:02:04.389 FJ-0-5 INFO WATER: at water.fvec.Frame.add(Frame.java:167)
      05:02:04.389 FJ-0-5 INFO WATER: at hex.gbm.SharedTreeModelBuilder.buildModel(SharedTreeModelBuilder.java:136)
      05:02:04.389 FJ-0-5 INFO WATER: at hex.gbm.GBM.execImpl(GBM.java:139)
      05:02:04.389 FJ-0-5 INFO WATER: at water.Func.exec(Func.java:42)
      05:02:04.390 FJ-0-5 INFO WATER: at water.Job$3.compute2(Job.java:328)
      05:02:04.390 FJ-0-5 INFO WATER: at water.H2O$H2OCountedCompleter.compute(H2O.java:731)
      05:02:04.391 FJ-0-5 INFO WATER: at jsr166y.CountedCompleter.exec(CountedCompleter.java:429)
      05:02:04.391 FJ-0-5 INFO WATER: at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
      05:02:04.391 FJ-0-5 INFO WATER: at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
      05:02:04.392 FJ-0-5 INFO WATER: at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
      05:02:04.392 FJ-0-5 INFO WATER: at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
      05:02:04.387 FJ-0-5 ERRR WATER:
      + java.lang.IllegalArgumentException: Duplicate name 'C1' in Frame
      + at water.fvec.Frame.add(Frame.java:167)
      + at hex.gbm.SharedTreeModelBuilder.buildModel(SharedTreeModelBuilder.java:136)
      + at hex.gbm.GBM.execImpl(GBM.java:139)
      + at water.Func.exec(Func.java:42)
      + at water.Job$3.compute2(Job.java:328)
      + at water.H2O$H2OCountedCompleter.compute(H2O.java:731)
      + at jsr166y.CountedCompleter.exec(CountedCompleter.java:429)
      + at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
      + at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
      + at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
      + at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
      05:02:04.407 FJ-0-13 INFO WATER: Building H2O GBM model with these parameters:
      05:02:04.409 FJ-0-13 INFO WATER: {
      05:02:04.410 FJ-0-13 INFO WATER: "job_key": "$03010a89477534d4ffffffff$_bc4267305bc3914e18d853f5cb714234",
      05:02:04.410 FJ-0-13 INFO WATER: "destination_key": "GBMBad2",
      05:02:04.410 FJ-0-13 INFO WATER: "description": "Distributed GBM",
      05:02:04.410 FJ-0-13 INFO WATER: "start_time": 1398600124403,
      05:02:04.410 FJ-0-13 INFO WATER: "end_time": 0,
      05:02:04.410 FJ-0-13 INFO WATER: "state": "RUNNING",
      05:02:04.411 FJ-0-13 INFO WATER: "source": {
      05:02:04.411 FJ-0-13 INFO WATER: "_key": "c.hex.autoframe",
      05:02:04.411 FJ-0-13 INFO WATER: "num_cols": 542,
      05:02:04.411 FJ-0-13 INFO WATER: "num_rows": 100000,
      05:02:04.411 FJ-0-13 INFO WATER: "ignored_cols": "3,4,5,6,7,8,9,10,11,14,16,17,18,19,20,424,425,426,540,541,378",
      05:02:04.411 FJ-0-13 INFO WATER: "num_used_cols": 521
      05:02:04.412 FJ-0-13 INFO WATER: },
      05:02:04.412 FJ-0-13 INFO WATER: "response": {
      05:02:04.412 FJ-0-13 INFO WATER: "name": "C379"
      05:02:04.412 FJ-0-13 INFO WATER: },
      05:02:04.412 FJ-0-13 INFO WATER: "classification": "false",
      05:02:04.412 FJ-0-13 INFO WATER: "ntrees": 2,
      05:02:04.413 FJ-0-13 INFO WATER: "max_depth": 8,
      05:02:04.413 FJ-0-13 INFO WATER: "min_rows": 1,
      05:02:04.413 FJ-0-13 INFO WATER: "nbins": 20,
      05:02:04.413 FJ-0-13 INFO WATER: "score_each_iteration": "false",
      05:02:04.413 FJ-0-13 INFO WATER: "learn_rate": 0.1
      05:02:04.413 FJ-0-13 INFO WATER: }

    by Kevin Normoyle
  • I am running into a lot of errors using a small but wide dataset on an 8GB H2O cluster. I don't think memory should be a problem, but perhaps that's the issue, or perhaps it's something else. Three types of errors:

    {code}
    library(h2o)
    h2o.init(nthreads = -1, max_mem_size = "8G")

    file <- "http://www.stat.berkeley.edu/~ledell/data/rotterdam.csv.gz"
    df <- h2o.importFile(file)
    dim(df)  # 286 22284

    y <- "relapse"
    x <- setdiff(names(df), y)
    df[,y] <- as.factor(df[,y])  #Convert to factor (for binary classification)

    splits <- h2o.splitFrame(df, seed = 1)
    train <- splits[[1]]
    test <- splits[[2]]
    print(dim(train))
    print(dim(test))

    # Does not work:
    # Train a default PCA
    h2o_pca <- h2o.prcomp(train, k = 8, x = x)
    #Error: java.lang.IllegalArgumentException: Found validation errors: ERRR on field: _train: Gram matrices (one per thread) won't fit in the driver node's memory (59.19 GB > 6.93 GB) - try reducing the number of columns and/or the number of categorical factors.
    # Also kills the H2O cluster!

    # Try again with Power method instead, but this errors out and kills the cluster!
    # Train a PCA model using 20 principal components.
    h2o_pca20 <- h2o.prcomp(train, x = x, k = 20, transform = "STANDARDIZE", pca_method = "Power", use_all_factor_levels = TRUE, seed = 1)
    #ERROR: Unexpected HTTP Status code: 500 Server Error (url = http://localhost:54321/3/Jobs/$03017f00000132d4ffffffff$_813316ba1fc0d17fc26ffd94f7e24d85)
    #Error: lexical error: invalid char in json text.
    #<html> <head> <meta http-equiv=
    # (right here) ------^

    #Train a PCA model using 20 principal components.
    h2o_pca20 <- h2o.prcomp(train, x = x, k = 20, transform = "STANDARDIZE", pca_method = "Randomized", use_all_factor_levels = TRUE, seed = 1)
    # java.lang.IllegalArgumentException: Can not make vectors of different length compatible!
    #
    # java.lang.IllegalArgumentException: Can not make vectors of different length compatible!
    # at water.fvec.Frame.makeCompatible(Frame.java:1391)
    # at water.fvec.Frame.makeCompatible(Frame.java:1379)
    # at water.fvec.Frame.bulkAdd(Frame.java:525)
    # at water.fvec.Frame.add(Frame.java:510)
    # at water.fvec.Frame.add(Frame.java:564)
    # at hex.svd.SVD$SVDDriver.randSubIter(SVD.java:210)
    # at hex.svd.SVD$SVDDriver.computeImpl(SVD.java:455)
    # at hex.ModelBuilder$Driver.compute2(ModelBuilder.java:169)
    # at hex.ModelBuilder.trainModelNested(ModelBuilder.java:225)
    # at hex.pca.PCA$PCADriver.computeImpl(PCA.java:247)
    # at hex.ModelBuilder$Driver.compute2(ModelBuilder.java:169)
    # at water.H2O$H2OCountedCompleter.compute(H2O.java:1206)
    # at jsr166y.CountedCompleter.exec(CountedCompleter.java:468)
    # at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
    # at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
    # at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
    # at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
    #
    # Error: java.lang.IllegalArgumentException: Can not make vectors of different length compatible!
    {code}

    The only method that works here is GLRM:

    {code}
    h2o_pca20 <- h2o.prcomp(train,
    + x = x, k = 20,
    + transform = "STANDARDIZE",
    + pca_method = "GLRM",
    + use_all_factor_levels = TRUE,
    + seed = 1)
    |=======================================================================| 100%
    >
    {code}

    by Erin LeDell
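
The "Gram matrices (one per thread) won't fit" message in the report above can be sanity-checked with a little arithmetic. The sketch below is only a rough estimate, not H2O's own calculation: it assumes each Gram matrix is a dense p-by-p array of 8-byte doubles, that p is the 22283 predictor columns (22284 columns minus the response), and that the reported "GB" are binary gigabytes. The thread count of 16 is a guess, since the report only says `nthreads = -1`. The class and method names are hypothetical.

```java
// Rough memory estimate for dense Gram matrices; an illustrative sketch only.
final class GramEstimate {
    /** Bytes needed for one dense p-by-p matrix of 8-byte doubles. */
    static long gramBytes(long p) {
        return p * p * 8L;
    }

    /** Converts a byte count to binary gigabytes (2^30 bytes). */
    static double toGiB(long bytes) {
        return bytes / (double) (1L << 30);
    }

    public static void main(String[] args) {
        long predictors = 22283; // 22284 columns in the report minus the response "relapse"
        int copies = 16;         // assumed thread count; not stated in the report
        long one = gramBytes(predictors);
        System.out.printf("one Gram matrix: %.2f GiB%n", toGiB(one));
        System.out.printf("%d copies: %.2f GiB%n", copies, toGiB(one * copies));
    }
}
```

Under these assumptions one matrix needs about 3.7 GiB and sixteen copies about 59.19 GiB, which is consistent with the figure in the error message and far above the 6.93 GB available, so the column count rather than the 286 rows appears to drive the failure.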
    • java.lang.IllegalArgumentException: Can not make vectors of different length compatible!
        at water.fvec.Frame.makeCompatible(Frame.java:1275)
        at water.fvec.Frame.bulkAdd(Frame.java:529)
        at water.fvec.Frame.add(Frame.java:514)
        at water.fvec.Frame.add(Frame.java:560)
        at hex.glrm.GLRMModel.scoreMetricsOnly(GLRMModel.java:615)
        at hex.glrm.GLRM$GLRMDriver.compute2(GLRM.java:711)
        at water.H2O$H2OCountedCompleter.compute(H2O.java:1069)
        at jsr166y.CountedCompleter.exec(CountedCompleter.java:468)
        at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
        at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
        at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
        at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)
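
Both messages in these reports ("Duplicate name 'C1' in Frame" and "Can not make vectors of different length compatible!") are thrown from `water.fvec.Frame.add`, which suggests argument checks made when a column is appended to a frame. The sketch below is not H2O's code; `MiniFrame` is a hypothetical stand-in that only illustrates how such checks typically raise `java.lang.IllegalArgumentException`, reusing the message texts from the traces for readability.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical miniature of a column store; NOT the real water.fvec.Frame.
final class MiniFrame {
    private final Map<String, double[]> cols = new LinkedHashMap<>();
    private int numRows = -1; // unknown until the first column is added

    /** Appends a column, rejecting duplicate names and mismatched lengths. */
    void add(String name, double[] values) {
        if (cols.containsKey(name)) {
            throw new IllegalArgumentException("Duplicate name '" + name + "' in Frame");
        }
        if (numRows >= 0 && values.length != numRows) {
            throw new IllegalArgumentException(
                "Can not make vectors of different length compatible!");
        }
        cols.put(name, values);
        numRows = values.length;
    }

    int numCols() {
        return cols.size();
    }

    public static void main(String[] args) {
        MiniFrame frame = new MiniFrame();
        frame.add("C1", new double[] {1, 2, 3});
        try {
            frame.add("C1", new double[] {4, 5, 6}); // same name again
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
        try {
            frame.add("C2", new double[] {4, 5}); // two rows instead of three
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Read this way, the GLRM and randomized PCA failures would mean a column with a different row count was being attached to an existing frame, and the GBM failure would mean a column named C1 was added to a frame that already had one. Whether that is the actual root cause in H2O is not established by the reports.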