org.pentaho.di.core.exception.KettleStepException: Error while processing The schema cannot be created by a org.pentaho.hdfs.vfs.HDFSFileObject

Pentaho BI Platform Tracking | Hemal Govind | 2 years ago
  1. 0

    I am implementing a Pentaho MapReduce application. In the Mapper transformation, the customer needs to validate an XML file. The XSD file used to validate the XML is stored in HDFS. In the XSD Validator step, I set the following fields:

    XSD Source = is a file, filename is defined in a field
    XSD filename field = xsd_file_url

    where xsd_file_url = hdfs://my.namenode:8020/test/hbase_mr_xml/xsd/Car_v1.xsd

    I get the following error:

    2015/03/25 22:10:28 - XSD Validator.0 - ERROR (version 5.3.0.0-213, build 1 from 2015-02-02_12-17-08 by buildguy) : Unexpected error
    2015/03/25 22:10:28 - XSD Validator.0 - ERROR (version 5.3.0.0-213, build 1 from 2015-02-02_12-17-08 by buildguy) : org.pentaho.di.core.exception.KettleStepException:
    2015/03/25 22:10:28 - XSD Validator.0 - Error while processing
    2015/03/25 22:10:28 - XSD Validator.0 -
    2015/03/25 22:10:28 - XSD Validator.0 - The schema cannot be created by a org.pentaho.hdfs.vfs.HDFSFileObject
    2015/03/25 22:10:28 - XSD Validator.0 -
    2015/03/25 22:10:28 - XSD Validator.0 -
    2015/03/25 22:10:28 - XSD Validator.0 - at org.pentaho.di.trans.steps.xsdvalidator.XsdValidator.processRow(XsdValidator.java:303)
    2015/03/25 22:10:28 - XSD Validator.0 - at org.pentaho.di.trans.step.RunThread.run(RunThread.java:62)
    2015/03/25 22:10:28 - XSD Validator.0 - at java.lang.Thread.run(Unknown Source)
    2015/03/25 22:10:28 - XSD Validator.0 - Caused by: org.pentaho.di.core.exception.KettleStepException:
    2015/03/25 22:10:28 - XSD Validator.0 - The schema cannot be created by a org.pentaho.hdfs.vfs.HDFSFileObject
    2015/03/25 22:10:28 - XSD Validator.0 -
    2015/03/25 22:10:28 - XSD Validator.0 - at org.pentaho.di.trans.steps.xsdvalidator.XsdValidator.processRow(XsdValidator.java:226)
    2015/03/25 22:10:28 - XSD Validator.0 - ... 2 more

    Reproduction steps:
    1. Download the attached KTR, XML, and XSD files.
    2. Copy the XSD file into an HDFS directory.
    3. Open the transformation. Update the file references for the XML and XSD files.
    4. Execute the transformation.

    Expected Results: The transformation runs successfully, and the XML file is reported as "Valid".
    Actual Results: The transformation ends with an error (see above).

    Pentaho BI Platform Tracking | 2 years ago | Hemal Govind
    org.pentaho.di.core.exception.KettleStepException: Error while processing The schema cannot be created by a org.pentaho.hdfs.vfs.HDFSFileObject
  3. 0

    Pentaho - Transformation is killing the other steps

    Stack Overflow | 1 year ago | Rahul Nadkarni
    org.pentaho.di.core.exception.KettleStepException: Error while running the step Unexpected conversion error while converting value [flag_SALES_WEEK Integer] to an Integer java.lang.String cannot be cast to java.lang.Long
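
The message above is a plain Java `ClassCastException`: Kettle stores Integer field values as `java.lang.Long`, so a row whose metadata says Integer but whose value is actually a `String` fails on the cast. The following is a minimal sketch of that failure and of an explicit conversion that avoids it. The class and method names are hypothetical and are not part of the Kettle API; in a transformation, the usual remedy is to convert the field type in an earlier step.

```java
final class IntegerFieldCast {

    /**
     * Hypothetical helper: converts a loosely typed row value to the Long
     * that an Integer field is expected to hold.
     * Throws NumberFormatException if a String value is not a whole number.
     */
    static Long toIntegerField(Object value) {
        if (value == null) {
            return null;
        }
        if (value instanceof Long) {
            return (Long) value;
        }
        if (value instanceof Number) {
            return ((Number) value).longValue();
        }
        // Strings (and anything else) are parsed rather than cast.
        return Long.valueOf(value.toString().trim());
    }

    public static void main(String[] args) {
        Object raw = "12"; // a String sitting in a field declared as Integer

        try {
            Long broken = (Long) raw; // the cast that the error message reports
            System.out.println(broken);
        } catch (ClassCastException e) {
            System.out.println("Cast failed: " + e.getMessage());
        }

        System.out.println("Converted: " + toIntegerField(raw));
    }
}
```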
  5. 0

    When creating a Regex Evaluation step with a regular expression like .*XXX(140110|145250)XXX.* and not checking 'Create fields for capture groups' and also not specifying any 'Capture Group Fields', the transformation stops with an error like this:

    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : The number of capture groups in the regular expression (3) does not match the number of fields specified (0)!
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : Unexpected error
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : org.pentaho.di.core.exception.KettleStepException:
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : Error in step
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) :
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : The number of capture groups in the regular expression (3) does not match the number of fields specified (0)!
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) :
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) :
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : at org.pentaho.di.trans.steps.regexeval.RegexEval.processRow(RegexEval.java:208)
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : at org.pentaho.di.trans.step.RunThread.run(RunThread.java:40)
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : at java.lang.Thread.run(Thread.java:680)
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : Caused by: org.pentaho.di.core.exception.KettleStepException:
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : The number of capture groups in the regular expression (3) does not match the number of fields specified (0)!
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) :
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : at org.pentaho.di.trans.steps.regexeval.RegexEval.processRow(RegexEval.java:161)
    Regex Evaluation.0 - ERROR (version 4.2.0-stable, build 15748 from 2011-09-08 13.11.42 by buildguy) : ... 2 more

    Pentaho BI Platform Tracking | 5 years ago | Axel Christ
    org.pentaho.di.core.exception.KettleStepException: Error in step The number of capture groups in the regular expression (3) does not match the number of fields specified (0)!
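
The error above compares the number of capture groups in the pattern with the number of configured fields. The sketch below shows how such a count can be obtained with `java.util.regex` and how a non-capturing group `(?:...)` keeps alternation without adding a group. It is an illustration only: the class and method names are hypothetical, and it is not the RegexEval step's actual code. Note that the JDK counts one capturing group in the pattern as quoted in the report, while the log reports 3; the report does not explain the difference.

```java
import java.util.regex.Pattern;

final class CaptureGroupCheck {

    /** Number of capturing groups in the pattern, as counted by java.util.regex. */
    static int groupCount(String regex) {
        return Pattern.compile(regex).matcher("").groupCount();
    }

    /** Hypothetical check in the spirit of the error message: groups must equal fields. */
    static boolean fieldsMatchGroups(String regex, int fieldCount) {
        return groupCount(regex) == fieldCount;
    }

    public static void main(String[] args) {
        String capturing = ".*XXX(140110|145250)XXX.*";
        String nonCapturing = ".*XXX(?:140110|145250)XXX.*";

        // Both patterns match the same input strings; only the group count differs.
        System.out.println("capturing groups:     " + groupCount(capturing));
        System.out.println("non-capturing groups: " + groupCount(nonCapturing));
        System.out.println("zero fields accepted: " + fieldsMatchGroups(nonCapturing, 0));
    }
}
```

If no capture fields are wanted, rewriting the alternation as a non-capturing group is one way to make the group count zero without changing what the expression matches.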

    Root Cause Analysis

    1. org.pentaho.di.core.exception.KettleStepException

      The schema cannot be created by a org.pentaho.hdfs.vfs.HDFSFileObject

      at org.pentaho.di.trans.steps.xsdvalidator.XsdValidator.processRow()
    2. org.pentaho.di
      RunThread.run
      1. org.pentaho.di.trans.steps.xsdvalidator.XsdValidator.processRow(XsdValidator.java:226)
      2. org.pentaho.di.trans.step.RunThread.run(RunThread.java:62)
      2 frames
    3. Java RT
      Thread.run
      1. java.lang.Thread.run(Unknown Source)
      1 frame
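
The trace above ends in `XsdValidator.processRow` with "The schema cannot be created by a org.pentaho.hdfs.vfs.HDFSFileObject", which suggests the step builds the schema differently depending on the concrete file object type. The following is a minimal sketch, using only the JDK, of building a `Schema` from an `InputStream` so that the origin of the XSD (local disk, HDFS, HTTP) does not matter. The class and method names are hypothetical, and this is not the step's actual code or an official fix. With Commons VFS, such a stream would typically come from `fileObject.getContent().getInputStream()`; here the streams are in-memory strings so the example runs on its own.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import javax.xml.XMLConstants;
import javax.xml.transform.stream.StreamSource;
import javax.xml.validation.Schema;
import javax.xml.validation.SchemaFactory;
import org.xml.sax.SAXException;

final class StreamSchemaValidator {

    /** Builds a Schema from any InputStream, regardless of where the XSD is stored. */
    static Schema loadSchema(InputStream xsd) throws SAXException {
        SchemaFactory factory = SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI);
        return factory.newSchema(new StreamSource(xsd));
    }

    /** Returns true if the XML validates; false if the schema or the document is rejected. */
    static boolean isValid(InputStream xsd, InputStream xml) {
        try {
            loadSchema(xsd).newValidator().validate(new StreamSource(xml));
            return true;
        } catch (SAXException e) {
            return false;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Wraps a string as a UTF-8 stream; stands in for a stream opened from HDFS. */
    static InputStream utf8(String text) {
        return new ByteArrayInputStream(text.getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) {
        // Tiny stand-in schema; the real Car_v1.xsd from the report is not available here.
        String xsd = "<xs:schema xmlns:xs=\"http://www.w3.org/2001/XMLSchema\">"
                + "<xs:element name=\"car\" type=\"xs:string\"/>"
                + "</xs:schema>";

        System.out.println(isValid(utf8(xsd), utf8("<car>Beetle</car>")));
        System.out.println(isValid(utf8(xsd), utf8("<truck/>")));
    }
}
```

One caveat of this approach: a `StreamSource` built from a bare stream has no system ID, so relative `xs:include` or `xs:import` references inside the XSD will not resolve unless a system ID or a resource resolver is supplied.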