[英]Hive factorial UDF
I'm trying to find the factorial of a number in Hive. 我正在尝试在Hive中查找数字的阶乘。 There's currently no Hive function to do that, so I've tried to write my own.
当前没有Hive函数可以执行此操作,因此我尝试编写自己的函数。 This is my code:
这是我的代码:
package com.guy.hive.udf;
import org.apache.hadoop.hive.ql.exec.UDF;
import org.apache.hadoop.io.LongWritable;
import org.apache.commons.math3.util.ArithmeticUtils;
public final class Factorial extends UDF {
public LongWritable evaluate(final LongWritable s){
int n = (int) s.get();
int fact = (int) ArithmeticUtils.factorial(n);
return new LongWritable(fact);
}
}
When I run this Hive query: 当我运行此Hive查询时:
select factorial(c) from (select count(*) as c from test_table) ;
I get the exception: 我得到例外:
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unable to execute method public org.apache.hadoop.io.LongWritable com.vm.hive.udf.Factorial.evaluate(long) on object com.vm.hive.udf.Factorial@37483748 of class com.vm.hive.udf.Factorial with arguments {39514210:java.lang.Long} of size 1
Can anyone help with this? 有人能帮忙吗?
Stacktrace: 堆栈跟踪:
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unable to execute method public org.apache.hadoop.io.LongWritable com.vm.hive.udf.Factorial.evaluate(org.apache.hadoop.io.LongWritable) on object com.vm.hive.udf.Factorial@5faa5faa of class com.vm.hive.udf.Factorial with arguments {39514210:org.apache.hadoop.io.LongWritable} of size 1
at org.apache.hadoop.hive.ql.exec.FunctionRegistry.invoke(FunctionRegistry.java:1030)
at org.apache.hadoop.hive.ql.udf.generic.GenericUDFBridge.evaluate(GenericUDFBridge.java:181)
at org.apache.hadoop.hive.ql.exec.ExprNodeGenericFuncEvaluator._evaluate(ExprNodeGenericFuncEvaluator.java:166)
at org.apache.hadoop.hive.ql.exec.ExprNodeEvaluator.evaluate(ExprNodeEvaluator.java:77)
at org.apache.hadoop.hive.ql.exec.ExprNodeEvaluator.evaluate(ExprNodeEvaluator.java:65)
at org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(SelectOperator.java:80)
at org.apache.hadoop.hive.ql.exec.Operator.process(Operator.java:504)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:842)
at org.apache.hadoop.hive.ql.exec.GroupByOperator.forward(GroupByOperator.java:1052)
at org.apache.hadoop.hive.ql.exec.GroupByOperator.flush(GroupByOperator.java:1077)
... 10 more
Caused by: java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:60)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:37)
at java.lang.reflect.Method.invoke(Method.java:611)
at org.apache.hadoop.hive.ql.exec.FunctionRegistry.invoke(FunctionRegistry.java:1006)
... 19 more
Caused by: org.apache.commons.math3.exception.MathArithmeticException: arithmetic exception
at org.apache.commons.math3.util.ArithmeticUtils.factorial(ArithmeticUtils.java:317)
at com.vm.hive.udf.Factorial.evaluate(Factorial.java:50)
... 24 more
[EDIT 1 - added imports to Java code.] [编辑1-将导入添加到Java代码中。]
[EDIT 2 - added StackTrace [编辑2-添加了StackTrace
I see your problem. 我明白你的问题。 The problem is not in Hive but in the ArithmeticUtils
factorial
method. 问题不在于Hive,而在于ArithmeticUtils
factorial
方法。 See that it throws a MathArithmeticException
? 看到它抛出了
MathArithmeticException
吗? According to the docs, this will come when "the result is too large to be represented by a long." 根据文档,当“结果太大而无法用长整数表示”时,就会出现这种情况。
This must be what's happening in your case. 这一定是您的情况。 Try passing in a smaller number to the method.
尝试将较小的数字传递给该方法。
Also, note that the factorial
method is deprecated. 另外,请注意不推荐使用
factorial
方法。 The docs recommend using the CombinatoricsUtils.factorialLog(int) method instead. 文档建议改为使用CombinatoricsUtils.factorialLog(int)方法。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.