FCMP

http://llvm.org/docs/LangRef.html#fcmp-instruction

LLVM IR `fcmp` Instruction

Syntax:

<result> = fcmp [fast-math flags]* <cond> <ty> <op1>, <op2>     ; yields i1 or <N x i1>:result

Overview:

The ‘fcmp’ instruction returns a boolean value or vector of boolean values based on comparison of its operands.

If the operands are floating-point scalars, then the result type is a boolean (i1).

If the operands are floating-point vectors, then the result type is a vector of boolean with the same number of elements as the operands being compared.

Arguments:

The ‘fcmp’ instruction takes three operands. The first operand is the condition code indicating the kind of comparison to perform. It is not a value, just a keyword. The possible condition codes are:

false: no comparison, always returns false
oeq: ordered and equal
ogt: ordered and greater than
oge: ordered and greater than or equal
olt: ordered and less than
ole: ordered and less than or equal
one: ordered and not equal
ord: ordered (no nans)
ueq: unordered or equal
ugt: unordered or greater than
uge: unordered or greater than or equal
ult: unordered or less than
ule: unordered or less than or equal
une: unordered or not equal
uno: unordered (either nans)
true: no comparison, always returns true

Unordered means that either operand may be a QNAN; ordered means that neither operand is a QNAN.

In short: ordered means no QNaN is present, unordered means a QNaN is present.

With a QNaN operand, XSCMPEQDP yields false.

Each of the op1 and op2 operands must be either a floating-point type or a vector of floating-point type. They must have identical types.

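PowerPC (MI)

At the ABI level this can be compared with FCMPU/FCMPO. Note that FCMPO/XSCMPODP are not used.

The result written to the CR is the same for both instructions: when either operand is a NaN (QNaN or SNaN), 0b0001 is written to the CR field in use.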

Less than:               0b1000
Greater than:            0b0100
Equal:                   0b0010
Unordered (NaN present): 0b0001
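
The encoding above can be sketched as a small Python model. The function name `compare_cr_field` and the constant names are hypothetical, chosen only for illustration; the sketch covers just the 4-bit CR field value and ignores the status flags. Python's `math.isnan` does not distinguish QNaN from SNaN, which matches the note that both kinds produce 0b0001.

```python
import math

# 4-bit CR field values, as listed in the notes above.
LT = 0b1000  # less than
GT = 0b0100  # greater than
EQ = 0b0010  # equal
UN = 0b0001  # unordered: at least one operand is a NaN


def compare_cr_field(fra: float, frb: float) -> int:
    """Return the CR field value a floating-point compare would produce.

    Illustrative model only; it is not taken from any real emulator.
    """
    if math.isnan(fra) or math.isnan(frb):
        return UN
    if fra < frb:
        return LT
    if fra > frb:
        return GT
    return EQ
```

For example, `compare_cr_field(1.0, 2.0)` gives `0b1000`, and any call with a NaN operand gives `0b0001`.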

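The updates to the flag register (FPSCR) differ:

FCMPU: sets VXSNAN when either operand is an SNaN.

FCMPO: sets VXSNAN when either operand is an SNaN and, if VE is 0, also sets VXVC.

When one operand is a QNaN and neither operand is an SNaN, FCMPO sets VXVC only (VXSNAN is not set).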
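
A minimal sketch of the flag behaviour, under stated assumptions: operands are passed as raw 64-bit IEEE 754 double bit patterns (so that signaling NaNs are not silently quieted), the helper names are hypothetical, and the QNaN-only case for FCMPO is modelled as setting VXVC alone, which is how the Power ISA describes it as far as I understand it.

```python
EXP_MASK = 0x7FF0_0000_0000_0000   # exponent bits of a double
FRAC_MASK = 0x000F_FFFF_FFFF_FFFF  # fraction bits of a double
QUIET_BIT = 0x0008_0000_0000_0000  # top fraction bit: set for a QNaN


def is_nan(bits: int) -> bool:
    return (bits & EXP_MASK) == EXP_MASK and (bits & FRAC_MASK) != 0


def is_snan(bits: int) -> bool:
    return is_nan(bits) and (bits & QUIET_BIT) == 0


def is_qnan(bits: int) -> bool:
    return is_nan(bits) and (bits & QUIET_BIT) != 0


def compare_flags(insn: str, a_bits: int, b_bits: int, ve: int = 0) -> set:
    """Return the set of exception flags a compare would raise.

    insn is "fcmpu" or "fcmpo"; ve models the invalid-operation enable bit.
    Illustrative model only.
    """
    flags = set()
    any_snan = is_snan(a_bits) or is_snan(b_bits)
    any_qnan = is_qnan(a_bits) or is_qnan(b_bits)
    if any_snan:
        flags.add("VXSNAN")
        if insn == "fcmpo" and ve == 0:
            flags.add("VXVC")
    elif any_qnan and insn == "fcmpo":
        # Assumption: a QNaN with no SNaN sets only VXVC.
        flags.add("VXVC")
    return flags
```

With `0x7FF8000000000000` as a QNaN and `0x7FF0000000000001` as an SNaN, `compare_flags("fcmpu", qnan, one)` is empty while `compare_flags("fcmpo", qnan, one)` contains only `VXVC`.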

Semantics:

The ‘fcmp’ instruction compares op1 and op2 according to the condition code given as cond. If the operands are vectors, then the vectors are compared element by element. Each comparison performed always yields an i1 result, as follows:

false: always yields false, regardless of operands.
oeq: yields true if both operands are not a QNAN and op1 is equal to op2.
ogt: yields true if both operands are not a QNAN and op1 is greater than op2.
oge: yields true if both operands are not a QNAN and op1 is greater than or equal to op2.
olt: yields true if both operands are not a QNAN and op1 is less than op2.
ole: yields true if both operands are not a QNAN and op1 is less than or equal to op2.
one: yields true if both operands are not a QNAN and op1 is not equal to op2.
ord: yields true if both operands are not a QNAN.
ueq: yields true if either operand is a QNAN or op1 is equal to op2.
ugt: yields true if either operand is a QNAN or op1 is greater than op2.
uge: yields true if either operand is a QNAN or op1 is greater than or equal to op2.
ult: yields true if either operand is a QNAN or op1 is less than op2.
ule: yields true if either operand is a QNAN or op1 is less than or equal to op2.
une: yields true if either operand is a QNAN or op1 is not equal to op2.
uno: yields true if either operand is a QNAN.
true: always yields true, regardless of operands.
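
The table above can be modelled for scalar operands with a short Python sketch. The function name `fcmp` is only a stand-in for the instruction, and `math.isnan` treats every NaN alike, so the sketch assumes no distinction between quiet and signaling NaNs.

```python
import math


def fcmp(cond: str, op1: float, op2: float) -> bool:
    """Illustrative model of a scalar fcmp; not LLVM's implementation."""
    unordered = math.isnan(op1) or math.isnan(op2)
    if cond == "false":
        return False
    if cond == "true":
        return True
    if cond == "ord":
        return not unordered
    if cond == "uno":
        return unordered

    # Plain relations; every one except "ne" is already False for NaN operands.
    relations = {
        "eq": op1 == op2,
        "gt": op1 > op2,
        "ge": op1 >= op2,
        "lt": op1 < op2,
        "le": op1 <= op2,
        "ne": op1 != op2,
    }
    prefix, rel = cond[:1], cond[1:]
    if rel not in relations or prefix not in ("o", "u"):
        raise ValueError(f"unknown condition code: {cond!r}")
    if prefix == "o":
        # Ordered: neither operand is a NaN AND the relation holds.
        return (not unordered) and relations[rel]
    # Unordered: either operand is a NaN OR the relation holds.
    return unordered or relations[rel]
```

Note the asymmetry this makes visible: with a NaN operand, `fcmp("one", nan, 1.0)` is false but `fcmp("une", nan, 1.0)` is true, so `one` is not simply the negation of `oeq`.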

The fcmp instruction can also optionally take any number of fast-math flags, which are optimization hints to enable otherwise unsafe floating-point optimizations.

Any set of fast-math flags is legal on an fcmp instruction, but the only flags that have any effect on its semantics are those that allow assumptions to be made about the values of input arguments; namely nnan, ninf, and reassoc. See Fast-Math Flags for more information.

Example:

<result> = fcmp oeq float 4.0, 5.0    ; yields: result=false
<result> = fcmp one float 4.0, 5.0    ; yields: result=true
<result> = fcmp olt float 4.0, 5.0    ; yields: result=true
<result> = fcmp ueq double 1.0, 2.0   ; yields: result=false
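
The examples above use scalar operands. For vector operands the comparison is applied element by element and yields one boolean per lane. A minimal Python sketch of that behaviour for `olt`, with the hypothetical helper name `fcmp_olt_vector` and Python lists standing in for LLVM vectors:

```python
def fcmp_olt_vector(op1: list, op2: list) -> list:
    """Element-wise 'ordered and less than' over two equal-length lists.

    Illustrative only. Python's `<` is already False when either side is
    a NaN, which matches the ordered behaviour of olt.
    """
    if len(op1) != len(op2):
        raise ValueError("operands must have the same number of elements")
    return [a < b for a, b in zip(op1, op2)]
```

A lane holding a NaN yields false for `olt` without affecting the other lanes.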

Fast-Math Flags

LLVM IR floating-point operations (fneg, fadd, fsub, fmul, fdiv, frem, fcmp), phi, select and call may use the following flags to enable otherwise unsafe floating-point transformations.

nnan

No NaNs - Allow optimizations to assume the arguments and result are not NaN. If an argument is a NaN, or the result would be a NaN, it produces a poison value instead.

ninf

No Infs - Allow optimizations to assume the arguments and result are not +/-Inf. If an argument is +/-Inf, or the result would be +/-Inf, it produces a poison value instead.

nsz

No Signed Zeros - Allow optimizations to treat the sign of a zero argument or result as insignificant.

arcp

Allow Reciprocal - Allow optimizations to use the reciprocal of an argument rather than perform division.

contract

Allow floating-point contraction (e.g. fusing a multiply followed by an addition into a fused multiply-and-add).

afn

Approximate functions - Allow substitution of approximate calculations for functions (sin, log, sqrt, etc). See floating-point intrinsic definitions for places where this can apply to LLVM’s intrinsic math functions.

reassoc

Allow reassociation transformations for floating-point instructions. This may dramatically change results in floating-point.

fast

This flag implies all of the others.
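
To make the effect of nnan and ninf on a comparison concrete, here is a small Python sketch. `POISON` is a hypothetical sentinel standing in for LLVM's poison value, and `fcmp_olt` is an illustrative name. Since the result of fcmp is a boolean, only the arguments can trigger poison in this model.

```python
import math

# Hypothetical stand-in for LLVM's poison value.
POISON = object()


def fcmp_olt(op1: float, op2: float, nnan: bool = False, ninf: bool = False):
    """Model 'fcmp [nnan] [ninf] olt' on scalars; illustrative only."""
    if nnan and (math.isnan(op1) or math.isnan(op2)):
        return POISON  # nnan promised no NaN arguments
    if ninf and (math.isinf(op1) or math.isinf(op2)):
        return POISON  # ninf promised no infinite arguments
    return op1 < op2   # ordered less-than: False when a NaN is present
```

Without the flags a NaN argument simply gives false for `olt`; with nnan set, the same inputs give poison, which is what lets an optimizer assume the NaN case never occurs.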

UnsafeFPMath

https://llvm.org/doxygen/classllvm_1_1TargetOptions.html#a0544e2966374684ff74255e5a4290fa7

UnsafeFPMath - This flag is enabled when the -enable-unsafe-fp-math flag is specified on the command line.

When this flag is off (the default), the code generator is not allowed to produce results that are "less precise" than IEEE allows. This includes use of X86 instructions like FSIN and FCOS instead of libcalls.

NoNaNsFPMath

https://llvm.org/doxygen/classllvm_1_1TargetOptions.html#aaf9cc05758a26e784f7cfa554e76f175

This flag is enabled when the -enable-no-nans-fp-math flag is specified on the command line.

When this flag is off (the default), the code generator is not allowed to assume the FP arithmetic arguments and results are never NaNs.
