_mm_nmsub_ps

Artykuł
02/04/2013

Visual Studio 2010 SP1 is required.

Microsoft Specific

Generates the FMA4 XMM instruction vfnmsubps to perform a single-round floating-point negative multiply-subtract of its sources.

__m128 _mm_nmsub_ps (
   __m128 src1,
   __m128 src2,
   __m128 src3
);

Parameters

[in] src1
A 128-bit parameter that contains four 32-bit floating-point values.
[in] src2
A 128-bit parameter that contains four 32-bit floating-point values.
[in] src3
A 128-bit parameter that contains four 32-bit floating-point values.

Return value

A 128-bit result r that contains four 32-bit floating-point values.

r[i] := -(src1[i] * src2[i]) - src3[i];

Requirements

Intrinsic	Architecture
_mm_nmsub_ps	FMA4

Header file <intrin.h>

Remarks

Each of the four single-precision floating-point values in src1 is multiplied by the corresponding value in src2. The result is negated, the corresponding value in src3 is subtracted from it, and the result is stored as the corresponding value in the destination. Each multiply-add pair is performed with a single round at the end, as if intermediate results were computed to infinite precision.

The vfnmsubps instruction is part of the FMA4 family of instructions. Before you use this intrinsic, you must ensure that the processor supports this instruction. To determine hardware support for this instruction, call the __cpuid intrinsic with InfoType = 0x80000001 and check bit 16 of CPUInfo[2] (ECX). This bit is 1 when the instruction is supported, and 0 otherwise.

Example

#include <stdio.h>
#include <intrin.h>
int main()
{
    __m128 a, b, c, d;
    int i;
    for (i = 0; i < 4; i++) {
        a.m128_f32[i] = i;
        b.m128_f32[i] = 2.;
        c.m128_f32[i] = 3.;
    }
    d = _mm_nmsub_ps(a, b, c);
    for (i = 0; i < 4; i++) printf_s(" %.3f", d.m128_f32[i]);
    printf_s("\n");
}

-3.000 -5.000 -7.000 -9.000

Udostępnij za pośrednictwem

_mm_nmsub_ps

Parameters

Return value

Requirements

Remarks

Example

See Also

Reference

Dodatkowe zasoby