[ARM] Fix costing of vmul+vcvt combine pattern

Message ID	562F8257.6080006@arm.com
State	Accepted
Commit	6a9ee02f7afa32a1bded5d4d0644ac1b02064148
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of gcc-patches-return-411678-patch=linaro.org@gcc.gnu.org designates 209.132.180.131 as permitted sender) client-ip=209.132.180.131; DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender :message-id:date:from:mime-version:to:cc:subject:content-type; q=dns; s=default; b=qEreceBK+7jKCitQxa3G0R5iJSpVW5m8wUG9YzPNU5L halqYtkYUuqU6JN/PwYVzrJx8ZJ57l7xmvqwh2XCqroEudRnzgA38RCjxbd6xRyR GBq0D47sTqjMFGBDZTUI7ZXtk7xGA9rezRnP2h1qUEFD7TiY/ilQjs9lrr4IfB2k = Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk Sender: gcc-patches-owner@gcc.gnu.org Message-ID: <562F8257.6080006@arm.com> Date: Tue, 27 Oct 2015 13:55:35 +0000 From: Kyrill Tkachov <kyrylo.tkachov@arm.com> User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.2.0 MIME-Version: 1.0 To: GCC Patches <gcc-patches@gcc.gnu.org> CC: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>, Richard Earnshaw <Richard.Earnshaw@arm.com> Subject: [PATCH][ARM] Fix costing of vmul+vcvt combine pattern Content-Type: multipart/mixed; boundary="------------020806040002030808040104"

Message ID

562F8257.6080006@arm.com

State

Accepted

Commit

6a9ee02f7afa32a1bded5d4d0644ac1b02064148

Headers

Received-SPF: pass (google.com: domain of
	gcc-patches-return-411678-patch=linaro.org@gcc.gnu.org
	designates 209.132.180.131 as permitted sender)
	client-ip=209.132.180.131; 
DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id
	:list-unsubscribe:list-archive:list-post:list-help:sender
	:message-id:date:from:mime-version:to:cc:subject:content-type;
	q=dns; s=default; b=qEreceBK+7jKCitQxa3G0R5iJSpVW5m8wUG9YzPNU5L
	halqYtkYUuqU6JN/PwYVzrJx8ZJ57l7xmvqwh2XCqroEudRnzgA38RCjxbd6xRyR
	GBq0D47sTqjMFGBDZTUI7ZXtk7xGA9rezRnP2h1qUEFD7TiY/ilQjs9lrr4IfB2k
	=
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
Sender: gcc-patches-owner@gcc.gnu.org
Message-ID: <562F8257.6080006@arm.com>
Date: Tue, 27 Oct 2015 13:55:35 +0000
From: Kyrill Tkachov <kyrylo.tkachov@arm.com>
User-Agent: Mozilla/5.0 (X11; Linux x86_64;
	rv:31.0) Gecko/20100101 Thunderbird/31.2.0
MIME-Version: 1.0
To: GCC Patches <gcc-patches@gcc.gnu.org>
CC: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>,
	Richard Earnshaw <Richard.Earnshaw@arm.com>
Subject: [PATCH][ARM] Fix costing of vmul+vcvt combine pattern
Content-Type: multipart/mixed;
	boundary="------------020806040002030808040104"

Commit Message

Kyrylo Tkachov Oct. 27, 2015, 1:55 p.m. UTC

Hi all,

This patch allows us to handle the *combine_vcvtf2i pattern in rtx costs by properly identifying it
as a toint coversion. Before this I saw a pattern like:
(set (reg/i:SI 0 r0)
     (fix:SI (fix:SF (mult:SF (reg:SF 16 s0 [ a ])
                 (const_double:SF 3.2e+1 [0x0.8p+6])))))

being assigned a cost of 40 because the costs blindly recursed into the operands.
With this patch for -mcpu=cortex-a57 I see it being assigned a cost of 4.

Bootstrapped and tested on arm-none-linux-gnueabihf.

Ok for trunk?

Thanks,
Kyrill

2015-10-27  Kyrylo Tkachov  <kyrylo.tkachov@arm.com>

     * config/arm/arm.c (arm_new_rtx_costs, FIX case): Handle
     combine_vcvtf2i pattern.

Comments

Kyrylo Tkachov Nov. 3, 2015, 10:22 a.m. UTC | #1

Ping.
https://gcc.gnu.org/ml/gcc-patches/2015-10/msg02898.html

Thanks,
Kyrill

On 27/10/15 13:55, Kyrill Tkachov wrote:
> Hi all,

>

> This patch allows us to handle the *combine_vcvtf2i pattern in rtx costs by properly identifying it

> as a toint coversion. Before this I saw a pattern like:

> (set (reg/i:SI 0 r0)

>     (fix:SI (fix:SF (mult:SF (reg:SF 16 s0 [ a ])

>                 (const_double:SF 3.2e+1 [0x0.8p+6])))))

>

> being assigned a cost of 40 because the costs blindly recursed into the operands.

> With this patch for -mcpu=cortex-a57 I see it being assigned a cost of 4.

>

> Bootstrapped and tested on arm-none-linux-gnueabihf.

>

> Ok for trunk?

>

> Thanks,

> Kyrill

>

> 2015-10-27  Kyrylo Tkachov  <kyrylo.tkachov@arm.com>

>

>     * config/arm/arm.c (arm_new_rtx_costs, FIX case): Handle

>     combine_vcvtf2i pattern.

commit 1e040710d1022ce816eac9b4f6065bc7aa2be9cf
Author: Kyrylo Tkachov <kyrylo.tkachov@arm.com>
Date:   Wed Oct 14 11:26:07 2015 +0100

    [ARM] Fix costing of vmul+vcvt combine pattern

diff --git a/gcc/config/arm/arm.c b/gcc/config/arm/arm.c
index b37b507..33ad433 100644
--- a/gcc/config/arm/arm.c
+++ b/gcc/config/arm/arm.c
@@ -11064,6 +11064,23 @@  arm_new_rtx_costs (rtx x, enum rtx_code code, enum rtx_code outer_code,
     case UNSIGNED_FIX:
       if (TARGET_HARD_FLOAT)
 	{
+	  /* The *combine_vcvtf2i reduces a vmul+vcvt into
+	     a vcvt fixed-point conversion.  */
+	  if (code == FIX && mode == SImode
+	      && GET_CODE (XEXP (x, 0)) == FIX
+	      && GET_MODE (XEXP (x, 0)) == SFmode
+	      && GET_CODE (XEXP (XEXP (x, 0), 0)) == MULT
+	      && vfp3_const_double_for_bits (XEXP (XEXP (XEXP (x, 0), 0), 1))
+		 > 0)
+	    {
+	      if (speed_p)
+		*cost += extra_cost->fp[0].toint;
+
+	      *cost += rtx_cost (XEXP (XEXP (XEXP (x, 0), 0), 0), mode,
+				 code, 0, speed_p);
+	      return true;
+	    }
+
 	  if (GET_MODE_CLASS (mode) == MODE_INT)
 	    {
 	      mode = GET_MODE (XEXP (x, 0));