diff mbox

[LRA] Fix ICE for paradoxical subregs on strict-alignment platforms

Message ID 30870806.6LhnUrUFi2@polaris
State New
Headers show

Commit Message

Eric Botcazou Dec. 13, 2016, 10:05 p.m. UTC
Hi,

the Ada runtime library doesn't build on SPARC 32-bit with LRA because of an 
ICE on a couple of files:

eric@polaris:~/build/gcc/sparc-sun-solaris2.10> gcc/gnat1 -I gcc/ada/rts -
quiet -dumpbase g-debpoo.adb -auxbase g-debpoo -O2 -gnatpg -fPIC -mlra -
mcpu=v9 g-debpoo.adb -o g-debpoo.s
+===========================GNAT BUG DETECTED==============================+
| 7.0.0 20161213 (experimental) [trunk revision 243595] (sparc-sun-
solaris2.10) GCC error:|
| in lra_set_insn_recog_data, at lra.c:965                                 |
| Error detected around g-debpoo.adb:1896:8     

The problem is that curr_insn_transform attempts to reload subregs before 
reloading addresses:

  if (! check_only_p)
    /* Make equivalence substitution and memory subreg elimination
       before address processing because an address legitimacy can
       depend on memory mode.  */

Now, on strict-alignment platforms, simplify_operand_subreg has a special 
provision for paradoxical subregs of memory references:

	 /* If we change the address for a paradoxical subreg of memory, the
	    address might violate the necessary alignment or the access might
	    be slow.  So take this into consideration.  We need not worry
	    about accesses beyond allocated memory for paradoxical memory
	    subregs as we don't substitute such equiv memory (see processing
	    equivalences in function lra_constraints) and because for spilled
	    pseudos we allocate stack memory enough for the biggest
	    corresponding paradoxical subreg.  */
	  if (!(MEM_ALIGN (reg) < GET_MODE_ALIGNMENT (mode)
		&& SLOW_UNALIGNED_ACCESS (mode, MEM_ALIGN (reg)))
	      || (MEM_ALIGN (reg) < GET_MODE_ALIGNMENT (innermode)
		  && SLOW_UNALIGNED_ACCESS (innermode, MEM_ALIGN (reg))))
	    return true;

That is to say, if the adjusted memory reference is not sufficient aligned, 
the routine will reload the original memory reference instead of the adjusted 
one on strict-alignment platforms.  But there is a hitch: if the address of 
this original memory reference is invalid, the routine will nevertheless 
reload the entire memory reference, leading to the aforementioned ICE.

The proposed fix is to reload the address, if it is invalid, just before 
reloading the memory reference in simplify_operand_subreg; this yields the 
correct sequence of reloads for the case I investigated.  The patch also 
contains formatting fixes and adds a 'return' for the sake of clarity.

Tested on x86-64/Linux and SPARC/Solaris with LRA, OK for the mainline?


2016-12-13  Eric Botcazou  <ebotcazou@adacore.com>

	* lra-constraints.c (process_address): Add forward declaration.
	(simplify_operand_subreg): In the MEM case, if the adjusted memory
	reference is not sufficient aligned and the address was invalid,
	reload the address before reloading the original memory reference.
	Fix long lines and add a final return for the sake of clarity.

-- 
Eric Botcazou

Comments

Vladimir Makarov Dec. 16, 2016, 10:43 p.m. UTC | #1
On 12/13/2016 05:05 PM, Eric Botcazou wrote:
> Hi,

>

> the Ada runtime library doesn't build on SPARC 32-bit with LRA because of an

> ICE on a couple of files:

>

> eric@polaris:~/build/gcc/sparc-sun-solaris2.10> gcc/gnat1 -I gcc/ada/rts -

> quiet -dumpbase g-debpoo.adb -auxbase g-debpoo -O2 -gnatpg -fPIC -mlra -

> mcpu=v9 g-debpoo.adb -o g-debpoo.s

> +===========================GNAT BUG DETECTED==============================+

> | 7.0.0 20161213 (experimental) [trunk revision 243595] (sparc-sun-

> solaris2.10) GCC error:|

> | in lra_set_insn_recog_data, at lra.c:965                                 |

> | Error detected around g-debpoo.adb:1896:8

>

> The problem is that curr_insn_transform attempts to reload subregs before

> reloading addresses:

>

>    if (! check_only_p)

>      /* Make equivalence substitution and memory subreg elimination

>         before address processing because an address legitimacy can

>         depend on memory mode.  */

>

> Now, on strict-alignment platforms, simplify_operand_subreg has a special

> provision for paradoxical subregs of memory references:

>

> 	 /* If we change the address for a paradoxical subreg of memory, the

> 	    address might violate the necessary alignment or the access might

> 	    be slow.  So take this into consideration.  We need not worry

> 	    about accesses beyond allocated memory for paradoxical memory

> 	    subregs as we don't substitute such equiv memory (see processing

> 	    equivalences in function lra_constraints) and because for spilled

> 	    pseudos we allocate stack memory enough for the biggest

> 	    corresponding paradoxical subreg.  */

> 	  if (!(MEM_ALIGN (reg) < GET_MODE_ALIGNMENT (mode)

> 		&& SLOW_UNALIGNED_ACCESS (mode, MEM_ALIGN (reg)))

> 	      || (MEM_ALIGN (reg) < GET_MODE_ALIGNMENT (innermode)

> 		  && SLOW_UNALIGNED_ACCESS (innermode, MEM_ALIGN (reg))))

> 	    return true;

>

> That is to say, if the adjusted memory reference is not sufficient aligned,

> the routine will reload the original memory reference instead of the adjusted

> one on strict-alignment platforms.  But there is a hitch: if the address of

> this original memory reference is invalid, the routine will nevertheless

> reload the entire memory reference, leading to the aforementioned ICE.

>

> The proposed fix is to reload the address, if it is invalid, just before

> reloading the memory reference in simplify_operand_subreg; this yields the

> correct sequence of reloads for the case I investigated.  The patch also

> contains formatting fixes and adds a 'return' for the sake of clarity.

>

> Tested on x86-64/Linux and SPARC/Solaris with LRA, OK for the mainline?

OK.  Thanks for working on this issue, Eric.

The subreg reload code is very sensitive to changes and actually had a 
lot of changes.  So we should be aware that the patch might create new 
PRs.  But I believe the probability of this is tiny.

>

> 2016-12-13  Eric Botcazou  <ebotcazou@adacore.com>

>

> 	* lra-constraints.c (process_address): Add forward declaration.

> 	(simplify_operand_subreg): In the MEM case, if the adjusted memory

> 	reference is not sufficient aligned and the address was invalid,

> 	reload the address before reloading the original memory reference.

> 	Fix long lines and add a final return for the sake of clarity.

>
diff mbox

Patch

Index: lra-constraints.c
===================================================================
--- lra-constraints.c	(revision 243595)
+++ lra-constraints.c	(working copy)
@@ -1454,6 +1454,7 @@  insert_move_for_subreg (rtx_insn **befor
 }
 
 static int valid_address_p (machine_mode mode, rtx addr, addr_space_t as);
+static bool process_address (int, bool, rtx_insn **, rtx_insn **);
 
 /* Make reloads for subreg in operand NOP with internal subreg mode
    REG_MODE, add new reloads for further processing.  Return true if
@@ -1480,13 +1481,13 @@  simplify_operand_subreg (int nop, machin
   type = curr_static_id->operand[nop].type;
   if (MEM_P (reg))
     {
-      rtx subst;
-
+      const bool addr_was_valid
+	= valid_address_p (innermode, XEXP (reg, 0), MEM_ADDR_SPACE (reg));
       alter_subreg (curr_id->operand_loc[nop], false);
-      subst = *curr_id->operand_loc[nop];
+      rtx subst = *curr_id->operand_loc[nop];
       lra_assert (MEM_P (subst));
-      if (! valid_address_p (innermode, XEXP (reg, 0),
-			     MEM_ADDR_SPACE (reg))
+
+      if (!addr_was_valid
 	  || valid_address_p (GET_MODE (subst), XEXP (subst, 0),
 			      MEM_ADDR_SPACE (subst))
 	  || ((get_constraint_type (lookup_constraint
@@ -1503,10 +1504,10 @@  simplify_operand_subreg (int nop, machin
 						    ADDRESS, SCRATCH)][0]],
 				  MEM_ADDR_SPACE (subst))))
 	{
-	  /* If we change address for paradoxical subreg of memory, the
+	  /* If we change the address for a paradoxical subreg of memory, the
 	     address might violate the necessary alignment or the access might
-	     be slow.  So take this into consideration.  We should not worry
-	     about access beyond allocated memory for paradoxical memory
+	     be slow.  So take this into consideration.  We need not worry
+	     about accesses beyond allocated memory for paradoxical memory
 	     subregs as we don't substitute such equiv memory (see processing
 	     equivalences in function lra_constraints) and because for spilled
 	     pseudos we allocate stack memory enough for the biggest
@@ -1517,28 +1518,36 @@  simplify_operand_subreg (int nop, machin
 		  && SLOW_UNALIGNED_ACCESS (innermode, MEM_ALIGN (reg))))
 	    return true;
 
+	  *curr_id->operand_loc[nop] = operand;
+
+	  /* But if the address was not valid, we cannot reload the MEM without
+	     reloading the address first.  */
+	  if (!addr_was_valid)
+	    process_address (nop, false, &before, &after);
+
 	  /* INNERMODE is fast, MODE slow.  Reload the mem in INNERMODE.  */
 	  enum reg_class rclass
 	    = (enum reg_class) targetm.preferred_reload_class (reg, ALL_REGS);
-	  if (get_reload_reg (curr_static_id->operand[nop].type, innermode, reg,
-			      rclass, TRUE, "slow mem", &new_reg))
+	  if (get_reload_reg (curr_static_id->operand[nop].type, innermode,
+			      reg, rclass, TRUE, "slow mem", &new_reg))
 	    {
 	      bool insert_before, insert_after;
 	      bitmap_set_bit (&lra_subreg_reload_pseudos, REGNO (new_reg));
 
 	      insert_before = (type != OP_OUT
-			       || GET_MODE_SIZE (innermode) > GET_MODE_SIZE (mode));
+			       || GET_MODE_SIZE (innermode)
+				    > GET_MODE_SIZE (mode));
 	      insert_after = type != OP_IN;
 	      insert_move_for_subreg (insert_before ? &before : NULL,
 				      insert_after ? &after : NULL,
 				      reg, new_reg);
 	    }
-	  *curr_id->operand_loc[nop] = operand;
 	  SUBREG_REG (operand) = new_reg;
 
 	  /* Convert to MODE.  */
 	  reg = operand;
-	  rclass = (enum reg_class) targetm.preferred_reload_class (reg, ALL_REGS);
+	  rclass
+	    = (enum reg_class) targetm.preferred_reload_class (reg, ALL_REGS);
 	  if (get_reload_reg (curr_static_id->operand[nop].type, mode, reg,
 			      rclass, TRUE, "slow mem", &new_reg))
 	    {
@@ -1561,6 +1570,7 @@  simplify_operand_subreg (int nop, machin
 	 the memory.  Typical case is when the index scale should
 	 correspond the memory.  */
       *curr_id->operand_loc[nop] = operand;
+      return false;
     }
   else if (REG_P (reg) && REGNO (reg) < FIRST_PSEUDO_REGISTER)
     {