| From | Subject | Date |
|---|---|---|
| Bernd Schubert | Re: ext4: (2.6.34-rc4): This should not happen!! Data w ...
That will not help if the command is "SYNCHRONIZE_CACHE", as that ignores
device settings, but uses scsi default timeout (30s), which is far too small
for SATA based raid units. Scsi maintainers ignored that and a couple of other
patches I wrote to improve error handling with Infortrend units. Will send the
patches again soon.
Also, if the abort command succeeds, it the command should be re-queued and
should not result in an error. I think my patches also would increase
verbosity to point ...
| Apr 17, 9:55 am 2010 |
| Andre Noll | Re: ext4: (2.6.34-rc4): This should not happen!! Data w ...
Please CC me when you do so. The machine I am having trouble with is
only our fallback server. I can use it freely for testing and am willing
to give your patches a try.
Thanks
Andre
--
The only person who always got his work done by Friday was Robinson Crusoe
| Apr 17, 11:19 am 2010 |
| Bernd Schubert | Re: ext4: (2.6.34-rc4): This should not happen!! Data w ...
There is actually not much to test, as the patches had been the only solution
to stabilize a large Lustre environment with dozens of Infortrend Raids. I
spent months to debug Infortrend Raids, scsi stack and the LSI MPT fusion
driver. Nowadays some patches are also used for DDN customers.
I'm just always out of time to forward port it to more recent linux-git and to
resend.
Cheers,
Bernd
--
| Apr 17, 11:43 am 2010 |
| Eric Sandeen | Re: ext4: (2.6.34-rc4): This should not happen!! Data w ...
Bernd Schubert wrote:
Interesting that it only happens at enospc time though - the fs would
be sending barriers in general usage.
Although, as ext4 fills, it does do more syncing/flushing to try to
eke out that last bit of space.... hmm....
Andre, just for fun, you might try mounting -o nodelalloc, fill it again,
and see if you see the same behavior.
If not, you might do it again w/ defaults, and capture blktrace data, which
I think will trace barrier requests as well.
(or, maybe also ...
| Apr 17, 1:45 pm 2010 |
| Andre Noll | Re: ext4: (2.6.34-rc4): This should not happen!! Data w ...
Will do. This will take some time though as I destroyed the fs by
writing directly to the device, so I had to recreate it and start from
scratch. I'm currently filling the new fs using a 60s device timeout as
I'll try to reproduce the problem using different timeout values and the
ext4 options you suggest. If I can find a reliable reproducer, I'll run
blktrace and post the results.
Regards
Andre
--
The only person who always got his work done by Friday was Robinson Crusoe
| Apr 17, 3:38 pm 2010 |
| Dmitry Monakhov | Apr 17, 3:57 am 2010 | |
| Kailas Joshi | Re: Help on Implementation of EXT3 type Ordered Mode in EXT4
Hi
I have implemented alloc_on_commit for EXT4.
I haven't tested it thoroughly, but I could run some test scripts and
postmark without any errors.
Though it's working, the performance it very poor.
As it was predicted by Ted, I guess it is because of the increased
time in stalling of filesystem operations as block allocation is done
while transaction is in LOCKED mode.
I am sending the patch(for kernel 2.6.32.4) for my implementation.
Please go through the patch and let me know if I am ...
| Apr 16, 9:42 pm 2010 |
| previous day | today | next day |
|---|---|---|
| April 16, 2010 | April 17, 2010 | April 18, 2010 |
