Commit Graph

5552 Commits (cc01c3cb5ae708689228d1ec6e751f91e7f90741)

Author SHA1 Message Date
Ben Skeggs a8bdf0e00c nouveau: add reloc refcnt to pending bo list
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
2009-11-04 14:58:19 +10:00
Ben Skeggs 63910f8aa6 nouveau: drop rendering on the floor rather than asserting if flush fails
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
2009-11-04 13:24:52 +10:00
Ben Skeggs 68c2ac9640 nouveau: update view of available aperture space after each flush
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
2009-11-04 13:24:52 +10:00
Mathias Fröhlich 6eafd1cf38 radeon: fix allocation
The old code increments the command stream size by another kbyte, but does
not make sure that the requested packet size fits into the stream. The patch
ensures that the whole next packet fits there and rounds the allocated size to
a power of two.

Signed-off-by: Alex Deucher <alexdeucher@gmail.com>
2009-11-03 11:41:26 -05:00
Kristian Høgsberg b0b96636db Add support for vblank events
The kernel will now write data to the DRM fd for various event types if
requested.  Currently, the only supported event is a vblank event: it contains
the vblank count for the event as well as a timestamp from when the event
ocurred.  Since the DRM fd is now pollable, it's easy to integrate into
existing event loops.
2009-10-29 17:14:09 -07:00
Eric Anholt 0d7ad7e43c intel: Only call clock_gettime once per unreference_final.
Notably when freeing a batchbuffer, we often end up freeing many of the
buffers it points at as well.  Avoiding repeated calls brings us a 9% CPU
win for cairo-gl.

[ # ]  backend                         test   min(s) median(s) stddev. count
before:
[  0]       gl            firefox-talos-gfx   58.941   58.966   0.75%    3/3
after:
[  0]       gl            firefox-talos-gfx   54.186   54.195   0.49%    3/3
2009-10-20 14:24:44 -07:00
Eric Anholt 66d2714f54 intel: Improve bo_references performance by skipping the tree walk.
If the target we're asking about hasn't ever been used as a relocation
target, then it obviously hasn't been used as a target by the batch's reloc
tree.  This is the common case for good GL programming where you only map
fresh buffers, and gives us a 5% win in cairo-gl.

[ # ]  backend                         test   min(s) median(s) stddev. count
before:
[  0]       gl            firefox-talos-gfx   64.680   64.756   0.06%    3/3
after:
[  0]       gl            firefox-talos-gfx   60.816   60.970   0.29%    3/3
2009-10-20 13:31:55 -07:00
Ben Skeggs 67628aa39d nouveau: retry if pushbuf ioctl interrupted by signal
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
2009-10-20 12:57:46 +10:00
Ben Skeggs fdd960a95c nouveau: function to return status of bo being referenced in pushbuf
Will be used to implement gallium's is_{texture,buffer}_referenced()
callbacks properly.

Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
2009-10-19 09:07:45 +10:00
Patrice Mandin 3a387a983e nouveau: nv30: add render target logbase2 width,height 2009-10-13 22:50:46 +02:00
Eric Anholt a221e4fd92 intel: Add the defines for the kernel overlay support landing in 2.6.33. 2009-10-13 11:36:01 -07:00
Chris Wilson 901bacd29c intel: Fallback to atomic-ops.h [libatomic-ops-dev]
Use the external implementation for atomic operations across a wide
range of architectures.

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-13 15:13:00 +01:00
Chris Wilson 24c905f841 configure: Typo in error message.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-13 14:25:54 +01:00
Chris Wilson fc8f6be5a9 configure: Conditionally build libdrm_intel
Only build libdrm_intel automatically if we have support for atomic
operations. To force configure to build drm pass --enable-intel, which
will cause the configure to error if no support is found. Or pass
--disable-intel to explicitly prevent libdrm_intel from being built.

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-12 23:58:47 +01:00
Eric Anholt a107e5b129 Bump to 2.4.15 for release. 2009-10-08 16:59:17 -07:00
Eric Anholt 9764061ab1 intel: Remove the asserts about the ignored alignment parameter.
I slipped it in with the alloc_tiled changes, since we were explicitly
throwing the parameter away.  It caught some bogus released code, which
we've now fixed, so remove the asserts to keep old drivers working.
2009-10-08 15:55:02 -07:00
Jesse Barnes 3a7dfcdfaf intel: Add a bo_alloc function for tiled BOs.
This simplifies driver code in handling object allocation, and also gives us
an opportunity to possibly cache tiled buffers if it turns out to be a win.

[anholt: This is chopped out of the execbuf2 patch, as it seems to be useful
separately and cleans up the execbuf2 changes to be more obvious]
2009-10-06 15:45:10 -07:00
Eric Anholt 02c775fc75 intel: Fix up some stale doxygen comments. 2009-10-06 15:45:10 -07:00
Eric Anholt d70d60529f intel: Reformat to the kernel coding style. Welcome to the 8-space future.
This is done with:
Lindent *.[ch]
perl -pi -e 's|drm_intel_bo \* |drm_intel_bo *|g' *.[ch]
perl -pi -e 's|drm_intel_bufmgr \* |drm_intel_bufmgr *|g' *.[ch]
perl -pi -e 's|drm_intel_bo_gem \* |drm_intel_bo_gem *|g' *.[ch]
perl -pi -e 's|drm_intel_bufmgr_gem \* |drm_intel_bufmgr_gem *|g' *.[ch]
perl -pi -e 's|_fake \* |_fake *|g' *.[ch]
hand-editing to whack indented comments into line and other touchups.
2009-10-06 15:45:06 -07:00
Eric Anholt 3c9bd068e0 intel: Don't allocate more relocation entries than the BO could support.
This saves 32k of relocation entry storage for many 965 state buffers.  No
noticeable impact on performance for cairo-gl firefox.
2009-10-06 13:30:07 -07:00
Chris Wilson ac34f599ea tests: Disable intel-specific tests with --disable-intel
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-03 23:18:19 +01:00
Chris Wilson ab3300c581 intel: Add a configure option to *disable* building libdrm-intel
In conjunction with the atomic operation patch, it may be more
convenient for some people to disable building libdrm-intel and its
dependencies upon the atomic intrinsics then it is for them to use a
supported compiler.

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-03 22:58:25 +01:00
Chris Wilson 61bddf8504 intel: report errno
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-03 22:58:16 +01:00
Michel Dänzer 11f0e450dd libdrm_radeon: Update RADEON_TILING_* flags to what's in current kernels. 2009-10-03 17:37:07 +02:00
Nicolai Hähnle a5fb264257 libdrm_radeon: Zero-initialize structures to silence valgrind warnings
Signed-off-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-10-03 13:46:10 +02:00
Chris Wilson 04495eeec2 intel: Use atomic refcounters
As the target architecture for Intel GPUs is the x86, we can presume to
have reasonable compiler support for Intel atomic intrinsics, i.e. gcc,
and so use those in preference to pulling in a complicated mess of
fragile assembly.

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
[anholt: hand-resolved against my previous commit.  This brings cairo-gl
firefox-talos-gfx time from 65 seconds back down to 62 seconds.]
Signed-off-by: Eric Anholt <eric@anholt.net>
2009-10-02 13:21:23 -07:00
Chris Wilson 0fb215ae31 intel: Mark cached bo as purgeable
Set the DONTNEED flag on cached buffers so that the kernel is free to
discard those when under memory pressure.

[anholt: This takes firefox-talos-gfx time from ~62 seconds to ~65 seconds
on my GM965, but it seems like a hit worth taking for the improved
functionality from saving memory]

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Eric Anholt <eric@anholt.net>
2009-10-02 12:54:13 -07:00
Eric Anholt 12d9b7cc85 intel: Don't free the reloc list when putting a freed BO in the cache.
This takes firefox-talos-gfx from 74 seconds to 70 seconds on my GM965.
2009-10-02 11:11:31 -07:00
Eric Anholt 769b105780 intel: Add a new function to check if a BO's reloc tree references some BO.
There are a bunch of places in GL where if we can't do this we have to
flush the batchbuffer, and the cost of lookups here is outweighed by flush
savings.
2009-10-01 19:09:26 -07:00
Eric Anholt ac71f08499 Bump to 2.4.14 for release. 2009-09-21 15:29:58 -07:00
Eric Anholt 51b89733c5 intel: Remove the max_entries stuff that complicated bo reuse.
I thought I was going to do all sorts of crazy experiments with it.  I never
did, and it turned out the free-after-a-few-seconds plan is working out fine.
2009-09-21 15:23:05 -07:00
Eric Anholt 456a358b9a intel: Remove the old swrast flag for reducing cache flushing.
It hasn't been doing anything effective since
52e5d24fae, and we pretty much don't bo_map
pinned buffers any more anyway.
2009-09-21 15:23:05 -07:00
Keith Packard 0a24654129 drmModeFreeConnector: free encoders and properties
These were leaking.

Signed-off-by: Keith Packard <keithp@keithp.com>
2009-09-17 17:28:49 -07:00
Dave Airlie cdd325b59a radeon: fix 32/64 bit issue with sign extension
Not sure what intptr_t was up to here.

Reported and tested by: Kevin DeKorte
Signed-off-by: Dave Airlie <airlied@redhat.com>
2009-09-15 07:30:22 +10:00
Zhenyu Wang 67e4172394 libdrm_intel: include B43 chipset check
Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>
2009-09-07 16:17:04 +08:00
Zhenyu Wang 121b9648f8 libdrm_intel: add new pci ids
New ids for G41, Clarkdale and Arrandale.
Make sure we don't need to count fence also on new chips.

Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>
2009-09-04 09:24:23 +08:00
Eric Anholt 73b59c8943 Bump to version 2.4.13 for release. 2009-08-28 15:20:22 -07:00
Eric Anholt 8214a65ad1 Add drm_intel_bo_busy to query whether mapping a BO would block. 2009-08-28 14:56:41 -07:00
Michel Dänzer 19d6fadfa2 Revert "libdrm_radeon: Always wait for BO idle in bo_map."
This reverts commit 0a732983f0.

Paul Nieminen and Dave Airlie pointed out on IRC that this shouldn't be
necessary. I was seeing visual corruption in X before I made this change, but
I can't reproduce that anymore so it was probably an unrelated issue.
2009-08-28 11:42:07 +02:00
Michel Dänzer 0a732983f0 libdrm_radeon: Always wait for BO idle in bo_map.
This allows users to eliminate explicit bo_wait calls before bo_map calls.
2009-08-27 08:36:58 +02:00
Jakob Bornecrantz ce6c68dc8a Kill last remnant of replacefb ioctl
Kenrels doesn't expose this ioctl
2009-08-25 12:05:48 +02:00
Alex Deucher 02a4d22e95 radeon: pull in z pipe changes from kernel 2009-08-24 18:15:03 -04:00
Pauli Nieminen caad8d8555 radeon: add support for busy/domain check interface.
airlied: modified the interface to drop busy return value, just return
it normally, also fixed int->uint32_t for domain

Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-22 13:16:18 +10:00
Ben Skeggs cbb3ae3dab nouveau: fix a thinko in copyless pushbuf ioctl
No idea why G80 doesn't hit this, but, this fixes at least one NV40 card.
2009-08-19 15:56:24 +10:00
Pauli Nieminen 8c43b79b21 libdrm_radeon: Optimize copy of table to cs buffer with specialized call.
Using this call in OUT_BATCH_TABLE reduces radeonEmitState cpu usage from
9% to 5% and emit_vpu goes from 7% to 1.5%. I did use calgrind to profile
gears for cpu hotspots with r500 card.

Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-18 14:54:25 -04:00
Pauli Nieminen a474fd978c libdrm_radeon: Fix loops so that compiler can optimize them.
GCC did war about optimization not possible because possible forever loop.

Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-18 14:53:46 -04:00
Pauli Nieminen 64cef1e465 libdrm/radeon: Update head of linked list not to point freed memory.
Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-18 14:52:43 -04:00
Ben Skeggs 28f4bfa04b nouveau: support for copy-less pushbuf ioctl 2009-08-18 15:55:42 +10:00
Ben Skeggs 250ab3a38e nouveau: for the moment, assert if we exceed some reloc limits
Nasty, but nicer than silently not writing into the pushbuf
2009-08-18 15:55:02 +10:00
Dave Airlie 1978f6d8d1 radeon: fix bo wait at map time. 2009-08-17 21:21:53 +10:00