Mainly the CPU's TLB (with consideration for the L1-L3 caches) has to be aware of the pages the GPU is using and modifying.
Again, this isn't a problem when the entire bus is shared. When the CPU and GPU have to 'fight' for access to the same bus, they can be kept coherent, naturally.
AGP was a specialized 32-bit PCI bus that also did DiME (Direct Memory Execute), but Intel platforms were still shared-bus. Once the 32-bit Athlon MP came out (which actually used the 40-bit EV6 interconnect from the true 64-bit Alpha 21264), the ability of CPUs, and even I/O, to access memory directly, independently of other 'points' on the 'crossbar switch', introduced the biggest coherency mess.
Intel knew they'd eventually have to follow AMD's lead, especially once Opteron and, correspondingly, AMD64 hit (and literally started smacking Intel out of datacenters). So they designed serial PCI Express (PCIe) to better support some features for DiME; at the time, the effort was referred to as Third Generation I/O (3GIO). But it would be many more years before QuickPath Interconnect (QPI) actually showed up and Intel was able to take advantage of the aggregate throughput possible.
Had Intel actually had QPI in its processors when PCIe was introduced, we might not all have PCIe video cards today. We'd probably have a direct GPU slot sitting right on QPI itself, which would solve a lot of concurrency issues.
Side Note: Nehalem, QPI and Intel's first 38-bit (256GiB) Physical Address Extension (PAE) capable x86-64 chips (before then, Intel's x86 and x86-64 processors were only capable of 32-36-bit, 4-64GiB) actually had major issues in their first revision, especially when it came to multi-socket. It was the most radical system interconnect change for Intel since the old Pentium Pro of the mid-'90s, which introduced 36-bit (64GiB) PAE in the first place with the i686.
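The bit-width to capacity figures quoted throughout (36-bit = 64GiB, 38-bit = 256GiB, and so on) are just powers of two. A quick sketch (my own helper, not from any spec) to check them:

```python
# Sketch: an address width of n bits reaches 2**n bytes of memory.
def capacity(bits: int) -> str:
    units = ["B", "KiB", "MiB", "GiB", "TiB", "PiB"]
    size = 1 << bits          # 2**bits bytes
    i = 0
    while size >= 1024 and i < len(units) - 1:
        size //= 1024         # step up one binary unit at a time
        i += 1
    return f"{size}{units[i]}"

for bits in (32, 36, 38, 48, 52):
    print(bits, capacity(bits))
# 32 -> 4GiB, 36 -> 64GiB, 38 -> 256GiB, 48 -> 256TiB, 52 -> 4PiB
```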
The irony here is that Digital largely helped Intel develop the 32-bit Pentium Pro (i686) TLB (and later sued Intel over it, long story), in addition to the ALU, because Intel had design failures in the original i586 Pentium (the ALU sucked, which is why Intel loaded integers through the FPU, a 'Pentium optimization' that was stupid but necessary) on top of the limited i486 TLB. The resulting 3-level, 36-bit (64GiB) PAE paging became the 'blueprint' for AMD's "Long Mode" that most people know as x86-64 today: 48-bit (256TiB) flat addressing mapped to (up to) 4-level, 52-bit (4PiB) PAE paging.
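To make that 48-bit-to-52-bit mapping concrete, here's a minimal sketch (mine, not from any vendor doc) of how long mode splits a 48-bit virtual address into its 4-level paging indices; each level is a 9-bit table index, one more level than the Pentium Pro's 3-level PAE scheme:

```python
# Sketch: decompose an x86-64 long-mode 48-bit virtual address into
# its 4-level page-table indices (9 bits each) plus the 12-bit offset.
def decode_va(va: int):
    offset = va & 0xFFF            # bits 0-11: offset within a 4KiB page
    pt     = (va >> 12) & 0x1FF    # bits 12-20: Page Table index
    pd     = (va >> 21) & 0x1FF    # bits 21-29: Page Directory index
    pdpt   = (va >> 30) & 0x1FF    # bits 30-38: Page Directory Pointer Table index
    pml4   = (va >> 39) & 0x1FF    # bits 39-47: PML4 index
    return pml4, pdpt, pd, pt, offset

# 4 levels x 9 bits + 12-bit offset = 48 bits of flat virtual address;
# the physical addresses stored in the entries can be up to 52 bits (4PiB).
print(decode_va(0x0000_7FFF_FFFF_F000))  # (255, 511, 511, 511, 0)
```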
Windows users didn't see it, not even Windows Server Datacenter edition users, but our Linux servers with 128-512GiB RAM (yes, RAM ... in 2007; only AMD was capable of 512GiB at the time) ran into it. I'm under NDA, but the 5 Intel errata from the time are in the Linux kernel release notes. Most people were unaware, because it was just early adopters who needed >>64GiB RAM, like Wall Street, Hollywood, etc., who hit it. It made us quickly run back to AMD.
Now AMD had similar issues with Family 10h, as they moved to the full 48-bit (256TiB) platform addressing that x86-64 is capable of, from the prior 40-bit (1TiB) inherited from the Alpha 21264 lineage (all the way back to the original, 32-bit Athlon MP, which was really the platform prototype for the x86-64 Opteron). But unlike Intel, AMD held off releasing their multi-socket Family 10h parts when they discovered a new coherency issue, to much industry criticism.
But some in the media picked up the fact that Intel had major issues with its initial QPI multi-socket products. Being at a Wall Street customer, I got to deal with this, but couldn't say a word at the time.

Now AMD did create its own CPU/GPU/communication slot with HyperTransport Extension (HTX), and even had several custom, high-end visualization systems sporting unreal throughput for GPUs, as well as supercomputers with InfiniBand (after dealing with InfiniBand, you hate Ethernet). But since AMD cannot influence commodity OEMs of systems and accessories, most people never saw it. HyperTransport was designed by API NetWorks, who AMD eventually acquired, before buying ATI a couple years later. API stands for Alpha Processor, Inc. (API), the entity created to avoid anti-trust issues when Intel bought the Alpha assets from Digital in their "sell-off-a-thon" in the late '90s.
That's part of the reason why AMD had some of the brightest designers in the '00s, while Intel was surviving on its fabrication lead alone (which really boils down to cash for fab investment), and even then was losing to AMD designs fabbed on process technologies 2-3 years behind. I think at one point in the '90s, Digital Semiconductor owned 75% of the communication/networking and 50% of the system and peripheral interconnect IC market.
Today's AMD APU and System-on-a-Chip (SoC) designs do sport the ATI GPU directly on the HyperTransport interconnect, though. And this is where AMD is headed with their ARM-based products. They're largely aimed at the growing MicroServer market for now, because power and cooling are even more important (let alone far more profitable for AMD) in datacenters. But a future ARM-based console is not really a matter of 'if' but 'when', in a future revision of both the Microsoft and Sony consoles.
The AMD Jaguar/Puma-based x86-64 APUs (single packages with dual CPU modules) are currently in both consoles, including the recent refits. But that is another story.