i2som-imx-linux

Author	SHA1	Message	Date
Greg Kroah-Hartman	cf2586e60e	Linux 4.9.22	2017-04-12 12:41:42 +02:00
Matjaz Hegedic	7eae64f919	x86/reboot/quirks: Fix typo in ASUS EeeBook X205TA reboot quirk [ Upstream commit `bba8376aea` ] The reboot quirk for ASUS EeeBook X205TA contains a typo in DMI_PRODUCT_NAME, improperly referring to X205TAW instead of X205TA, which prevents the quirk from being triggered. The model X205TAW already has a reboot quirk of its own. This fix simply removes the inappropriate final letter W. Fixes: `90b28ded88` ("x86/reboot/quirks: Add ASUS EeeBook X205TA reboot quirk") Signed-off-by: Matjaz Hegedic <matjaz.hegedic@gmail.com> Link: http://lkml.kernel.org/r/1489064417-7445-1-git-send-email-matjaz.hegedic@gmail.com Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Tobias Jakobi	af11789a05	usb-storage: Add ignore-residue quirk for Initio INIC-3619 [ Upstream commit `d595259fbb` ] This USB-SATA bridge chip is used in a StarTech enclosure for optical drives. Without the quirk MakeMKV fails during the key exchange with an installed BluRay drive: > Error 'Scsi error - ILLEGAL REQUEST:COPY PROTECTION KEY EXCHANGE FAILURE - KEY NOT ESTABLISHED' > occurred while issuing SCSI command AD010..080002400 to device 'SG:dev_11:2' Signed-off-by: Tobias Jakobi <tjakobi@math.uni-bielefeld.de> Acked-by: Alan Stern <stern@rowland.harvard.edu> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Matjaz Hegedic	e5525c7419	x86/reboot/quirks: Add ASUS EeeBook X205TA/W reboot quirk [ Upstream commit `3b3e78552d` ] Without the parameter reboot=a, ASUS EeeBook X205TA/W will hang when it should reboot. This adds the appropriate quirk, thus fixing the problem. Signed-off-by: Matjaz Hegedic <matjaz.hegedic@gmail.com> Link: http://lkml.kernel.org/r/1488737804-20681-1-git-send-email-matjaz.hegedic@gmail.com Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Matjaz Hegedic	0605fff95d	x86/reboot/quirks: Add ASUS EeeBook X205TA reboot quirk [ Upstream commit `90b28ded88` ] Without the parameter reboot=a, ASUS EeeBook X205TA will hang when it should reboot. This adds the appropriate quirk, thus fixing the problem. Signed-off-by: Matjaz Hegedic <matjaz.hegedic@gmail.com> Cc: Andy Lutomirski <luto@kernel.org> Cc: Borislav Petkov <bp@alien8.de> Cc: Brian Gerst <brgerst@gmail.com> Cc: Denys Vlasenko <dvlasenk@redhat.com> Cc: H. Peter Anvin <hpa@zytor.com> Cc: Josh Poimboeuf <jpoimboe@redhat.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
João Paulo Rechi Vita	7c6b1ad9de	platform/x86: asus-wmi: Detect quirk_no_rfkill from the DSDT [ Upstream commit `71050ae7bf` ] Some Asus laptops that have an airplane-mode indicator LED, also have the WMI WLAN user bit set, and the following bits in their DSDT: Scope (_SB) { (...) Device (ATKD) { (...) Method (WMNB, 3, Serialized) { (...) If (LEqual (IIA0, 0x00010002)) { OWGD (IIA1) Return (One) } } } } So when asus-wmi uses ASUS_WMI_DEVID_WLAN_LED (0x00010002) to store the wlan state, it drives the airplane-mode indicator LED (through the call to OWGD) in an inverted fashion: the LED is ON when airplane mode is OFF (since wlan is ON), and vice-versa. This commit skips registering RFKill switches at all for these laptops, to allow the asus-wireless driver to drive the airplane mode LED correctly through the ASHS ACPI device. Relying on the presence of ASHS and ASUS_WMI_DSTS_USER_BIT avoids adding DMI-based quirks for at least 21 different laptops. Signed-off-by: João Paulo Rechi Vita <jprvita@endlessm.com> Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Kai-Chuan Hsieh	71f38c11cd	platform/x86: asus-wmi: Set specified XUSB2PR value for X550LB [ Upstream commit `8023eff10e` ] The bluetooth adapter Atheros AR3012 can't be enumerated and make the bluetooth function broken. T: Bus=02 Lev=01 Prnt=01 Port=05 Cnt=02 Dev#= 5 Spd=12 MxCh= 0 D: Ver= 1.10 Cls=e0(wlcon) Sub=01 Prot=01 MxPS=64 #Cfgs= 1 P: Vendor=13d3 ProdID=3362 Rev=00.02 S: Manufacturer=Atheros Communications S: Product=Bluetooth USB Host Controller S: SerialNumber=Alaska Day 2006 C: #Ifs= 2 Cfg#= 1 Atr=e0 MxPwr=100mA I: If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb I: If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb The error is: usb 2-6: device not accepting address 7, error -62 usb usb2-port6: unable to enumerate USB device It is caused by adapter's connected port is mapped to xHC controller, but the xHCI is not supported by the usb device. The output of 'sudo lspci -nnxxx -s 00:14.0': 00:14.0 USB controller [0c03]: Intel Corporation 8 Series USB xHCI HC [8086:9c31] (rev 04) 00: 86 80 31 9c 06 04 90 02 04 30 03 0c 00 00 00 00 10: 04 00 a0 f7 00 00 00 00 00 00 00 00 00 00 00 00 20: 00 00 00 00 00 00 00 00 00 00 00 00 43 10 1f 20 30: 00 00 00 00 70 00 00 00 00 00 00 00 0b 01 00 00 40: fd 01 36 80 89 c6 0f 80 00 00 00 00 00 00 00 00 50: 5f 2e ce 0f 00 00 00 00 00 00 00 00 00 00 00 00 60: 30 20 00 00 00 00 00 00 00 00 00 00 00 00 00 00 70: 01 80 c2 c1 08 00 00 00 00 00 00 00 00 00 00 00 80: 05 00 87 00 0c a0 e0 fe 00 00 00 00 a1 41 00 00 90: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 a0: 00 01 04 00 00 00 00 00 00 00 00 00 00 00 00 00 b0: 0f 00 03 00 00 00 00 00 00 00 00 00 00 00 00 00 c0: 03 c0 30 00 00 00 00 00 03 0c 00 00 00 00 00 00 d0: f9 01 00 00 f9 01 00 00 0f 00 00 00 0f 00 00 00 e0: 00 08 00 00 00 00 00 00 00 00 00 00 d8 d8 00 00 f0: 00 00 00 00 00 00 00 00 b1 0f 04 08 00 00 00 00 By referencing Intel Platform Controller Hub(PCH) datasheet, the xHC USB 2.0 Port Routing(XUSB2PR) at offset 0xD0-0xD3h decides the setting of mapping the port to EHCI controller or xHC controller. And the port mapped to xHC will enable xHCI during bus resume. The setting of disabling bluetooth adapter's connected port is 0x000001D9. The value can be obtained by few times 1 bit flip operation. The suited configuration should have the 'lsusb -t' result with bluetooth using ehci: /: Bus 03.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/4p, 5000M /: Bus 02.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/9p, 480M \|__ Port 5: Dev 2, If 0, Class=Video, Driver=uvcvideo, 480M \|__ Port 5: Dev 2, If 1, Class=Video, Driver=uvcvideo, 480M /: Bus 01.Port 1: Dev 1, Class=root_hub, Driver=ehci-pci/2p, 480M \|__ Port 1: Dev 2, If 0, Class=Hub, Driver=hub/8p, 480M \|__ Port 6: Dev 3, If 0, Class=Wireless, Driver=btusb, 12M \|__ Port 6: Dev 3, If 1, Class=Wireless, Driver=btusb, 12M Signed-off-by: Kai-Chuan Hsieh <kai.chiuan@gmail.com> Acked-by: Corentin Chary <corentin.chary@gmail.com> Reviewed-by: Andy Shevchenko <andy.shevchenko@gmail.com> [andy: resolve merge conflict in asus-wmi.h] Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Krzysztof Kozlowski	4dc1eb47fb	watchdog: s3c2410: Fix infinite interrupt in soft mode [ Upstream commit `0b445549ea` ] In soft (no-reboot) mode, the driver self-pings watchdog upon expiration of an interrupt. However the interrupt itself was not cleared thus on first hit, the system enters infinite interrupt handling loop. On Odroid U3 (Exynos4412), when booted with s3c2410_wdt.soft_noboot=1 argument the console is flooded: # killall -9 watchdog [ 60.523760] s3c2410-wdt 10060000.watchdog: watchdog timer expired (irq) [ 60.536744] s3c2410-wdt 10060000.watchdog: watchdog timer expired (irq) Fix this by writing something to the WTCLRINT register to clear the interrupt. The register WTCLRINT however appeared in S3C6410 so a new watchdog quirk and flavor are needed. Signed-off-by: Krzysztof Kozlowski <krzk@kernel.org> Reviewed-by: Guenter Roeck <linux@roeck-us.net> Signed-off-by: Guenter Roeck <linux@roeck-us.net> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Sinan Kaya	b18877ff66	PCI: Add ACS quirk for Qualcomm QDF2400 and QDF2432 [ Upstream commit `33be632b84` ] The Qualcomm QDF2xxx root ports don't advertise an ACS capability, but they do provide ACS-like features to disable peer transactions and validate bus numbers in requests. To be specific: * Hardware supports source validation but it will report the issue as Completer Abort instead of ACS Violation. * Hardware doesn't support peer-to-peer and each root port is a root complex with unique segment numbers. * It is not possible for one root port to pass traffic to the other root port. All PCIe transactions are terminated inside the root port. Add an ACS quirk for the QDF2400 and QDF2432 products. [bhelgaas: changelog] Signed-off-by: Sinan Kaya <okaya@codeaurora.org> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Alex Williamson <alex.williamson@redhat.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Andy Shevchenko	5feac34f67	PCI: Sort the list of devices with D3 delay quirk by ID [ Upstream commit `cd3e2eb890` ] Sort the list of Intel devices that have no PCI D3 delay by ID. Add a comment for group of devices that had not been marked yet. There is no functional change. Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
yangbo lu	6b69d1f644	mmc: sdhci-of-esdhc: remove default broken-cd for ARM [ Upstream commit `e9acc77dd0` ] Initially all QorIQ platforms were PowerPC architecture and they didn't support card detection except several platforms. The driver added the quirk SDHCI_QUIRK_BROKEN_CARD_DETECTION as default and this made broken-cd property in dts node didn't work. Now QorIQ platform turns to ARM architecture and most of them could support card detection. However it's a large number of dts trees that need to be fixed with broken-cd if we remove the default SDHCI_QUIRK_BROKEN_CARD_DETECTION in driver. And the users don't want to see this. So this patch is to remove this default quirk just for ARM and keep it for PowerPC.(Note, QorIQ PowerPC platform only has big-endian eSDHC while QorIQ ARM platform has big-endian or little-endian eSDHC) This makes broken-cd property work again for ARM. Signed-off-by: Yangbo Lu <yangbo.lu@nxp.com> Acked-by: Adrian Hunter <adrian.hunter@intel.com> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:22 +02:00
Dongdong Liu	0ddf07d2a1	PCI: Disable MSI for HiSilicon Hip06/Hip07 Root Ports [ Upstream commit `72f2ff0deb` ] The PCIe Root Port in Hip06/Hip07 SoCs advertises an MSI capability, but it cannot generate MSIs. It can transfer MSI/MSI-X from downstream devices, but does not support MSI/MSI-X itself. Add a quirk to prevent use of MSI/MSI-X by the Root Port. [bhelgaas: changelog, sort vendor ID #define, drop device ID #define] Signed-off-by: Dongdong Liu <liudongdong3@huawei.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Gabriele Paoloni <gabriele.paoloni@huawei.com> Reviewed-by: Zhou Wang <wangzhou1@hisilicon.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Jon Mason	28dd2313a6	PCI: Add Broadcom Northstar2 PAXC quirk for device class and MPSS [ Upstream commit `ce709f8650` ] The Broadcom Northstar2 SoC has a number of quirks for the PAXC (internal/fake) PCI bus. Specifically, the PCI config space is shared between the root port and the first PF (ie., PF0), and a number of fields are tied to zero (thus preventing them from being set). These cannot be "fixed" in device firmware, so we must fix them with a quirk. Signed-off-by: Jon Mason <jon.mason@broadcom.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Will Deacon	35b366d584	ARM: smccc: Update HVC comment to describe new quirk parameter [ Upstream commit `3046ec674d` ] Commit `680a0873e1` ("arm: kernel: Add SMC structure parameter") added a new "quirk" parameter to the SMC and HVC SMCCC backends, but only updated the comment for the SMC version. This patch adds the new paramater to the comment describing the HVC version too. Signed-off-by: Will Deacon <will.deacon@arm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Baoyou Xie	25bdb190ad	drm/msm/adreno: move function declarations to header file [ Upstream commit `a5725ab049` ] We get 2 warnings when building kernel with W=1: drivers/gpu/drm/msm/adreno/a3xx_gpu.c:535:17: warning: no previous prototype for 'a3xx_gpu_init' [-Wmissing-prototypes] drivers/gpu/drm/msm/adreno/a4xx_gpu.c:624:17: warning: no previous prototype for 'a4xx_gpu_init' [-Wmissing-prototypes] In fact, both functions are declared in drivers/gpu/drm/msm/adreno/adreno_device.c, but should be declared in a header file. So this patch moves both function declarations to drivers/gpu/drm/msm/adreno/adreno_gpu.h. Signed-off-by: Baoyou Xie <baoyou.xie@linaro.org> Reviewed-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch> Link: http://patchwork.freedesktop.org/patch/msgid/1477127865-9381-1-git-send-email-baoyou.xie@linaro.org Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Andy Gross	bec9918bb4	firmware: qcom: scm: Fix interrupted SCM calls [ Upstream commit `82bcd08702` ] This patch adds a Qualcomm specific quirk to the arm_smccc_smc call. On Qualcomm ARM64 platforms, the SMC call can return before it has completed. If this occurs, the call can be restarted, but it requires using the returned session ID value from the interrupted SMC call. The quirk stores off the session ID from the interrupted call in the quirk structure so that it can be used by the caller. This patch folds in a fix given by Sricharan R: https://lkml.org/lkml/2016/9/28/272 Signed-off-by: Andy Gross <andy.gross@linaro.org> Reviewed-by: Will Deacon <will.deacon@arm.com> Signed-off-by: Will Deacon <will.deacon@arm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Andy Gross	007f0a2f2c	arm: kernel: Add SMC structure parameter [ Upstream commit `680a0873e1` ] This patch adds a quirk parameter to the arm_smccc_(smc/hvc) calls. The quirk structure allows for specialized SMC operations due to SoC specific requirements. The current arm_smccc_(smc/hvc) is renamed and macros are used instead to specify the standard arm_smccc_(smc/hvc) or the arm_smccc_(smc/hvc)_quirk function. This patch and partial implementation was suggested by Will Deacon. Signed-off-by: Andy Gross <andy.gross@linaro.org> Reviewed-by: Will Deacon <will.deacon@arm.com> Signed-off-by: Will Deacon <will.deacon@arm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Ping Cheng	703f48a1c3	HID: wacom: don't apply generic settings to old devices [ Upstream commit `e7deb1570a` ] Non-generic devices have numbered_buttons set for both pen and touch interfaces by default. The actual number of buttons on the interface is normally manually decided later, which is different from what those HID generic devices are processed, where number of buttons are directly retrieved from HID descriptors. This patch adds the missed HID_GENERIC check and moves the statement to wacom_setup_pad_input_capabilities since it's not a quirk anymore. Signed-off-by: Ping Cheng <ping.cheng@wacom.com> Reviewed-by: Jason Gerecke <jason.gerecke@wacom.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Mylène Josserand	abb6408938	ASoC: sun4i-i2s: Add quirks to handle a31 compatible [ Upstream commit `2ad6f30de7` ] Some SoCs have a reset line that must be asserted/deasserted. This patch adds a quirk to handle the new compatible "allwinner,sun6i-a31-i2s" which will deassert the reset line on probe function and assert it on remove's one. This new compatible is useful in case of A33 codec driver, for example. Signed-off-by: Mylène Josserand <mylene.josserand@free-electrons.com> Acked-by: Maxime Ripard <maxime.ripard@free-electrons.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Zhang Rui	3d2f06d8d1	ACPI: save NVS memory for Lenovo G50-45 [ Upstream commit `cbc00c1310` ] In commit `821d6f0359` (ACPI / sleep: Do not save NVS for new machines to accelerate S3), to optimize S3 suspend/resume speed, code is introduced to ignore NVS memory saving during S3 for all the platforms later than 2012. But, Lenovo G50-45, a platform released in 2015, still needs NVS memory saving during S3. A quirk is introduced for this platform. Link: https://bugzilla.kernel.org/show_bug.cgi?id=189431 Tested-by: Przemek <soprwa@gmail.com> Signed-off-by: Zhang Rui <rui.zhang@intel.com> [ rjw: Drop unnecessary code ] Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Pierre-Louis Bossart	09f78f116c	ASoC: Intel: cht_bsw_rt5645: add Baytrail MCLK support [ Upstream commit `a50477e55f` ] The existing code assumes a 19.2 MHz MCLK as the default hardware configuration. This is valid for CherryTrail but not for Baytrail. Add explicit MCLK configuration to set the 19.2 clock on/off depending on DAPM events. This is a prerequisite step to enable devices with Baytrail and RT5645 such as Asus X205TA Signed-off-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:21 +02:00
Pierre-Louis Bossart	584f4318f9	ASoC: Intel: cht_bsw_rt5645: harden ACPI device detection [ Upstream commit `42648c2270` ] Fix classic issue of having multiple codecs listed in DSDT but a single one actually enabled. The previous code did not handle such errors and could also lead to uninitalized configurations Signed-off-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Pierre-Louis Bossart	4060a32e9d	ASoC: Intel: Baytrail: add quirk for Lenovo Thinkpad 10 [ Upstream commit `fd0138dc5d` ] the BIOS reports this codec as RT5640 but it's a rt5670. Use the quirk mechanism to use the cht_bsw_rt5672 machine driver Signed-off-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Pierre-Louis Bossart	d572cfb662	ASoC: codecs: rt5670: add quirk for Lenovo Thinkpad 10 [ Upstream commit `93ffeaa8ee` ] the BIOS incorrectly reports this codec as 5640 but it is really a rt5670 Signed-off-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Takashi Iwai	58153cca22	ASoC: rt5670: Add missing 10EC5072 ACPI ID [ Upstream commit `d252800608` ] rt5670 driver supports also RT5672 codec, but its ACPI ID is missing. This was found on Dell Wyse 3040 box. Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Lv Zheng	e4ae51e44b	ACPI / button: Change default behavior to lid_init_state=open [ Upstream commit `77e9a4aa9d` ] More and more platforms need the button.lid_init_state=open quirk. This patch sets it the default behavior. If a platform doesn't send lid open event or lid open event is lost due to the underlying system problems, then we can compare various combinations: 1. systemd/acpid is used to suspend system or not, systemd has a special logic forcing open event after resuming; 2. _LID returns a cached value or not. The result is as follows: 1. lid_init_state=method 1. cached 1. resumed by lid: (x) event=close (x) systemd=suspends again (x) acpid=suspends again (x) state=close 2. resumed by other: (o) event=close (x) systemd=suspends again (x) acpid=suspends again (o) state=close 2. non-cached 1. resumed by lid: (o) event=open (o) systemd=resumes (o) acpid=resumes (o) state=open 2. resumed by other: (o) event=close (x) systemd=suspends again (x) acpid=suspends again (o) state=close 2. lid_init_state=open 1. cached 1. resumed by lid: (o) event=open (o) systemd=resumes (o) acpid=resumes (x) state=close 2. resumed by other: (x) event=open (o) systemd=resumes (o) acpid=resumes (o) state=close 2. non-cached 1. resumed by lid: (o) event=open (o) systemd=resumes (o) acpid=resumes (o) state=open 2. resumed by other: (x) event=open (o) systemd=resumes (o) acpid=resumes (o) state=close 3. lid_init_state=ignore 1. cached 1. resumed by lid: (o) event=none (x) systemd=suspends again (o) acpid=resumes (x) state=close 2. resumed by other: (o) event=none (x) systemd=suspends again (o) acpid=resumes (o) state=close 2. non-cached 1. resumed by lid: (o) event=none (x) systemd=suspends again (o) acpid=resumes (o) state=open 2. resumed by other: (o) event=none (x) systemd=suspends again (o) acpid=resumes (o) state=close As a conclusion: 1. With systemd changed, lid_init_state=ignore has only one problem and the problem comes from an underlying issue, not userspace and kernel lid handling. 2. Without systemd changed, lid_init_state=open can be the default behavior as the pass ratio is not much worse than lid_init_state=ignore. 3. lid_init_state=method is buggy, we can have a separate patch to make it deprectated. Link: https://bugzilla.kernel.org/show_bug.cgi?id=187271 Signed-off-by: Lv Zheng <lv.zheng@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Bartosz Golaszewski	b369fd719f	sata: ahci-da850: implement a workaround for the softreset quirk [ Upstream commit `f4d435f326` ] There's an issue with the da850 SATA controller: if port multiplier support is compiled in, but we're connecting the drive directly to the SATA port on the board, the drive can't be detected. To make SATA work on the da850-lcdk board: first try to softreset with pmp - if the operation fails with -EBUSY, retry without pmp. Signed-off-by: Bartosz Golaszewski <bgolaszewski@baylibre.com> Acked-by: Tejun Heo <tj@kernel.org> Signed-off-by: Sekhar Nori <nsekhar@ti.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Alex Williamson	f36d3f1fe7	PCI: Add ACS quirk for Intel Union Point [ Upstream commit `7184f5b451` ] Intel 200-series chipsets have the same errata as 100-series: the ACS capability doesn't follow the PCIe spec, the capability and control registers are dwords rather than words. Add PCIe root port device IDs to existing quirk. Signed-off-by: Alex Williamson <alex.williamson@redhat.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Patrice Chotard	a452e4eb6e	ARM: dts: STiH407-family: set snps,dis_u3_susphy_quirk [ Upstream commit `8413299cb3` ] Since v4.10-rc1, the following logs appears in loop : [ 801.953836] usb usb6-port1: Cannot enable. Maybe the USB cable is bad? [ 801.960455] xhci-hcd xhci-hcd.0.auto: Cannot set link state. [ 801.966611] usb usb6-port1: cannot disable (err = -32) [ 806.083772] usb usb6-port1: Cannot enable. Maybe the USB cable is bad? [ 806.090370] xhci-hcd xhci-hcd.0.auto: Cannot set link state. [ 806.096494] usb usb6-port1: cannot disable (err = -32) After analysis, xhci try to set link in U3 and returns an error. Using snps,dis_u3_susphy_quirk fix this issue. Signed-off-by: Patrice Chotard <patrice.chotard@st.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Daniel Vetter	9ec57c921b	drm/mga: remove device_is_agp callback [ Upstream commit `858b2c1bf8` ] It's only for a device quirk, and we might as well do that in the load callback. Acked-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Daniel Vetter <daniel.vetter@intel.com> Link: http://patchwork.freedesktop.org/patch/msgid/20170125062657.19270-10-daniel.vetter@ffwll.ch Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:20 +02:00
Felipe Balbi	ae3a3e209e	usb: dwc3: host: pass quirk-broken-port-ped property for known broken revisions [ Upstream commit `e42a5dbb8a` ] dwc3 revisions <=3.00a have a limitation where Port Disable command doesn't work. Set the quirk-broken-port-ped property for such controllers so XHCI core can do the necessary workaround. [rogerq@ti.com] Updated code from platform data to device property. Signed-off-by: Roger Quadros <rogerq@ti.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
Felipe Balbi	7bdf7bebbb	usb: host: xhci-plat: enable BROKEN_PED quirk if platform requested [ Upstream commit `21939f003a` ] In case 'quirk-broken-port-ped' property is passed in via device property, we should enable the corresponding BROKEN_PED quirk flag for XHCI core. [rogerq@ti.com] Updated code from platform data to device property and added DT binding. Signed-off-by: Felipe Balbi <balbi@ti.com> Signed-off-by: Roger Quadros <rogerq@ti.com> Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
Felipe Balbi	15159247d2	usb: xhci: add quirk flag for broken PED bits [ Upstream commit `41135de1e7` ] Some devices from Texas Instruments [1] suffer from a silicon bug where Port Enabled/Disabled bit should not be used to silence an erroneous device. The bug is so that if port is disabled with PED bit, an IRQ for device removal (or attachment) will never fire. Just for the sake of completeness, the actual problem lies with SNPS USB IP and this affects all known versions up to 3.00a. A separate patch will be added to dwc3 to enabled this quirk flag if version is <= 3.00a. [1] - AM572x Silicon Errata http://www.ti.com/lit/er/sprz429j/sprz429j.pdf Section i896— USB xHCI Port Disable Feature Does Not Work Signed-off-by: Felipe Balbi <balbi@ti.com> Signed-off-by: Roger Quadros <rogerq@ti.com> Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
Kevin Hilman	660b38eab8	ARM: davinci: PM: support da8xx DT platforms [ Upstream commit `7e431af8fa` ] Add PM support for DA850 device-tree boot. Signed-off-by: Kevin Hilman <khilman@baylibre.com> Signed-off-by: Sekhar Nori <nsekhar@ti.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
Geert Uytterhoeven	db7c1706fa	Input: gpio_keys - add support for GPIO descriptors [ Upstream commit `5feeca3c1e` ] GPIO descriptors are the preferred way over legacy GPIO numbers nowadays. Convert the driver to use GPIO descriptors internally but still allow passing legacy GPIO numbers from platform data to support existing platforms. Based on commits `633a21d80b` ("input: gpio_keys_polled: Add support for GPIO descriptors") and `1ae5ddb6f8` ("Input: gpio_keys_polled - request GPIO pin as input."). Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be> Reviewed-by: Linus Walleij <linus.walleij@linaro.org> Tested-by: Mika Westerberg <mika.westerberg@linux.intel.com> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
Vignesh R	80b0d7e623	serial: 8250_omap: Add OMAP_DMA_TX_KICK quirk for AM437x [ Upstream commit `b6ffcf2108` ] UART uses as EDMA as dma engine on AM437x SoC and therefore, requires OMAP_DMA_TX_KICK quirk just like AM33xx. So, enable OMAP_DMA_TX_KICK quirk for AM437x platform as well. While at that, drop use of of_machine_is_compatible() and instead pass quirks via device data. Signed-off-by: Vignesh R <vigneshr@ti.com> Acked-by: Tony Lindgren <tony@atomide.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
Stephen Boyd	6b8deb1082	usb: chipidea: msm: Rely on core to override AHBBURST [ Upstream commit `dd3749099c` ] The core framework already handles setting this parameter with a platform quirk. Add the appropriate flag so that we always set AHBBURST to 0. Technically DT should be doing this, but we always do it for msm chipidea devices so setting the flag in the driver works just as well. If the burst needs to be anything besides 0, we expect the 'ahb-burst-config' dts property to be present. Acked-by: Peter Chen <peter.chen@nxp.com> Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Stephen Boyd <stephen.boyd@linaro.org> Signed-off-by: Peter Chen <peter.chen@nxp.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
subhashj@codeaurora.org	c5fc946a15	scsi: ufs: issue link starup 2 times if device isn't active [ Upstream commit `7caf489b99` ] If we issue the link startup to the device while its UniPro state is LinkDown (and device state is sleep/power-down) then link startup will not move the device state to Active. Device will only move to active state if the link starup is issued when its UniPro state is LinkUp. So in this case, we would have to issue the link startup 2 times to make sure that device moves to active state. Reviewed-by: Gilad Broner <gbroner@codeaurora.org> Signed-off-by: Subhash Jadavani <subhashj@codeaurora.org> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
Zang Leigang	a17bddc4a7	scsi: ufs: introduce a new ufshcd_statea UFSHCD_STATE_EH_SCHEDULED [ Upstream commit `141f816510` ] Add a new ufshcd_state, indicats that an err handler may get to run immediately. Use UFSHCD_STATE_ERROR here looks not literaly correct. Signed-off-by: Zang Leigang <zangleigang@hisilicon.com> Reviewed-by: Subhash Jadavani <subhashj@codeaurora.org> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:19 +02:00
youling257	e94ed34710	ASoC: Intel: bytcr_rt5640: quirks for Insyde devices [ Upstream commit `5718004878` ] There are literally dozens of Insyde devices with a different name but with the same audio routing. Use a generic quirk to match on vendor name only to avoid recurring edits of the same thing. Signed-off-by: youling257 <youling257@gmail.com> Signed-off-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Paulo Zanoni	0ed0810b8b	drm/i915: actually drive the BDW reserved IDs [ Upstream commit `98b2f01c8d` ] Back in 2014, commit `fb7023e0e2` ("drm/i915: BDW: Adding Reserved PCI IDs.") added the reserved PCI IDs in order to try to make sure we had working drivers in case we ever released products using these IDs (since we had instances of this type of problem in the past). The problem is that the patch only touched the macros used by early-quirks.c and by the user space components that rely on i915_pciids.h, it didn't touch the macros used by i915_pci.c. So we correctly handled the stolen memory for these theoretical IDs, but we didn't actually drive the devices from i915.ko. So this patch fixes the original commit by actually making i915.ko drive these IDs, which was the goal. There's no information on what would be the GT count on these IDs, so we just go with the safer intel_broadwell_info, at the risk of ignoring a possibly inexistent BSD2_RING. I did some checking, and it seems that these IDs are driven by intel-gpu-tools, xf86-video-intel and libdrm (since they contain old copies of i915_pciids.h), but they are not checked by mesa. The alternative to this patch would be to just assume we're actually never going to use these IDs, and then remove them from our ID lists and make sure our user space components sync the latest i915_pciids.h copy. I'm fine with either approaches, as long as we make sure that every component tries to drive the same list of PCI IDs. Fixes: `fb7023e0e2` ("drm/i915: BDW: Adding Reserved PCI IDs.") Cc: Rodrigo Vivi <rodrigo.vivi@intel.com> Cc: Ben Widawsky <ben@bwidawsk.net> Cc: Jani Nikula <jani.nikula@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Link: http://patchwork.freedesktop.org/patch/msgid/1483473860-17644-3-git-send-email-paulo.r.zanoni@intel.com Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Paulo Zanoni	70797929ee	drm/i915: more .is_mobile cleanups for BDW [ Upstream commit `0784bc624a` ] Commit `8d9c20e1d1` ("drm/i915: Remove .is_mobile field from platform struct") removed mobile vs desktop differences for HSW+, but forgot the Broadwell reserved IDs, so do it now. It's interesting to notice that these IDs are used by early-quirks.c but are not used by i915_pci.c. Cc: Carlos Santa <carlos.santa@intel.com> Cc: Rodrigo Vivi <rodrigo.vivi@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Link: http://patchwork.freedesktop.org/patch/msgid/1483473860-17644-2-git-send-email-paulo.r.zanoni@intel.com Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Paulo Zanoni	0b348464ee	drm/i915: fix INTEL_BDW_IDS definition [ Upstream commit `7fbd995ce4` ] Remove duplicated IDs from the list. Currently, this definition is only used by early-quirks.c. From my understanding of the code, having duplicated IDs shouldn't be causing any bugs. Fixes: `8d9c20e1d1` ("drm/i915: Remove .is_mobile field from platform struct") Cc: Carlos Santa <carlos.santa@intel.com> Cc: Rodrigo Vivi <rodrigo.vivi@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Link: http://patchwork.freedesktop.org/patch/msgid/1483473860-17644-1-git-send-email-paulo.r.zanoni@intel.com Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Jani Nikula	14ec1cf414	drm/edid: constify edid quirk list [ Upstream commit `23c4cfbdab` ] No reason not to be const. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Signed-off-by: Jani Nikula <jani.nikula@intel.com> Link: http://patchwork.freedesktop.org/patch/msgid/1482923186-22430-1-git-send-email-jani.nikula@intel.com Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Marcel Hasler	f188ee38d4	HID: usbhid: Add quirk for Mayflash/Dragonrise DolphinBar. [ Upstream commit `8aa2cc7e74` ] The DolphinBar by Mayflash (identified as Dragonrise) needs HID_QUIRK_MULTI_INPUT to split it up into four input devices. Without this quirk the adapter is falsely recognized as a tablet. See also bug 115841 (https://bugzilla.kernel.org/show_bug.cgi?id=115841). Signed-off-by: Marcel Hasler <mahasler@gmail.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Alex Wood	7fd75759ba	HID: usbhid: Add quirk for the Futaba TOSD-5711BB VFD [ Upstream commit `f83f90cf7b` ] The Futaba TOSD-5711BB VFD crashes when the initial HID report is requested, register the display in hid-ids and tell hid-quirks to not do the init. Signed-off-by: Alex Wood <thetewood@gmail.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Lv Zheng	c1e94148f9	ACPI / sysfs: Provide quirk mechanism to prevent GPE flooding [ Upstream commit `9c4aa1eecb` ] Sometimes, the users may require a quirk to be provided from ACPI subsystem core to prevent a GPE from flooding. Normally, if a GPE cannot be dispatched, ACPICA core automatically prevents the GPE from firing. But there are cases the GPE is dispatched by _Lxx/_Exx provided via AML table, and OSPM is lacking of the knowledge to get _Lxx/_Exx correctly executed to handle the GPE, thus the GPE flooding may still occur. The existing quirk mechanism can be enabled/disabled using the following commands to prevent such kind of GPE flooding during runtime: # echo mask > /sys/firmware/acpi/interrupts/gpe00 # echo unmask > /sys/firmware/acpi/interrupts/gpe00 To avoid GPE flooding during boot, we need a boot stage mechanism. This patch provides such a boot stage quirk mechanism to stop this kind of GPE flooding. This patch doesn't fix any feature gap but since the new feature gaps could be found in the future endlessly, and can disappear if the feature gaps are filled, providing a boot parameter rather than a DMI table should suffice. Link: https://bugzilla.kernel.org/show_bug.cgi?id=53071 Link: https://bugzilla.kernel.org/show_bug.cgi?id=117481 Link: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/887793 Signed-off-by: Lv Zheng <lv.zheng@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Keith Busch	43cfff65c9	nvme: simplify stripe quirk [ Upstream commit `e6282aef7b` ] Some OEMs believe they own the Identify Controller vendor specific region and will repurpose it with their own values. While not common, we can't rely on the PCI VID:DID to tell use how to decode the field we reserved for this as the stripe size so we need to do something else for the list of devices using this quirk. The field was supposed to allow flexibility on the device's back-end striping, but it turned out that never materialized; the chunk is always the same as MDTS in the products subscribing to this quirk, so this patch removes the stripe_size field and sets the chunk to the max hw transfer size for the devices using this quirk. Signed-off-by: Keith Busch <keith.busch@intel.com> Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:18 +02:00
Lee, Chun-Yi	8d620dff40	platform/x86: acer-wmi: Only supports AMW0_GUID1 on acer family [ Upstream commit `5241b1938a` ] The AMW0_GUID1 wmi is not only found on Acer family but also other machines like Lenovo, Fujitsu and Medion. In the past, acer-wmi handled those non-Acer machines by quirks list. But actually acer-wmi driver was loaded on any machine that had AMW0_GUID1. This behavior is strange because those machines should be supported by appropriate wmi drivers. e.g. fujitsu-laptop, ideapad-laptop. This patch adds the logic to check the machine that has AMW0_GUID1 should be in Acer/Packard Bell/Gateway white list. But, it still keeps the quirk list of those supported non-acer machines for backward compatibility. Tested-by: Bjørn Mork <bjorn@mork.no> Signed-off-by: Lee, Chun-Yi <jlee@suse.com> Signed-off-by: Darren Hart <dvhart@linux.intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Nobutaka Okabe	2bd6cc1cce	ALSA: usb-audio: Add native DSD support for TEAC 501/503 DAC [ Upstream commit `7f38ca047b` ] This patch adds native DSD support for the following devices. - TEAC NT-503 - TEAC UD-503 - TEAC UD-501 (1) Add quirks for native DSD support for TEAC devices. (2) A specific vendor command is needed to switch between PCM/DOP and DSD mode, same as Denon/Marantz devices. Signed-off-by: Nobutaka Okabe <nob77413@gmail.com> Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Bjorn Helgaas	8ae7242fea	PCI: Expand "VPD access disabled" quirk message [ Upstream commit `044bc425bb` ] It's not very enlightening to see pci 0000:07:00.0: [Firmware Bug]: VPD access disabled in the dmesg log because there's no clue about what the firmware bug is. Expand the message to explain why we're disabling VPD. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Alberto Aguirre	c0aac1bbb5	ALSA: usb-audio: add implicit fb quirk for Axe-Fx II [ Upstream commit `17f08b0d9a` ] The Axe-Fx II implicit feedback end point and the data sync endpoint are in different interface descriptors. Add quirk to ensure a sync endpoint is properly configured. Signed-off-by: Alberto Aguirre <albaguirre@gmail.com> Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Subhash Jadavani	bedc629494	scsi: ufs: add quirk to increase host PA_SaveConfigTime [ Upstream commit `56d4a1866d` ] The maximum value PA_SaveConfigTime is 250 (10us) but this is not enough for some vendors. Gear switch from PWM to HS may fail even with this max. PA_SaveConfigTime. Gear switch can be issued by host controller as an error recovery and any software delay will not help on this case so we need to increase PA_SaveConfigTime to >32us as per vendor recommendation. This change adds a quirk to increase the PA_SaveConfigTime parameter. Reviewed-by: Venkat Gopalakrishnan <venkatg@codeaurora.org> Signed-off-by: Subhash Jadavani <subhashj@codeaurora.org> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Bjorn Helgaas	5c6b8ad1ab	PCI: thunder-pem: Factor out resource lookup [ Upstream commit `0d414268fb` ] Pull the register resource lookup out of thunder_pem_init() so we can easily add a corresponding lookup using ACPI. No functional change intended. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Bjorn Helgaas	5582c1980e	arm64: PCI: Add local struct device pointers [ Upstream commit `dfd1972c2b` ] Use a local "struct device *dev" for brevity. No functional change intended. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Acked-by: Lorenzo Pieralisi <lorenzo.pieralisi@arm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Tomasz Nowicki	cc49b39f3f	arm64: PCI: Manage controller-specific data on per-controller basis [ Upstream commit `093d24a204` ] Currently we use one shared global acpi_pci_root_ops structure to keep controller-specific ops. We pass its pointer to acpi_pci_root_create() and associate it with a host bridge instance for good. Such a design implies serious drawback. Any potential manipulation on the single system-wide acpi_pci_root_ops leads to kernel crash. The structure content is not really changing even across multiple host bridges creation; thus it was not an issue so far. In preparation for adding ECAM quirks mechanism (where controller-specific PCI ops may be different for each host bridge) allocate new acpi_pci_root_ops and fill in with data for each bridge. Now it is safe to have different controller-specific info. As a consequence free acpi_pci_root_ops when host bridge is released. No functional changes in this patch. Signed-off-by: Tomasz Nowicki <tn@semihalf.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Acked-by: Lorenzo Pieralisi <lorenzo.pieralisi@arm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Andy Shevchenko	625fd9d1af	x86/intel_idle: Add CPU model 0x4a (Atom Z34xx series) [ Upstream commit `5e7ec268fd` ] Add CPU ID for Atom Z34xx processors. Datasheets indicate support for this, detailed information about potential quirks or limitations are missing, though. So we just reuse the definition from official BSP code. Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com> Signed-off-by: Len Brown <len.brown@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Chuck Lever	607ca1dccb	svcauth_gss: Close connection when dropping an incoming message [ Upstream commit `4d712ef1db` ] S5.3.3.1 of RFC 2203 requires that an incoming GSS-wrapped message whose sequence number lies outside the current window is dropped. The rationale is: The reason for discarding requests silently is that the server is unable to determine if the duplicate or out of range request was due to a sequencing problem in the client, network, or the operating system, or due to some quirk in routing, or a replay attack by an intruder. Discarding the request allows the client to recover after timing out, if indeed the duplication was unintentional or well intended. However, clients may rely on the server dropping the connection to indicate that a retransmit is needed. Without a connection reset, a client can wait forever without retransmitting, and the workload just stops dead. I've reproduced this behavior by running xfstests generic/323 on an NFSv4.0 mount with proto=rdma and sec=krb5i. To address this issue, have the server close the connection when it silently discards an incoming message due to a GSS sequence number problem. There are a few other places where the server will never reply. Change those spots in a similar fashion. Signed-off-by: Chuck Lever <chuck.lever@oracle.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Subhash Jadavani	3a87bcdebf	scsi: ufs: ensure that host pa_tactivate is higher than device [ Upstream commit `c6a6db4398` ] Some UFS devices require host PA_TACTIVATE to be higher than device PA_TACTIVATE otherwise it may get stuck during hibern8 sequence. This change allows this by using quirk. Reviewed-by: Venkat Gopalakrishnan <venkatg@codeaurora.org> Signed-off-by: Subhash Jadavani <subhashj@codeaurora.org> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:17 +02:00
Ritesh Harjani	d84be51d1c	mmc: sdhci-msm: Enable few quirks [ Upstream commit `a0e3142869` ] sdhc-msm controller needs this SDHCI_QUIRK_CAP_CLOCK_BASE_BROKEN & SDHCI_QUIRK2_PRESET_VALUE_BROKEN to be set. Hence setting it. Signed-off-by: Sahitya Tummala <stummala@codeaurora.org> Signed-off-by: Ritesh Harjani <riteshh@codeaurora.org> Acked-by: Adrian Hunter <adrian.hunter@intel.com> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Benjamin Tissoires	c6e3c6628d	HID: multitouch: do not retrieve all reports for all devices [ Upstream commit `b897f6db3a` ] We already have in place a quirk for Windows 8 devices, but it looks like the Surface Cover are not conforming to it. Given that we are only interested in 3 feature reports (the ones that the Windows driver retrieves), we should be safe to unconditionally apply the quirk to everybody. In case there is an issue with a controller, we can always mark it as such in the transport driver, and hid-multitouch won't try to retrieve the feature report. Signed-off-by: Benjamin Tissoires <benjamin.tissoires@redhat.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Benjamin Tissoires	68a83be381	HID: multitouch: enable the Surface 3 Type Cover to report multitouch data [ Upstream commit `8fe89ef076` ] There is no reasons to filter out keyboard and consumer control collections in hid-multitouch. With the previous hid-input fix, there is now a full support of the Type Cover and we can remove all specific bits from hid-core and hid-microsoft. hid-multitouch will automatically set HID_QUIRK_NO_INIT_REPORTS so we can also remove it from the list of ushbid quirks. Signed-off-by: Benjamin Tissoires <benjamin.tissoires@redhat.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Benjamin Tissoires	c5fcc6332d	HID: sensor-hub: add quirk for Microchip MM7150 [ Upstream commit `5cc5084dd9` ] One more device requiring a quirk :/ Reported-by: Christian-Nils Boda <christian-nils.boda@gadz.org> Signed-off-by: Benjamin Tissoires <benjamin.tissoires@redhat.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Benjamin Tissoires	39f3c9291d	HID: sensor-hub add quirk for Microsoft Surface 3 [ Upstream commit `da809197a9` ] One more device requiring a quirk :/ [jkosina@suse.cz: update comment based on Bastien's remark] Signed-off-by: Benjamin Tissoires <benjamin.tissoires@redhat.com> Tested-by: Bastien Nocera <hadess@hadess.net> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Kiwoong Kim	9b41ed79ec	scsi: ufs: introduce UFSHCD_QUIRK_PRDT_BYTE_GRAN quirk [ Upstream commit `75b1cc4ad6` ] Some UFS host controllers may think granularities of PRDT length and offset as bytes, not double words. Signed-off-by: Kiwoong Kim <kwmad.kim@samsung.com> Reviewed-by: Subhash Jadavani <subhashj@codeaurora.org> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Brian Norris	26cbe162df	clocksource/drivers/arm_arch_timer: Don't assume clock runs in suspend [ Upstream commit `d8ec7595a0` ] The ARM specifies that the system counter "must be implemented in an always-on power domain," and so we try to use the counter as a source of timekeeping across suspend/resume. Unfortunately, some SoCs (e.g., Rockchip's RK3399) do not keep the counter ticking properly when switched from their high-power clock to the lower-power clock used in system suspend. Support this quirk by adding a new device tree property. Signed-off-by: Brian Norris <briannorris@chromium.org> Reviewed-by: Douglas Anderson <dianders@chromium.org> Acked-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Daniel Lezcano <daniel.lezcano@linaro.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Bjorn Helgaas	d2f1000c3a	net/mlx4_core: Use device ID defines [ Upstream commit `c19e4b9037` ] We added a bunch of new Mellanox device ID definitions because they'll be used by INTx quirks. Use them in the mlx4 ID table also so grep can find both places. No functional change intended. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Acked-by: Tariq Toukan <tariqt@mellanox.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
John Garry	dcc4c67581	arm64: dts: hisi: fix hip06 sas am-max-trans quirk [ Upstream commit `f65e786604` ] The string for the am max transmissions quirk property is not correct -> fix it. Signed-off-by: John Garry <john.garry@huawei.com> Reviewed-by: Xiang Chen <chenxiang66@hisilicon.com> Signed-off-by: Wei Xu <xuwei5@hisilicon.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Pierre-Louis Bossart	b340c9648c	ASoC: Intel: bytct_rt5640: change default capture settings [ Upstream commit `bf46241bee` ] Most Baytrail-CR devices use analog differential microphones, modify capture default to avoid DMI quirks. Keep digital mics for all other configurations. Signed-off-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:16 +02:00
Janusz Dziedzic	d9a97dcdf5	usb: dwc3: gadget: delay unmap of bounced requests commit `de288e36fe` upstream. In the case of bounced ep0 requests, we must delay DMA operation until after ->complete() otherwise we might overwrite contents of req->buf. This caused problems with RNDIS gadget. Signed-off-by: Janusz Dziedzic <januszx.dziedzic@intel.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
HungNien Chen	ee670af5fe	HID: i2c-hid: add a simple quirk to fix device defects [ Upstream commit `71af01a8c8` ] Certain devices produced by Weida Tech need to have a wakeup command sent to them before powering on. The call itself will come back with error, but the device can be powered on afterwards. [jkosina@suse.cz: rewrite changelog] [jkosina@suse.cz: remove unused device ID addition] Signed-off-by: HungNien Chen <hn.chen@weidahitech.com> Reviewed-by: Benjamin Tissoires <benjamin.tissoires@redhat.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Marcel Hasler	d640c41bce	HID: usbhid: Add quirks for Mayflash/Dragonrise GameCube and PS3 adapters [ Upstream commit `b2554000f5` ] All known gamepad adapters by Mayflash (identified as Dragonrise) need HID_QUIRK_MULTI_INPUT to split them up into four input devices. Without this quirk those adapters are falsely recognized as tablets. Fixes bug 115841 (https://bugzilla.kernel.org/show_bug.cgi?id=115841). Signed-off-by: Marcel Hasler <mahasler@gmail.com> Reviewed-by: Benjamin Tissoires <benjamin.tissoires@redhat.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Vladimir Zapolskiy	e02a5d1d5a	clk: lpc32xx: add a quirk for PWM and MS clock dividers [ Upstream commit `f84d42a9cf` ] In common clock framework CLK_DIVIDER_ONE_BASED or'ed with CLK_DIVIDER_ALLOW_ZERO flags indicates that 1) a divider clock may be set to zero value, 2) divider's zero value is interpreted as a non-divided clock. On the LPC32xx platform clock dividers of PWM and memory card clocks comply with the first condition, but zero value means a gated clock, thus it may happen that the divider value is not updated when the clock is enabled and the clock remains gated. The change adds one-shot quirks, which check for zero value of divider on initialization and set it to a non-zero value, therefore in runtime a gate clock will work as expected. Signed-off-by: Vladimir Zapolskiy <vz@mleia.com> Reviewed-by: Sylvain Lemieux <slemieux.tyco@gmail.com> Signed-off-by: Stephen Boyd <sboyd@codeaurora.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Chen-Yu Tsai	666d5f34d8	drm/sun4i: Add compatible string for A31/A31s TCON (timing controller) commit `93a5ec14da` upstream. The A31 TCON has mux controls for how TCON outputs are routed to the HDMI and MIPI DSI blocks. Since the A31s does not have MIPI DSI, it only has a mux for the HDMI controller input. This patch only adds support for the compatible strings. Actual support for the mux controls should be added with HDMI and MIPI DSI support. Signed-off-by: Chen-Yu Tsai <wens@csie.org> Signed-off-by: Maxime Ripard <maxime.ripard@free-electrons.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Chen-Yu Tsai	6af7e1cae4	drm/sun4i: Add compatible strings for A31/A31s display pipelines commit `49c440e87c` upstream. The A31's display pipeline has 2 frontends, 2 backends, and 2 TCONs. It also has new display enhancement blocks, such as the DRC (Dynamic Range Controller), the DEU (Display Enhancement Unit), and the CMU (Color Management Unit). It supports HDMI, MIPI DSI, and 2 LCD/LVDS channels. The A31s display pipeline is almost the same, just without MIPI DSI. Only the TCON seems to be different, due to the missing mux for MIPI DSI. Add compatible strings for both of them. Signed-off-by: Chen-Yu Tsai <wens@csie.org> Acked-by: Rob Herring <robh@kernel.org> Signed-off-by: Maxime Ripard <maxime.ripard@free-electrons.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Chen-Yu Tsai	06a2bb472f	drm/sun4i: tcon: Move SoC specific quirks to a DT matched data structure commit `91ea2f29cb` upstream. We already have some differences between the 2 supported SoCs. More will be added as we support other SoCs. To avoid bloating the probe function with even more conditionals, move the quirks to a separate data structure that's tied to the compatible string. Signed-off-by: Chen-Yu Tsai <wens@csie.org> Signed-off-by: Maxime Ripard <maxime.ripard@free-electrons.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Jason A. Donenfeld	7c03613344	random: use chacha20 for get_random_int/long commit `f5b98461cb` upstream. Now that our crng uses chacha20, we can rely on its speedy characteristics for replacing MD5, while simultaneously achieving a higher security guarantee. Before the idea was to use these functions if you wanted random integers that aren't stupidly insecure but aren't necessarily secure either, a vague gray zone, that hopefully was "good enough" for its users. With chacha20, we can strengthen this claim, since either we're using an rdrand-like instruction, or we're using the same crng as /dev/urandom. And it's faster than what was before. We could have chosen to replace this with a SipHash-derived function, which might be slightly faster, but at the cost of having yet another RNG construction in the kernel. By moving to chacha20, we have a single RNG to analyze and verify, and we also already get good performance improvements on all platforms. Implementation-wise, rather than use a generic buffer for both get_random_int/long and memcpy based on the size needs, we use a specific buffer for 32-bit reads and for 64-bit reads. This way, we're guaranteed to always have aligned accesses on all platforms. While slightly more verbose in C, the assembly this generates is a lot simpler than otherwise. Finally, on 32-bit platforms where longs and ints are the same size, we simply alias get_random_int to get_random_long. Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com> Suggested-by: Theodore Ts'o <tytso@mit.edu> Cc: Theodore Ts'o <tytso@mit.edu> Cc: Hannes Frederic Sowa <hannes@stressinduktion.org> Cc: Andy Lutomirski <luto@amacapital.net> Signed-off-by: Theodore Ts'o <tytso@mit.edu> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Chris Salls	cddab768d1	mm/mempolicy.c: fix error handling in set_mempolicy and mbind. commit `cf01fb9985` upstream. In the case that compat_get_bitmap fails we do not want to copy the bitmap to the user as it will contain uninitialized stack data and leak sensitive data. Signed-off-by: Chris Salls <salls@cs.ucsb.edu> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Johan Hovold	24317cb6b4	Documentation: stable-kernel-rules: fix stable-tag format commit `cf903e9d3a` upstream. A patch documenting how to specify which kernels a particular fix should be backported to (seemingly) inadvertently added a minus sign after the kernel version. This particular stable-tag format had never been used prior to this patch, and was neither present when the patch in question was first submitted (it was added in v2 without any comment). Drop the minus sign to avoid any confusion. Fixes: `fdc81b7910` ("stable_kernel_rules: Add clause about specification of kernel versions to patch.") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Huacai Chen	be56f92cc0	MIPS: c-r4k: Fix Loongson-3's vcache/scache waysize calculation commit `0be032c190` upstream. If scache.waysize is 0, r4k___flush_cache_all() will do nothing and then cause bugs. BTW, though vcache.waysize isn't being used by now, we also fix its calculation. Signed-off-by: Huacai Chen <chenhc@lemote.com> Cc: John Crispin <john@phrozen.org> Cc: Steven J . Hill <Steven.Hill@caviumnetworks.com> Cc: Fuxin Zhang <zhangfx@lemote.com> Cc: Zhangjin Wu <wuzhangjin@gmail.com> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15756/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:15 +02:00
Huacai Chen	59b8725f2b	MIPS: Flush wrong invalid FTLB entry for huge page commit `0115f6cbf2` upstream. On VTLB+FTLB platforms (such as Loongson-3A R2), FTLB's pagesize is usually configured the same as PAGE_SIZE. In such a case, Huge page entry is not suitable to write in FTLB. Unfortunately, when a huge page is created, its page table entries haven't created immediately. Then the TLB refill handler will fetch an invalid page table entry which has no "HUGE" bit, and this entry may be written to FTLB. Since it is invalid, TLB load/store handler will then use tlbwi to write the valid entry at the same place. However, the valid entry is a huge page entry which isn't suitable for FTLB. Our solution is to modify build_huge_handler_tail. Flush the invalid old entry (whether it is in FTLB or VTLB, this is in order to reduce branches) and use tlbwr to write the valid new entry. Signed-off-by: Rui Wang <wangr@lemote.com> Signed-off-by: Huacai Chen <chenhc@lemote.com> Cc: John Crispin <john@phrozen.org> Cc: Steven J . Hill <Steven.Hill@caviumnetworks.com> Cc: Fuxin Zhang <zhangfx@lemote.com> Cc: Zhangjin Wu <wuzhangjin@gmail.com> Cc: Huacai Chen <chenhc@lemote.com> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15754/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Huacai Chen	186fb3c52e	MIPS: Add MIPS_CPU_FTLB for Loongson-3A R2 commit `033cffeedb` upstream. Loongson-3A R2 and newer CPU have FTLB, but Config0.MT is 1, so add MIPS_CPU_FTLB to the CPU options. Signed-off-by: Huacai Chen <chenhc@lemote.com> Cc: John Crispin <john@phrozen.org> Cc: Steven J . Hill <Steven.Hill@caviumnetworks.com> Cc: Fuxin Zhang <zhangfx@lemote.com> Cc: Zhangjin Wu <wuzhangjin@gmail.com> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15752/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Huacai Chen	3d5e13d891	MIPS: Check TLB before handle_ri_rdhwr() for Loongson-3 commit `5a34133167` upstream. Loongson-3's micro TLB (ITLB) is not strictly a subset of JTLB. That means: when a JTLB entry is replaced by hardware, there may be an old valid entry exists in ITLB. So, a TLB miss exception may occur while handle_ri_rdhwr() is running because it try to access EPC's content. However, handle_ri_rdhwr() doesn't clear EXL, which makes a TLB Refill exception be treated as a TLB Invalid exception and tlbp may fail. In this case, if FTLB (which is usually set-associative instead of set- associative) is enabled, a tlbp failure will cause an invalid tlbwi, which will hang the whole system. This patch rename handle_ri_rdhwr_vivt to handle_ri_rdhwr_tlbp and use it for Loongson-3. It try to solve the same problem described as below, but more straightforwards. https://patchwork.linux-mips.org/patch/12591/ I think Loongson-2 has the same problem, but it has no FTLB, so we just keep it as is. Signed-off-by: Huacai Chen <chenhc@lemote.com> Cc: Rui Wang <wangr@lemote.com> Cc: John Crispin <john@phrozen.org> Cc: Steven J . Hill <Steven.Hill@caviumnetworks.com> Cc: Fuxin Zhang <zhangfx@lemote.com> Cc: Zhangjin Wu <wuzhangjin@gmail.com> Cc: Huacai Chen <chenhc@lemote.com> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15753/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Hauke Mehrtens	c7f6633f76	MIPS: Lantiq: fix missing xbar kernel panic commit `6ef90877ee` upstream. Commit `08b3c894e5` ("MIPS: lantiq: Disable xbar fpi burst mode") accidentally requested the resources from the pmu address region instead of the xbar registers region, but the check for the return value of request_mem_region() was wrong. Commit `98ea51cb0c` ("MIPS: Lantiq: Fix another request_mem_region() return code check") fixed the check of the return value of request_mem_region() which made the kernel panics. This patch now makes use of the correct memory region for the cross bar. Fixes: `08b3c894e5` ("MIPS: lantiq: Disable xbar fpi burst mode") Signed-off-by: Hauke Mehrtens <hauke@hauke-m.de> Cc: John Crispin <john@phrozen.org> Cc: james.hogan@imgtec.com Cc: arnd@arndb.de Cc: sergei.shtylyov@cogentembedded.com Cc: john@phrozen.org Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15751 Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Paul Burton	1e7deb9da0	MIPS: End spinlocks with .insn commit `4b5347a24a` upstream. When building for microMIPS we need to ensure that the assembler always knows that there is code at the target of a branch or jump. Recent toolchains will fail to link a microMIPS kernel when this isn't the case due to what it thinks is a branch to non-microMIPS code. mips-mti-linux-gnu-ld kernel/built-in.o: .spinlock.text+0x2fc: Unsupported branch between ISA modes. mips-mti-linux-gnu-ld final link failed: Bad value This is due to inline assembly labels in spinlock.h not being followed by an instruction mnemonic, either due to a .subsection pseudo-op or the end of the inline asm block. Fix this with a .insn direction after such labels. Signed-off-by: Paul Burton <paul.burton@imgtec.com> Signed-off-by: James Hogan <james.hogan@imgtec.com> Reviewed-by: Maciej W. Rozycki <macro@imgtec.com> Cc: Ralf Baechle <ralf@linux-mips.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Ingo Molnar <mingo@redhat.com> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15325/ Signed-off-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
John Crispin	19aa26f5be	MIPS: ralink: Fix typos in rt3883 pinctrl commit `7c5a3d8130` upstream. There are two copy & paste errors in the definition of the 5GHz LNA and second ethernet pinmux. Fixes: `f576fb6a07` ("MIPS: ralink: cleanup the soc specific pinmux data") Signed-off-by: John Crispin <john@phrozen.org> Signed-off-by: Daniel Golle <daniel@makrotopia.org> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15328/ Signed-off-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
James Hogan	9dcb21e63b	MIPS: Force o32 fp64 support on 32bit MIPS64r6 kernels commit `2e6c774773` upstream. When a 32-bit kernel is configured to support MIPS64r6 (CPU_MIPS64_R6), MIPS_O32_FP64_SUPPORT won't be selected as it should be because MIPS32_O32 is disabled (o32 is already the default ABI available on 32-bit kernels). This results in userland FP breakage as CP0_Status.FR is read-only 1 since r6 (when an FPU is present) so __enable_fpu() will fail to clear FR. This causes the FPU emulator to get used which will incorrectly emulate 32-bit FPU registers. Force o32 fp64 support in this case by also selecting MIPS_O32_FP64_SUPPORT from CPU_MIPS64_R6 if 32BIT. Fixes: `4e9d324d42` ("MIPS: Require O32 FP64 support for MIPS64 with O32 compat") Signed-off-by: James Hogan <james.hogan@imgtec.com> Reviewed-by: Paul Burton <paul.burton@imgtec.com> Cc: Ralf Baechle <ralf@linux-mips.org> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15310/ Signed-off-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Heiko Carstens	7b68273f79	s390/uaccess: get_user() should zero on failure (again) commit `d09c5373e8` upstream. Commit `fd2d2b191f` ("s390: get_user() should zero on failure") intended to fix s390's get_user() implementation which did not zero the target operand if the read from user space faulted. Unfortunately the patch has no effect: the corresponding inline assembly specifies that the operand is only written to ("=") and the previous value is discarded. Therefore the compiler is free to and actually does omit the zero initialization. To fix this simply change the contraint modifier to "+", so the compiler cannot omit the initialization anymore. Fixes: `c9ca78415a` ("s390/uaccess: provide inline variants of get_user/put_user") Fixes: `fd2d2b191f` ("s390: get_user() should zero on failure") Cc: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Heiko Carstens <heiko.carstens@de.ibm.com> Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Marcelo Henrique Cerri	f4938792af	s390/decompressor: fix initrd corruption caused by bss clear commit `d82c0d12c9` upstream. Reorder the operations in decompress_kernel() to ensure initrd is moved to a safe location before the bss section is zeroed. During decompression bss can overlap with the initrd and this can corrupt the initrd contents depending on the size of the compressed kernel (which affects where the initrd is placed by the bootloader) and the size of the bss section of the decompressor. Also use the correct initrd size when checking for overlaps with parmblock. Fixes: `06c0dd72ae` ([S390] fix boot failures with compressed kernels) Reviewed-by: Joy Latten <joy.latten@canonical.com> Reviewed-by: Vineetha HariPai <vineetha.hari.pai@canonical.com> Signed-off-by: Marcelo Henrique Cerri <marcelo.cerri@canonical.com> Signed-off-by: Heiko Carstens <heiko.carstens@de.ibm.com> Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Max Filippov	6a776f6ae3	xtensa: make __pa work with uncached KSEG addresses commit `2b83878dd7` upstream. When __pa is applied to virtual address in uncached KSEG region the result is incorrect. Fix it by checking if the original address is in the uncached KSEG and adjusting the result. It looks better than masking off bits because pfn_valid would correctly work with new __pa results and it may be made working in noMMU case, once we get definition for uncached memory view. This is required for the dma_common_mmap and DMA debug code to work correctly: they both indirectly use __pa with coherent DMA addresses. In case of DMA debug the visible effect is false reports that an address mapped for DMA is accessed by CPU. Tested-by: Boris Brezillon <boris.brezillon@free-electrons.com> Reviewed-by: Boris Brezillon <boris.brezillon@free-electrons.com> Signed-off-by: Max Filippov <jcmvbkbc@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Tobias Klauser	c1dcea1236	nios2: reserve boot memory for device tree commit `921d701e6f` upstream. Make sure to reserve the boot memory for the flattened device tree. Otherwise it might get overwritten, e.g. when initial_boot_params is copied, leading to a corrupted FDT and a boot hang/crash: bootconsole [early0] enabled Early console on uart16650 initialized at 0xf8001600 OF: fdt: Error -11 processing FDT Kernel panic - not syncing: setup_cpuinfo: No CPU found in devicetree! ---[ end Kernel panic - not syncing: setup_cpuinfo: No CPU found in devicetree! Guenter Roeck says: > I think I found the problem. In unflatten_and_copy_device_tree(), with added > debug information: > > OF: fdt: initial_boot_params=c861e400, dt=c861f000 size=28874 (0x70ca) > > ... and then initial_boot_params is copied to dt, which results in corrupted > fdt since the memory overlaps. Looks like the initial_boot_params memory > is not reserved and (re-)allocated by early_init_dt_alloc_memory_arch(). Reported-by: Guenter Roeck <linux@roeck-us.net> Reference: http://lkml.kernel.org/r/20170226210338.GA19476@roeck-us.net Tested-by: Guenter Roeck <linux@roeck-us.net> Signed-off-by: Tobias Klauser <tklauser@distanz.ch> Acked-by: Ley Foon Tan <ley.foon.tan@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:14 +02:00
Dmitry Bilunov	27d382fb6a	dm raid: fix NULL pointer dereference for raid1 without bitmap commit `7a0c5c5b83` upstream. Commit `4257e08` ("dm raid: support to change bitmap region size") introduced a bitmap resize call during preresume phase. User can create a DM device with "raid" target configured as raid1 with no metadata devices to hold superblock/bitmap info. It can be achieved using the following sequence: truncate -s 32M /dev/shm/raid-test LOOP=$(losetup --show -f /dev/shm/raid-test) dmsetup create raid-test-linear0 --table "0 1024 linear $LOOP 0" dmsetup create raid-test-linear1 --table "0 1024 linear $LOOP 1024" dmsetup create raid-test --table "0 1024 raid raid1 1 2048 2 - /dev/mapper/raid-test-linear0 - /dev/mapper/raid-test-linear1" This results in the following crash: [ 4029.110216] device-mapper: raid: Ignoring chunk size parameter for RAID 1 [ 4029.110217] device-mapper: raid: Choosing default region size of 4MiB [ 4029.111349] md/raid1:mdX: active with 2 out of 2 mirrors [ 4029.114770] BUG: unable to handle kernel NULL pointer dereference at 0000000000000030 [ 4029.114802] IP: bitmap_resize+0x25/0x7c0 [md_mod] [ 4029.114816] PGD 0 … [ 4029.115059] Hardware name: Aquarius Pro P30 S85 BUY-866/B85M-E, BIOS 2304 05/25/2015 [ 4029.115079] task: ffff88015cc29a80 task.stack: ffffc90001a5c000 [ 4029.115097] RIP: 0010:bitmap_resize+0x25/0x7c0 [md_mod] [ 4029.115112] RSP: 0018:ffffc90001a5fb68 EFLAGS: 00010246 [ 4029.115127] RAX: 0000000000000005 RBX: 0000000000000000 RCX: 0000000000000000 [ 4029.115146] RDX: 0000000000000000 RSI: 0000000000000400 RDI: 0000000000000000 [ 4029.115166] RBP: ffffc90001a5fc28 R08: 0000000800000000 R09: 00000008ffffffff [ 4029.115185] R10: ffffea0005661600 R11: ffff88015cc29a80 R12: ffff88021231f058 [ 4029.115204] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 [ 4029.115223] FS: 00007fe73a6b4740(0000) GS:ffff88021ea80000(0000) knlGS:0000000000000000 [ 4029.115245] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 4029.115261] CR2: 0000000000000030 CR3: 0000000159a74000 CR4: 00000000001426e0 [ 4029.115281] Call Trace: [ 4029.115291] ? raid_iterate_devices+0x63/0x80 [dm_raid] [ 4029.115309] ? dm_table_all_devices_attribute.isra.23+0x41/0x70 [dm_mod] [ 4029.115329] ? dm_table_set_restrictions+0x225/0x2d0 [dm_mod] [ 4029.115346] raid_preresume+0x81/0x2e0 [dm_raid] [ 4029.115361] dm_table_resume_targets+0x47/0xe0 [dm_mod] [ 4029.115378] dm_resume+0xa8/0xd0 [dm_mod] [ 4029.115391] dev_suspend+0x123/0x250 [dm_mod] [ 4029.115405] ? table_load+0x350/0x350 [dm_mod] [ 4029.115419] ctl_ioctl+0x1c2/0x490 [dm_mod] [ 4029.115433] dm_ctl_ioctl+0xe/0x20 [dm_mod] [ 4029.115447] do_vfs_ioctl+0x8d/0x5a0 [ 4029.115459] ? ____fput+0x9/0x10 [ 4029.115470] ? task_work_run+0x79/0xa0 [ 4029.115481] SyS_ioctl+0x3c/0x70 [ 4029.115493] entry_SYSCALL_64_fastpath+0x13/0x94 The raid_preresume() function incorrectly assumes that the raid_set has a bitmap enabled if RT_FLAG_RS_BITMAP_LOADED is set. But RT_FLAG_RS_BITMAP_LOADED is getting set in __load_dirty_region_bitmap() even if there is no bitmap present (and bitmap_load() happily returns 0 even if a bitmap isn't present). So the only way forward in the near-term is to check if the bitmap is present by seeing if mddev->bitmap is not NULL after bitmap_load() has been called. By doing so the above NULL pointer is avoided. Fixes: `4257e08` ("dm raid: support to change bitmap region size") Signed-off-by: Dmitry Bilunov <kmeaw@yandex-team.ru> Signed-off-by: Andrey Smetanin <asmetanin@yandex-team.ru> Acked-by: Heinz Mauelshagen <heinzm@redhat.com> Signed-off-by: Mike Snitzer <snitzer@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
Michael Ellerman	abbf0fd436	powerpc/crypto/crc32c-vpmsum: Fix missing preempt_disable() commit `4749228f02` upstream. In crc32c_vpmsum() we call enable_kernel_altivec() without first disabling preemption, which is not allowed: WARNING: CPU: 9 PID: 2949 at ../arch/powerpc/kernel/process.c:277 enable_kernel_altivec+0x100/0x120 Modules linked in: dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c vmx_crypto ... CPU: 9 PID: 2949 Comm: docker Not tainted 4.11.0-rc5-compiler_gcc-6.3.1-00033-g308ac7563944 #381 ... NIP [c00000000001e320] enable_kernel_altivec+0x100/0x120 LR [d000000003df0910] crc32c_vpmsum+0x108/0x150 [crc32c_vpmsum] Call Trace: 0xc138fd09 (unreliable) crc32c_vpmsum+0x108/0x150 [crc32c_vpmsum] crc32c_vpmsum_update+0x3c/0x60 [crc32c_vpmsum] crypto_shash_update+0x88/0x1c0 crc32c+0x64/0x90 [libcrc32c] dm_bm_checksum+0x48/0x80 [dm_persistent_data] sb_check+0x84/0x120 [dm_thin_pool] dm_bm_validate_buffer.isra.0+0xc0/0x1b0 [dm_persistent_data] dm_bm_read_lock+0x80/0xf0 [dm_persistent_data] __create_persistent_data_objects+0x16c/0x810 [dm_thin_pool] dm_pool_metadata_open+0xb0/0x1a0 [dm_thin_pool] pool_ctr+0x4cc/0xb60 [dm_thin_pool] dm_table_add_target+0x16c/0x3c0 table_load+0x184/0x400 ctl_ioctl+0x2f0/0x560 dm_ctl_ioctl+0x38/0x50 do_vfs_ioctl+0xd8/0x920 SyS_ioctl+0x68/0xc0 system_call+0x38/0xfc It used to be sufficient just to call pagefault_disable(), because that also disabled preemption. But the two were decoupled in commit `8222dbe21e` ("sched/preempt, mm/fault: Decouple preemption from the page fault logic") in mid 2015. So add the missing preempt_disable/enable(). We should also call disable_kernel_fp(), although it does nothing by default, there is a debug switch to make it active and all enables should be paired with disables. Fixes: `6dd7a82cc5` ("crypto: powerpc - Add POWER8 optimised crc32c") Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
Paul Mackerras	45c2ed941c	powerpc: Don't try to fix up misaligned load-with-reservation instructions commit `48fe9e9488` upstream. In the past, there was only one load-with-reservation instruction, lwarx, and if a program attempted a lwarx on a misaligned address, it would take an alignment interrupt and the kernel handler would emulate it as though it was lwzx, which was not really correct, but benign since it is loading the right amount of data, and the lwarx should be paired with a stwcx. to the same address, which would also cause an alignment interrupt which would result in a SIGBUS being delivered to the process. We now have 5 different sizes of load-with-reservation instruction. Of those, lharx and ldarx cause an immediate SIGBUS by luck since their entries in aligninfo[] overlap instructions which were not fixed up, but lqarx overlaps with lhz and will be emulated as such. lbarx can never generate an alignment interrupt since it only operates on 1 byte. To straighten this out and fix the lqarx case, this adds code to detect the l[hwdq]arx instructions and return without fixing them up, resulting in a SIGBUS being delivered to the process. Signed-off-by: Paul Mackerras <paulus@ozlabs.org> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
Oliver O'Halloran	fba7546bbe	powerpc/64: Fix flush_(d\|i)cache_range() called from modules commit `8f5f525d5b` upstream. When the kernel is compiled to use 64bit ABIv2 the _GLOBAL() macro does not include a global entry point. A function's global entry point is used when the function is called from a different TOC context and in the kernel this typically means a call from a module into the vmlinux (or vice-versa). There are a few exported asm functions declared with _GLOBAL() and calling them from a module will likely crash the kernel since any TOC relative load will yield garbage. flush_icache_range() and flush_dcache_range() are both exported to modules, and use the TOC, so must use _GLOBAL_TOC(). Fixes: `721aeaa9fd` ("powerpc: Build little endian ppc64 kernel with ABIv2") Signed-off-by: Oliver O'Halloran <oohall@gmail.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
Frederic Barrat	24d945d479	powerpc/mm: Add missing global TLB invalidate if cxl is active commit `88b1bf7268` upstream. Commit `4c6d9acce1` ("powerpc/mm: Add hooks for cxl") converted local TLB invalidates to global if the cxl driver is active. This is necessary because the CAPP snoops invalidations to forward them to the PSL on the cxl adapter. However one path was forgotten. native_flush_hash_range() still does local TLB invalidates, as found out the hard way recently. This patch fixes it by following the same logic as previously: if the cxl driver is active, the local TLB invalidates are 'upgraded' to global. Fixes: `4c6d9acce1` ("powerpc/mm: Add hooks for cxl") Signed-off-by: Frederic Barrat <fbarrat@linux.vnet.ibm.com> Reviewed-by: Aneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
Benjamin Herrenschmidt	6fbf84b5da	powerpc: Disable HFSCR[TM] if TM is not supported commit `7ed23e1bae` upstream. On Power8 & Power9 the early CPU inititialisation in __init_HFSCR() turns on HFSCR[TM] (Hypervisor Facility Status and Control Register [Transactional Memory]), but that doesn't take into account that TM might be disabled by CPU features, or disabled by the kernel being built with CONFIG_PPC_TRANSACTIONAL_MEM=n. So later in boot, when we have setup the CPU features, clear HSCR[TM] if the TM CPU feature has been disabled. We use CPU_FTR_TM_COMP to account for the CONFIG_PPC_TRANSACTIONAL_MEM=n case. Without this a KVM guest might try use TM, even if told not to, and cause an oops in the host kernel. Typically the oops is seen in __kvmppc_vcore_entry() and may or may not be fatal to the host, but is always bad news. In practice all shipping CPU revisions do support TM, and all host kernels we are aware of build with TM support enabled, so no one should actually be able to hit this in the wild. Fixes: `2a3563b023` ("powerpc: Setup in HFSCR for POWER8") Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org> Tested-by: Sam Bobroff <sam.bobroff@au1.ibm.com> [mpe: Rewrite change log with input from Sam, add Fixes/stable] Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
James Hogan	a1db9b2c1b	metag/usercopy: Add missing fixups commit `b884a190af` upstream. The rapf copy loops in the Meta usercopy code is missing some extable entries for HTP cores with unaligned access checking enabled, where faults occur on the instruction immediately after the faulting access. Add the fixup labels and extable entries for these cases so that corner case user copy failures don't cause kernel crashes. Fixes: `373cd784d0` ("metag: Memory handling") Signed-off-by: James Hogan <james.hogan@imgtec.com> Cc: linux-metag@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
James Hogan	ce154d517a	metag/usercopy: Fix src fixup in from user rapf loops commit `2c0b1df88b` upstream. The fixup code to rewind the source pointer in __asm_copy_from_user_{32,64}bit_rapf_loop() always rewound the source by a single unit (4 or 8 bytes), however this is insufficient if the fault didn't occur on the first load in the loop, as the source pointer will have been incremented but nothing will have been stored until all 4 register [pairs] are loaded. Read the LSM_STEP field of TXSTATUS (which is already loaded into a register), a bit like the copy_to_user versions, to determine how many iterations of MGET[DL] have taken place, all of which need rewinding. Fixes: `373cd784d0` ("metag: Memory handling") Signed-off-by: James Hogan <james.hogan@imgtec.com> Cc: linux-metag@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
James Hogan	4f3f0dd2a7	metag/usercopy: Set flags before ADDZ commit `fd40eee129` upstream. The fixup code for the copy_to_user rapf loops reads TXStatus.LSM_STEP to decide how far to rewind the source pointer. There is a special case for the last execution of an MGETL/MGETD, since it leaves LSM_STEP=0 even though the number of MGETLs/MGETDs attempted was 4. This uses ADDZ which is conditional upon the Z condition flag, but the AND instruction which masked the TXStatus.LSM_STEP field didn't set the condition flags based on the result. Fix that now by using ANDS which does set the flags, and also marking the condition codes as clobbered by the inline assembly. Fixes: `373cd784d0` ("metag: Memory handling") Signed-off-by: James Hogan <james.hogan@imgtec.com> Cc: linux-metag@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
James Hogan	3dc0fe517a	metag/usercopy: Zero rest of buffer from copy_from_user commit `563ddc1076` upstream. Currently we try to zero the destination for a failed read from userland in fixup code in the usercopy.c macros. The rest of the destination buffer is then zeroed from __copy_user_zeroing(), which is used for both copy_from_user() and __copy_from_user(). Unfortunately we fail to zero in the fixup code as D1Ar1 is set to 0 before the fixup code entry labels, and __copy_from_user() shouldn't even be zeroing the rest of the buffer. Move the zeroing out into copy_from_user() and rename __copy_user_zeroing() to raw_copy_from_user() since it no longer does any zeroing. This also conveniently matches the name needed for RAW_COPY_USER support in a later patch. Fixes: `373cd784d0` ("metag: Memory handling") Reported-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: James Hogan <james.hogan@imgtec.com> Cc: linux-metag@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
James Hogan	4a93ac814d	metag/usercopy: Add early abort to copy_to_user commit `fb8ea062a8` upstream. When copying to userland on Meta, if any faults are encountered immediately abort the copy instead of continuing on and repeatedly faulting, and worse potentially copying further bytes successfully to subsequent valid pages. Fixes: `373cd784d0` ("metag: Memory handling") Reported-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: James Hogan <james.hogan@imgtec.com> Cc: linux-metag@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
James Hogan	49a292dcd8	metag/usercopy: Fix alignment error checking commit `2257211942` upstream. Fix the error checking of the alignment adjustment code in raw_copy_from_user(), which mistakenly considers it safe to skip the error check when aligning the source buffer on a 2 or 4 byte boundary. If the destination buffer was unaligned it may have started to copy using byte or word accesses, which could well be at the start of a new (valid) source page. This would result in it appearing to have copied 1 or 2 bytes at the end of the first (invalid) page rather than none at all. Fixes: `373cd784d0` ("metag: Memory handling") Signed-off-by: James Hogan <james.hogan@imgtec.com> Cc: linux-metag@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:13 +02:00
James Hogan	2bb52b47e7	metag/usercopy: Drop unused macros commit `ef62a2d81f` upstream. Metag's lib/usercopy.c has a bunch of copy_from_user macros for larger copies between 5 and 16 bytes which are completely unused. Before fixing zeroing lets drop these macros so there is less to fix. Signed-off-by: James Hogan <james.hogan@imgtec.com> Cc: Al Viro <viro@zeniv.linux.org.uk> Cc: linux-metag@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Arend Van Spriel	9afc076d28	brcmfmac: use local iftype avoiding use-after-free of virtual interface commit `d77facb884` upstream. A use-after-free was found using KASAN. In brcmf_p2p_del_if() the virtual interface is removed using call to brcmf_remove_interface(). After that the virtual interface instance has been freed and should not be referenced. Solve this by storing the nl80211 iftype in local variable, which is used in a couple of places anyway. Reported-by: Daniel J Blueman <daniel@quora.org> Reviewed-by: Hante Meuleman <hante.meuleman@broadcom.com> Reviewed-by: Pieter-Paul Giesberts <pieter-paul.giesberts@broadcom.com> Reviewed-by: Franky Lin <franky.lin@broadcom.com> Signed-off-by: Arend van Spriel <arend.vanspriel@broadcom.com> Signed-off-by: Kalle Valo <kvalo@codeaurora.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Johannes Berg	c0321505df	mac80211: unconditionally start new netdev queues with iTXQ support commit `7d65f82954` upstream. When internal mac80211 TXQs aren't supported, netdev queues must always started out started even when driver queues are stopped while the interface is added. This is necessary because with the internal TXQ support netdev queues are never stopped and packet scheduling/dropping is done in mac80211. Fixes: `80a83cfc43` ("mac80211: skip netdev queue control with software queuing") Reported-and-tested-by: Sven Eckelmann <sven.eckelmann@openmesh.com> Signed-off-by: Johannes Berg <johannes.berg@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Wei Yongjun	703cebf6e9	ring-buffer: Fix return value check in test_ringbuffer() commit `62277de758` upstream. In case of error, the function kthread_run() returns ERR_PTR() and never returns NULL. The NULL test in the return value check should be replaced with IS_ERR(). Link: http://lkml.kernel.org/r/1466184839-14927-1-git-send-email-weiyj_lk@163.com Fixes: `6c43e554a` ("ring-buffer: Add ring buffer startup selftest") Signed-off-by: Wei Yongjun <yongjun_wei@trendmicro.com.cn> Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Calvin Owens	f7db18998e	xfs: Honor FALLOC_FL_KEEP_SIZE when punching ends of files commit `3dd09d5a85` upstream. When punching past EOF on XFS, fallocate(mode=PUNCH_HOLE\|KEEP_SIZE) will round the file size up to the nearest multiple of PAGE_SIZE: calvinow@vm-disks/generic-xfs-1 ~$ dd if=/dev/urandom of=test bs=2048 count=1 calvinow@vm-disks/generic-xfs-1 ~$ stat test Size: 2048 Blocks: 8 IO Block: 4096 regular file calvinow@vm-disks/generic-xfs-1 ~$ fallocate -n -l 2048 -o 2048 -p test calvinow@vm-disks/generic-xfs-1 ~$ stat test Size: 4096 Blocks: 8 IO Block: 4096 regular file Commit `3c2bdc912a` ("xfs: kill xfs_zero_remaining_bytes") replaced xfs_zero_remaining_bytes() with calls to iomap helpers. The new helpers don't enforce that [pos,offset) lies strictly on [0,i_size) when being called from xfs_free_file_space(), so by "leaking" these ranges into xfs_zero_range() we get this buggy behavior. Fix this by reintroducing the checks xfs_zero_remaining_bytes() did against i_size at the bottom of xfs_free_file_space(). Reported-by: Aaron Gao <gzh@fb.com> Fixes: `3c2bdc912a` ("xfs: kill xfs_zero_remaining_bytes") Cc: Christoph Hellwig <hch@lst.de> Cc: Brian Foster <bfoster@redhat.com> Signed-off-by: Calvin Owens <calvinowens@fb.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Martin Brandenburg	dc62935ce0	orangefs: move features validation to fix filesystem hang commit `cefdc26e86` upstream. Without this fix (and another to the userspace component itself described later), the kernel will be unable to process any OrangeFS requests after the userspace component is restarted (due to a crash or at the administrator's behest). The bug here is that inside orangefs_remount, the orangefs_request_mutex is locked. When the userspace component restarts while the filesystem is mounted, it sends a ORANGEFS_DEV_REMOUNT_ALL ioctl to the device, which causes the kernel to send it a few requests aimed at synchronizing the state between the two. While this is happening the orangefs_request_mutex is locked to prevent any other requests going through. This is only half of the bugfix. The other half is in the userspace component which outright ignores(!) requests made before it considers the filesystem remounted, which is after the ioctl returns. Of course the ioctl doesn't return until after the userspace component responds to the request it ignores. The userspace component has been changed to allow ORANGEFS_VFS_OP_FEATURES regardless of the mount status. Mike Marshall says: "I've tested this patch against the fixed userspace part. This patch is real important, I hope it can make it into 4.11... Here's what happens when the userspace daemon is restarted, without the patch: ============================================= [ INFO: possible recursive locking detected ] [ 4.10.0-00007-ge98bdb3 #1 Not tainted ] --------------------------------------------- pvfs2-client-co/29032 is trying to acquire lock: (orangefs_request_mutex){+.+.+.}, at: service_operation+0x3c7/0x7b0 [orangefs] but task is already holding lock: (orangefs_request_mutex){+.+.+.}, at: dispatch_ioctl_command+0x1bf/0x330 [orangefs] CPU: 0 PID: 29032 Comm: pvfs2-client-co Not tainted 4.10.0-00007-ge98bdb3 #1 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.9.3-1.fc25 04/01/2014 Call Trace: __lock_acquire+0x7eb/0x1290 lock_acquire+0xe8/0x1d0 mutex_lock_killable_nested+0x6f/0x6e0 service_operation+0x3c7/0x7b0 [orangefs] orangefs_remount+0xea/0x150 [orangefs] dispatch_ioctl_command+0x227/0x330 [orangefs] orangefs_devreq_ioctl+0x29/0x70 [orangefs] do_vfs_ioctl+0xa3/0x6e0 SyS_ioctl+0x79/0x90" Signed-off-by: Martin Brandenburg <martin@omnibond.com> Acked-by: Mike Marshall <hubcap@omnibond.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Arnd Bergmann	c4fe79a44a	Kbuild: use cc-disable-warning consistently for maybe-uninitialized commit `b334e19ae9` upstream. In commit `a76bcf557e` ("Kbuild: enable -Wmaybe-uninitialized warning for "make W=1""), I reverted another change that happened to fix a problem with old compilers, and now we get this report again with old compilers (prior to gcc-4.8) and GCOV enabled: cc1: warnings being treated as errors drivers/gpu/drm/i915/intel_ringbuffer.c: In function 'intel_ring_setup_status_page': drivers/gpu/drm/i915/intel_ringbuffer.c:438: error: 'mmio.reg' may be used uninitialized in this function At top level: >> cc1: error: unrecognized command line option "-Wno-maybe-uninitialized" The problem is that we turn off the warning conditionally in a number of places as we should, but one of them does it unconditionally. Instead, change it to call cc-disable-warning as we do elsewhere. The original patch that caused it was merged into linux-4.7, then 4.8 removed the change and 4.9 brought it back, so we probably want a backport to 4.9 once this is merged. Use a ':=' assignment instead of '=' to force the cc-disable-warning call to only be evaluated once instead of every time. Fixes: `a76bcf557e` ("Kbuild: enable -Wmaybe-uninitialized warning for "make W=1"") Fixes: `e72e2dfe7c` ("gcov: disable -Wmaybe-uninitialized warning") Reported-by: kbuild test robot <fengguang.wu@intel.com> Signed-off-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Masahiro Yamada <yamada.masahiro@socionext.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Dmitry Torokhov	e4a6234893	ACPI / gpio: do not fall back to parsing _CRS when we get a deferral commit `693bdaa164` upstream. If, while locating GPIOs by name, we get probe deferral, we should immediately report it to caller rather than trying to fall back to parsing unnamed GPIOs from _CRS block. Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Acked-by: Mika Westerberg <mika.westerberg@linux.intel.com> Acked-and-Tested-by: Hans de Goede <hdegoede@redhat.com> Signed-off-by: Linus Walleij <linus.walleij@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Sami Tolvanen	ab83597b9d	dm verity fec: fix bufio leaks commit `86e3e83b44` upstream. Buffers read through dm_bufio_read() were not released in all code paths. Fixes: `a739ff3f54` ("dm verity: add support for forward error correction") Signed-off-by: Sami Tolvanen <samitolvanen@google.com> Signed-off-by: Mike Snitzer <snitzer@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Sami Tolvanen	2ff0876426	dm verity fec: limit error correction recursion commit `f1a880a93b` upstream. If the hash tree itself is sufficiently corrupt in addition to data blocks, it's possible for error correction to end up in a deep recursive loop, which eventually causes a kernel panic. This change limits the recursion to a reasonable level during a single I/O operation. Fixes: `a739ff3f54` ("dm verity: add support for forward error correction") Signed-off-by: Sami Tolvanen <samitolvanen@google.com> Signed-off-by: Mike Snitzer <snitzer@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
bsegall@google.com	d9fa435103	ptrace: fix PTRACE_LISTEN race corrupting task->state commit `5402e97af6` upstream. In PT_SEIZED + LISTEN mode STOP/CONT signals cause a wakeup against __TASK_TRACED. If this races with the ptrace_unfreeze_traced at the end of a PTRACE_LISTEN, this can wake the task /after/ the check against __TASK_TRACED, but before the reset of state to TASK_TRACED. This causes it to instead clobber TASK_WAKING, allowing a subsequent wakeup against TRACED while the task is still on the rq wake_list, corrupting it. Oleg said: "The kernel can crash or this can lead to other hard-to-debug problems. In short, "task->state = TASK_TRACED" in ptrace_unfreeze_traced() assumes that nobody else can wake it up, but PTRACE_LISTEN breaks the contract. Obviusly it is very wrong to manipulate task->state if this task is already running, or WAKING, or it sleeps again" [akpm@linux-foundation.org: coding-style fixes] Fixes: `9899d11f` ("ptrace: ensure arch_ptrace/ptrace_request can never race with SIGKILL") Link: http://lkml.kernel.org/r/xm26y3vfhmkp.fsf_-_@bsegall-linux.mtv.corp.google.com Signed-off-by: Ben Segall <bsegall@google.com> Acked-by: Oleg Nesterov <oleg@redhat.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:12 +02:00
Alexander Polakov	e3b08ebe47	mm/page_alloc.c: fix print order in show_free_areas() commit `1f06b81aea` upstream. Fixes: `11fb998986` ("mm: move most file-based accounting to the node") Link: http://lkml.kernel.org/r/1490377730.30219.2.camel@beget.ru Signed-off-by: Alexander Polyakov <apolyakov@beget.com> Acked-by: Michal Hocko <mhocko@suse.com> Cc: Mel Gorman <mgorman@techsingularity.net> Cc: Vlastimil Babka <vbabka@suse.cz> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Jan-Marek Glogowski	8446cb1adf	Reset TreeId to zero on SMB2 TREE_CONNECT commit `806a28efe9` upstream. Currently the cifs module breaks the CIFS specs on reconnect as described in http://msdn.microsoft.com/en-us/library/cc246529.aspx: "TreeId (4 bytes): Uniquely identifies the tree connect for the command. This MUST be 0 for the SMB2 TREE_CONNECT Request." Signed-off-by: Jan-Marek Glogowski <glogow@fbihome.de> Reviewed-by: Aurelien Aptel <aaptel@suse.com> Tested-by: Aurelien Aptel <aaptel@suse.com> Signed-off-by: Steve French <smfrench@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Arend Van Spriel	57e1e90dda	cfg80211: check rdev resume callback only for registered wiphy commit `b3ef5520c1` upstream. We got the following use-after-free KASAN report: BUG: KASAN: use-after-free in wiphy_resume+0x591/0x5a0 [cfg80211] at addr ffff8803fc244090 Read of size 8 by task kworker/u16:24/2587 CPU: 6 PID: 2587 Comm: kworker/u16:24 Tainted: G B 4.9.13-debug+ Hardware name: Dell Inc. XPS 15 9550/0N7TVV, BIOS 1.2.19 12/22/2016 Workqueue: events_unbound async_run_entry_fn ffff880425d4f9d8 ffffffffaeedb541 ffff88042b80ef00 ffff8803fc244088 ffff880425d4fa00 ffffffffae84d7a1 ffff880425d4fa98 ffff8803fc244080 ffff88042b80ef00 ffff880425d4fa88 ffffffffae84da3a ffffffffc141f7d9 Call Trace: [<ffffffffaeedb541>] dump_stack+0x85/0xc4 [<ffffffffae84d7a1>] kasan_object_err+0x21/0x70 [<ffffffffae84da3a>] kasan_report_error+0x1fa/0x500 [<ffffffffc141f7d9>] ? cfg80211_bss_age+0x39/0xc0 [cfg80211] [<ffffffffc141f83a>] ? cfg80211_bss_age+0x9a/0xc0 [cfg80211] [<ffffffffae48d46d>] ? trace_hardirqs_on+0xd/0x10 [<ffffffffc13fb1c0>] ? wiphy_suspend+0xc70/0xc70 [cfg80211] [<ffffffffae84def1>] __asan_report_load8_noabort+0x61/0x70 [<ffffffffc13fb100>] ? wiphy_suspend+0xbb0/0xc70 [cfg80211] [<ffffffffc13fb751>] ? wiphy_resume+0x591/0x5a0 [cfg80211] [<ffffffffc13fb751>] wiphy_resume+0x591/0x5a0 [cfg80211] [<ffffffffc13fb1c0>] ? wiphy_suspend+0xc70/0xc70 [cfg80211] [<ffffffffaf3b206e>] dpm_run_callback+0x6e/0x4f0 [<ffffffffaf3b31b2>] device_resume+0x1c2/0x670 [<ffffffffaf3b367d>] async_resume+0x1d/0x50 [<ffffffffae3ee84e>] async_run_entry_fn+0xfe/0x610 [<ffffffffae3d0666>] process_one_work+0x716/0x1a50 [<ffffffffae3d05c9>] ? process_one_work+0x679/0x1a50 [<ffffffffafdd7b6d>] ? _raw_spin_unlock_irq+0x3d/0x60 [<ffffffffae3cff50>] ? pwq_dec_nr_in_flight+0x2b0/0x2b0 [<ffffffffae3d1a80>] worker_thread+0xe0/0x1460 [<ffffffffae3d19a0>] ? process_one_work+0x1a50/0x1a50 [<ffffffffae3e54c2>] kthread+0x222/0x2e0 [<ffffffffae3e52a0>] ? kthread_park+0x80/0x80 [<ffffffffae3e52a0>] ? kthread_park+0x80/0x80 [<ffffffffae3e52a0>] ? kthread_park+0x80/0x80 [<ffffffffafdd86aa>] ret_from_fork+0x2a/0x40 Object at ffff8803fc244088, in cache kmalloc-1024 size: 1024 Allocated: PID = 71 save_stack_trace+0x1b/0x20 save_stack+0x46/0xd0 kasan_kmalloc+0xad/0xe0 kasan_slab_alloc+0x12/0x20 __kmalloc_track_caller+0x134/0x360 kmemdup+0x20/0x50 brcmf_cfg80211_attach+0x10b/0x3a90 [brcmfmac] brcmf_bus_start+0x19a/0x9a0 [brcmfmac] brcmf_pcie_setup+0x1f1a/0x3680 [brcmfmac] brcmf_fw_request_nvram_done+0x44c/0x11b0 [brcmfmac] request_firmware_work_func+0x135/0x280 process_one_work+0x716/0x1a50 worker_thread+0xe0/0x1460 kthread+0x222/0x2e0 ret_from_fork+0x2a/0x40 Freed: PID = 2568 save_stack_trace+0x1b/0x20 save_stack+0x46/0xd0 kasan_slab_free+0x71/0xb0 kfree+0xe8/0x2e0 brcmf_cfg80211_detach+0x62/0xf0 [brcmfmac] brcmf_detach+0x14a/0x2b0 [brcmfmac] brcmf_pcie_remove+0x140/0x5d0 [brcmfmac] brcmf_pcie_pm_leave_D3+0x198/0x2e0 [brcmfmac] pci_pm_resume+0x186/0x220 dpm_run_callback+0x6e/0x4f0 device_resume+0x1c2/0x670 async_resume+0x1d/0x50 async_run_entry_fn+0xfe/0x610 process_one_work+0x716/0x1a50 worker_thread+0xe0/0x1460 kthread+0x222/0x2e0 ret_from_fork+0x2a/0x40 Memory state around the buggy address: ffff8803fc243f80: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc ffff8803fc244000: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc >ffff8803fc244080: fc fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb ^ ffff8803fc244100: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb ffff8803fc244180: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb What is happening is that brcmf_pcie_resume() detects a device that is no longer responsive and it decides to unbind resulting in a wiphy_unregister() and wiphy_free() call. Now the wiphy instance remains allocated, because PM needs to call wiphy_resume() for it. However, brcmfmac already does a kfree() for the struct cfg80211_registered_device::ops field. Change the checks in wiphy_resume() to only access the struct cfg80211_registered_device::ops if the wiphy instance is still registered at this time. Reported-by: Daniel J Blueman <daniel@quora.org> Reviewed-by: Hante Meuleman <hante.meuleman@broadcom.com> Reviewed-by: Pieter-Paul Giesberts <pieter-paul.giesberts@broadcom.com> Reviewed-by: Franky Lin <franky.lin@broadcom.com> Signed-off-by: Arend van Spriel <arend.vanspriel@broadcom.com> Signed-off-by: Johannes Berg <johannes.berg@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Victor Kamensky	3715dbf77f	arm64: mm: unaligned access by user-land should be received as SIGBUS commit `09a6adf53d` upstream. After `52d7523` (arm64: mm: allow the kernel to handle alignment faults on user accesses) commit user-land accesses that produce unaligned exceptions like in case of aarch32 ldm/stm/ldrd/strd instructions operating on unaligned memory received by user-land as SIGSEGV. It is wrong, it should be reported as SIGBUS as it was before `52d7523` commit. Changed do_bad_area function to take signal and code parameters out of esr value using fault_info table, so in case of do_alignment_fault fault user-land will receive SIGBUS. Wrapped access to fault_info table into esr_to_fault_info function. Fixes: `52d7523` (arm64: mm: allow the kernel to handle alignment faults on user accesses) Signed-off-by: Victor Kamensky <kamensky@cisco.com> Signed-off-by: Will Deacon <will.deacon@arm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Quentin Schulz	71b44ef83d	iio: bmg160: reset chip when probing commit `4bdc902968` upstream. The gyroscope chip might need to be reset to be used. Without the chip being reset, the driver stopped at the first regmap_read (to get the CHIP_ID) and failed to probe. The datasheet of the gyroscope says that a minimum wait of 30ms after the reset has to be done. This patch has been checked on a BMX055 and the datasheet of the BMG160 and the BMI055 give the same reset register and bits. Signed-off-by: Quentin Schulz <quentin.schulz@free-electrons.com> Signed-off-by: Jonathan Cameron <jic23@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Suzuki K Poulose	ac303c64cd	kvm: arm/arm64: Fix locking for kvm_free_stage2_pgd commit `8b3405e345` upstream. In kvm_free_stage2_pgd() we don't hold the kvm->mmu_lock while calling unmap_stage2_range() on the entire memory range for the guest. This could cause problems with other callers (e.g, munmap on a memslot) trying to unmap a range. And since we have to unmap the entire Guest memory range holding a spinlock, make sure we yield the lock if necessary, after we unmap each PUD range. Fixes: commit `d5d8184d35` ("KVM: ARM: Memory virtualization setup") Cc: Paolo Bonzini <pbonzin@redhat.com> Cc: Marc Zyngier <marc.zyngier@arm.com> Cc: Christoffer Dall <christoffer.dall@linaro.org> Cc: Mark Rutland <mark.rutland@arm.com> Signed-off-by: Suzuki K Poulose <suzuki.poulose@arm.com> [ Avoid vCPU starvation and lockup detector warnings ] Signed-off-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Suzuki K Poulose <suzuki.poulose@arm.com> Signed-off-by: Christoffer Dall <cdall@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Marc Zyngier	a1ea318936	arm/arm64: KVM: Take mmap_sem in kvm_arch_prepare_memory_region commit `72f310481a` upstream. We don't hold the mmap_sem while searching for VMAs (via find_vma), in kvm_arch_prepare_memory_region, which can end up in expected failures. Fixes: commit `8eef91239e` ("arm/arm64: KVM: map MMIO regions at creation time") Cc: Ard Biesheuvel <ard.biesheuvel@linaro.org> Cc: Eric Auger <eric.auger@rehat.com> Reviewed-by: Christoffer Dall <cdall@linaro.org> [ Handle dirty page logging failure case ] Signed-off-by: Suzuki K Poulose <suzuki.poulose@arm.com> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Marc Zyngier	48f2825abc	arm/arm64: KVM: Take mmap_sem in stage2_unmap_vm commit `90f6e150e4` upstream. We don't hold the mmap_sem while searching for the VMAs when we try to unmap each memslot for a VM. Fix this properly to avoid unexpected results. Fixes: commit `957db105c9` ("arm/arm64: KVM: Introduce stage2_unmap_vm") Reviewed-by: Christoffer Dall <cdall@linaro.org> Signed-off-by: Suzuki K Poulose <suzuki.poulose@arm.com> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Shuxiao Zhang	8f8de8d2bf	staging: android: ashmem: lseek failed due to no FMODE_LSEEK. commit `97fbfef6bd` upstream. vfs_llseek will check whether the file mode has FMODE_LSEEK, no return failure. But ashmem can be lseek, so add FMODE_LSEEK to ashmem file. Comment From Greg Hackmann: ashmem_llseek() passes the llseek() call through to the backing shmem file. `91360b02ab` ("ashmem: use vfs_llseek()") changed this from directly calling the file's llseek() op into a VFS layer call. This also adds a check for the FMODE_LSEEK bit, so without that bit ashmem_llseek() now always fails with -ESPIPE. Fixes: `91360b02ab` ("ashmem: use vfs_llseek()") Signed-off-by: Shuxiao Zhang <zhangshuxiao@xiaomi.com> Tested-by: Greg Hackmann <ghackmann@google.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
NeilBrown	d9eedab383	sysfs: be careful of error returns from ops->show() commit `c8a139d001` upstream. ops->show() can return a negative error code. Commit `65da3484d9` ("sysfs: correctly handle short reads on PREALLOC attrs.") (in v4.4) caused this to be stored in an unsigned 'size_t' variable, so errors would look like large numbers. As a result, if an error is returned, sysfs_kf_read() will return the value of 'count', typically 4096. Commit `17d0774f80` ("sysfs: correctly handle read offset on PREALLOC attrs") (in v4.8) extended this error to use the unsigned large 'len' as a size for memmove(). Consequently, if ->show returns an error, then the first read() on the sysfs file will return 4096 and could return uninitialized memory to user-space. If the application performs a subsequent read, this will trigger a memmove() with extremely large count, and is likely to crash the machine is bizarre ways. This bug can currently only be triggered by reading from an md sysfs attribute declared with __ATTR_PREALLOC() during the brief period between when mddev_put() deletes an mddev from the ->all_mddevs list, and when mddev_delayed_delete() - which is scheduled on a workqueue - completes. Before this, an error won't be returned by the ->show() After this, the ->show() won't be called. I can reproduce it reliably only by putting delay like usleep_range(500000,700000); early in mddev_delayed_delete(). Then after creating an md device md0 run echo clear > /sys/block/md0/md/array_state; cat /sys/block/md0/md/array_state The bug can be triggered without the usleep. Fixes: `65da3484d9` ("sysfs: correctly handle short reads on PREALLOC attrs.") Fixes: `17d0774f80` ("sysfs: correctly handle read offset on PREALLOC attrs") Signed-off-by: NeilBrown <neilb@suse.com> Acked-by: Tejun Heo <tj@kernel.org> Reported-and-tested-by: Miroslav Benes <mbenes@suse.cz> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Li Qiang	4ddd24d54f	drm/vmwgfx: fix integer overflow in vmw_surface_define_ioctl() commit `e7e11f9956` upstream. In vmw_surface_define_ioctl(), the 'num_sizes' is the sum of the 'req->mip_levels' array. This array can be assigned any value from the user space. As both the 'num_sizes' and the array is uint32_t, it is easy to make 'num_sizes' overflow. The later 'mip_levels' is used as the loop count. This can lead an oob write. Add the check of 'req->mip_levels' to avoid this. Signed-off-by: Li Qiang <liqiang6-s@360.cn> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:11 +02:00
Thomas Hellstrom	28c84df739	drm/vmwgfx: Remove getparam error message commit `53e16798b0` upstream. The mesa winsys sometimes uses unimplemented parameter requests to check for features. Remove the error message to avoid bloating the kernel log. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:10 +02:00
Thomas Hellstrom	b116797b81	drm/ttm, drm/vmwgfx: Relax permission checking when opening surfaces commit `fe25deb773` upstream. Previously, when a surface was opened using a legacy (non prime) handle, it was verified to have been created by a client in the same master realm. Relax this so that opening is also allowed recursively if the client already has the surface open. This works around a regression in svga mesa where opening of a shared surface is used recursively to obtain surface information. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:10 +02:00
Murray McAllister	604d2eac67	drm/vmwgfx: avoid calling vzalloc with a 0 size in vmw_get_cap_3d_ioctl() commit `63774069d9` upstream. In vmw_get_cap_3d_ioctl(), a user can supply 0 for a size that is used in vzalloc(). This eventually calls dump_stack() (in warn_alloc()), which can leak useful addresses to dmesg. Add check to avoid a size of 0. Signed-off-by: Murray McAllister <murray.mcallister@insomniasec.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:10 +02:00
Murray McAllister	73ab72517b	drm/vmwgfx: NULL pointer dereference in vmw_surface_define_ioctl() commit `36274ab8c5` upstream. Before memory allocations vmw_surface_define_ioctl() checks the upper-bounds of a user-supplied size, but does not check if the supplied size is 0. Add check to avoid NULL pointer dereferences. Signed-off-by: Murray McAllister <murray.mcallister@insomniasec.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:10 +02:00
Thomas Hellstrom	92cc48166e	drm/vmwgfx: Type-check lookups of fence objects commit `f7652afa8e` upstream. A malicious caller could otherwise hand over handles to other objects causing all sorts of interesting problems. Testing done: Ran a Fedora 25 desktop using both Xorg and gnome-shell/Wayland. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:10 +02:00
Sudip Mukherjee	9c2b46e720	ppdev: fix registering same device name commit `9a69645dde` upstream. Usually every parallel port will have a single pardev registered with it. But ppdev driver is an exception. This userspace parallel port driver allows to create multiple parrallel port devices for a single parallel port. And as a result we were having a big warning like: "sysfs: cannot create duplicate filename '/devices/parport0/ppdev0.0'". And with that many parallel port printers stopped working. We have been using the minor number as the id field while registering a parralel port device with a parralel port. But when there are multiple parrallel port device for one single parallel port, they all tried to register with the same name like 'pardev0.0' and everything started failing. Use an incremented index as the id instead of the minor number. Fixes: `8b7d3a9d90` ("ppdev: use new parport device model") Cc: stable <stable@vger.kernel.org> # v4.9+ Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1414656 Bugzilla: https://bugs.archlinux.org/task/52322 Tested-by: James Feeney <james@nurealm.net> Signed-off-by: Sudip Mukherjee <sudip.mukherjee@codethink.co.uk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:10 +02:00
Sudip Mukherjee	bf5202b58f	ppdev: check before attaching port commit `dd5c472a60` upstream. After parport starts using the device model, all pardevice drivers should decide in their match_port callback function if they want to attach with that particulatr port. ppdev has been converted to use the new parport device-model code but pp_attach() tried to attach with all the ports. Create a new array of pointer and use that to remember the ports we have attached. And use that information to skip attaching ports which we have already attached. Tested-by: Joe Lawrence <joe.lawrence@redhat.com> Signed-off-by: Sudip Mukherjee <sudip.mukherjee@codethink.co.uk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-12 12:41:10 +02:00
Greg Kroah-Hartman	37feaf8095	Linux 4.9.21	2017-04-08 09:31:27 +02:00
Keith Busch	02b23e059a	nvme/pci: Disable on removal when disconnected commit `6db28eda26` upstream. If the device is not present, the driver should disable the queues immediately. Prior to this, the driver was relying on the watchdog timer to kill the queues if requests were outstanding to the device, and that just delays removal up to one second. Signed-off-by: Keith Busch <keith.busch@intel.com> Reviewed-by: Johannes Thumshirn <jthumshirn@suse.de> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Sagi Grimberg <sagi@grimberg.me> Signed-off-by: Jens Axboe <axboe@fb.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:36 +02:00
Keith Busch	a5e39a7f29	nvme/core: Fix race kicking freed request_queue commit `f33447b90e` upstream. If a namespace has already been marked dead, we don't want to kick the request_queue again since we may have just freed it from another thread. Signed-off-by: Keith Busch <keith.busch@intel.com> Reviewed-by: Johannes Thumshirn <jthumshirn@suse.de> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Sagi Grimberg <sagi@grimberg.me> Signed-off-by: Jens Axboe <axboe@fb.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:36 +02:00
Jason A. Donenfeld	eb8c62a384	padata: avoid race in reordering commit `de5540d088` upstream. Under extremely heavy uses of padata, crashes occur, and with list debugging turned on, this happens instead: [87487.298728] WARNING: CPU: 1 PID: 882 at lib/list_debug.c:33 __list_add+0xae/0x130 [87487.301868] list_add corruption. prev->next should be next (ffffb17abfc043d0), but was ffff8dba70872c80. (prev=ffff8dba70872b00). [87487.339011] [<ffffffff9a53d075>] dump_stack+0x68/0xa3 [87487.342198] [<ffffffff99e119a1>] ? console_unlock+0x281/0x6d0 [87487.345364] [<ffffffff99d6b91f>] __warn+0xff/0x140 [87487.348513] [<ffffffff99d6b9aa>] warn_slowpath_fmt+0x4a/0x50 [87487.351659] [<ffffffff9a58b5de>] __list_add+0xae/0x130 [87487.354772] [<ffffffff9add5094>] ? _raw_spin_lock+0x64/0x70 [87487.357915] [<ffffffff99eefd66>] padata_reorder+0x1e6/0x420 [87487.361084] [<ffffffff99ef0055>] padata_do_serial+0xa5/0x120 padata_reorder calls list_add_tail with the list to which its adding locked, which seems correct: spin_lock(&squeue->serial.lock); list_add_tail(&padata->list, &squeue->serial.list); spin_unlock(&squeue->serial.lock); This therefore leaves only place where such inconsistency could occur: if padata->list is added at the same time on two different threads. This pdata pointer comes from the function call to padata_get_next(pd), which has in it the following block: next_queue = per_cpu_ptr(pd->pqueue, cpu); padata = NULL; reorder = &next_queue->reorder; if (!list_empty(&reorder->list)) { padata = list_entry(reorder->list.next, struct padata_priv, list); spin_lock(&reorder->lock); list_del_init(&padata->list); atomic_dec(&pd->reorder_objects); spin_unlock(&reorder->lock); pd->processed++; goto out; } out: return padata; I strongly suspect that the problem here is that two threads can race on reorder list. Even though the deletion is locked, call to list_entry is not locked, which means it's feasible that two threads pick up the same padata object and subsequently call list_add_tail on them at the same time. The fix is thus be hoist that lock outside of that block. Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com> Acked-by: Steffen Klassert <steffen.klassert@secunet.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:36 +02:00
NeilBrown	5959cded91	blk: Ensure users for current->bio_list can see the full list. commit `f5fe1b5190` upstream. Commit `79bd99596b` ("blk: improve order of bio handling in generic_make_request()") changed current->bio_list so that it did not contain all of the queued bios, but only those submitted by the currently running make_request_fn. There are two places which walk the list and requeue selected bios, and others that check if the list is empty. These are no longer correct. So redefine current->bio_list to point to an array of two lists, which contain all queued bios, and adjust various code to test or walk both lists. Signed-off-by: NeilBrown <neilb@suse.com> Fixes: `79bd99596b` ("blk: improve order of bio handling in generic_make_request()") Signed-off-by: Jens Axboe <axboe@fb.com> Cc: Jack Wang <jinpu.wang@profitbricks.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:36 +02:00
NeilBrown	d5986e0078	blk: improve order of bio handling in generic_make_request() commit `79bd99596b` upstream. To avoid recursion on the kernel stack when stacked block devices are in use, generic_make_request() will, when called recursively, queue new requests for later handling. They will be handled when the make_request_fn for the current bio completes. If any bios are submitted by a make_request_fn, these will ultimately be handled seqeuntially. If the handling of one of those generates further requests, they will be added to the end of the queue. This strict first-in-first-out behaviour can lead to deadlocks in various ways, normally because a request might need to wait for a previous request to the same device to complete. This can happen when they share a mempool, and can happen due to interdependencies particular to the device. Both md and dm have examples where this happens. These deadlocks can be erradicated by more selective ordering of bios. Specifically by handling them in depth-first order. That is: when the handling of one bio generates one or more further bios, they are handled immediately after the parent, before any siblings of the parent. That way, when generic_make_request() calls make_request_fn for some particular device, we can be certain that all previously submited requests for that device have been completely handled and are not waiting for anything in the queue of requests maintained in generic_make_request(). An easy way to achieve this would be to use a last-in-first-out stack instead of a queue. However this will change the order of consecutive bios submitted by a make_request_fn, which could have unexpected consequences. Instead we take a slightly more complex approach. A fresh queue is created for each call to a make_request_fn. After it completes, any bios for a different device are placed on the front of the main queue, followed by any bios for the same device, followed by all bios that were already on the queue before the make_request_fn was called. This provides the depth-first approach without reordering bios on the same level. This, by itself, it not enough to remove all deadlocks. It just makes it possible for drivers to take the extra step required themselves. To avoid deadlocks, drivers must never risk waiting for a request after submitting one to generic_make_request. This includes never allocing from a mempool twice in the one call to a make_request_fn. A common pattern in drivers is to call bio_split() in a loop, handling the first part and then looping around to possibly split the next part. Instead, a driver that finds it needs to split a bio should queue (with generic_make_request) the second part, handle the first part, and then return. The new code in generic_make_request will ensure the requests to underlying bios are processed first, then the second bio that was split off. If it splits again, the same process happens. In each case one bio will be completely handled before the next one is attempted. With this is place, it should be possible to disable the punt_bios_to_recover() recovery thread for many block devices, and eventually it may be possible to remove it completely. Ref: http://www.spinics.net/lists/raid/msg54680.html Tested-by: Jinpu Wang <jinpu.wang@profitbricks.com> Inspired-by: Lars Ellenberg <lars.ellenberg@linbit.com> Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jens Axboe <axboe@fb.com> Cc: Jack Wang <jinpu.wang@profitbricks.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:36 +02:00
Johannes Weiner	e3a55294fc	mm: workingset: fix premature shadow node shrinking with cgroups commit `0cefabdaf7` upstream. Commit `0a6b76dd23` ("mm: workingset: make shadow node shrinker memcg aware") enabled cgroup-awareness in the shadow node shrinker, but forgot to also enable cgroup-awareness in the list_lru the shadow nodes sit on. Consequently, all shadow nodes are sitting on a global (per-NUMA node) list, while the shrinker applies the limits according to the amount of cache in the cgroup its shrinking. The result is excessive pressure on the shadow nodes from cgroups that have very little cache. Enable memcg-mode on the shadow node LRUs, such that per-cgroup limits are applied to per-cgroup lists. Fixes: `0a6b76dd23` ("mm: workingset: make shadow node shrinker memcg aware") Link: http://lkml.kernel.org/r/20170322005320.8165-1-hannes@cmpxchg.org Signed-off-by: Johannes Weiner <hannes@cmpxchg.org> Acked-by: Vladimir Davydov <vdavydov@tarantool.org> Cc: Michal Hocko <mhocko@suse.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:36 +02:00
Felix Fietkau	362721c495	MIPS: Lantiq: Fix cascaded IRQ setup commit `6c356eda22` upstream. With the IRQ stack changes integrated, the XRX200 devices started emitting a constant stream of kernel messages like this: [ 565.415310] Spurious IRQ: CAUSE=0x1100c300 This is caused by IP0 getting handled by plat_irq_dispatch() rather than its vectored interrupt handler, which is fixed by commit de856416e714 ("MIPS: IRQ Stack: Fix erroneous jal to plat_irq_dispatch"). Fix plat_irq_dispatch() to handle non-vectored IPI interrupts correctly by setting up IP2-6 as proper chained IRQ handlers and calling do_IRQ for all MIPS CPU interrupts. Signed-off-by: Felix Fietkau <nbd@nbd.name> Acked-by: John Crispin <john@phrozen.org> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/15077/ [james.hogan@imgtec.com: tweaked commit message] Signed-off-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Jon Mason	1b442f9bdf	ARM: dts: BCM5301X: Correct GIC_PPI interrupt flags commit `0c2bf9f959` upstream. GIC_PPI flags were misconfigured for the timers, resulting in errors like: [ 0.000000] GIC: PPI11 is secure or misconfigured Changing them to being edge triggered corrects the issue Suggested-by: Rafał Miłecki <rafal@milecki.pl> Signed-off-by: Jon Mason <jon.mason@broadcom.com> Fixes: `d27509f1` ("ARM: BCM5301X: add dts files for BCM4708 SoC") Signed-off-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Joe Carnuccio	c1716f0c35	qla2xxx: Allow vref count to timeout on vport delete. commit `c4a9b538ab` upstream. Signed-off-by: Joe Carnuccio <joe.carnuccio@cavium.com> Signed-off-by: Himanshu Madhani <himanshu.madhani@cavium.com> Signed-off-by: Nicholas Bellinger <nab@linux-iscsi.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Rafał Miłecki	5ed56ca86f	ARM: BCM5301X: Add back handler ignoring external imprecise aborts commit `09f3510fb7` upstream. Since early BCM5301X days we got abort handler that was removed by commit `937b12306e` ("ARM: BCM5301X: remove workaround imprecise abort fault handler"). It assumed we need to deal only with pending aborts left by the bootloader. Unfortunately this isn't true for BCM5301X. When probing PCI config space (device enumeration) it is expected to have master aborts on the PCI bus. Most bridges don't forward (or they allow disabling it) these errors onto the AXI/AMBA bus but not the Northstar (BCM5301X) one. iProc PCIe controller on Northstar seems to be some older one, without a control register for errors forwarding. It means we need to workaround this at platform level. All newer platforms are not affected by this issue. Signed-off-by: Rafał Miłecki <rafal@milecki.pl> Signed-off-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Naoya Horiguchi	40c5b99f8a	mm, hugetlb: use pte_present() instead of pmd_present() in follow_huge_pmd() commit `c9d398fa23` upstream. I found the race condition which triggers the following bug when move_pages() and soft offline are called on a single hugetlb page concurrently. Soft offlining page 0x119400 at 0x700000000000 BUG: unable to handle kernel paging request at ffffea0011943820 IP: follow_huge_pmd+0x143/0x190 PGD 7ffd2067 PUD 7ffd1067 PMD 0 [61163.582052] Oops: 0000 [#1] SMP Modules linked in: binfmt_misc ppdev virtio_balloon parport_pc pcspkr i2c_piix4 parport i2c_core acpi_cpufreq ip_tables xfs libcrc32c ata_generic pata_acpi virtio_blk 8139too crc32c_intel ata_piix serio_raw libata virtio_pci 8139cp virtio_ring virtio mii floppy dm_mirror dm_region_hash dm_log dm_mod [last unloaded: cap_check] CPU: 0 PID: 22573 Comm: iterate_numa_mo Tainted: P OE 4.11.0-rc2-mm1+ #2 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:follow_huge_pmd+0x143/0x190 RSP: 0018:ffffc90004bdbcd0 EFLAGS: 00010202 RAX: 0000000465003e80 RBX: ffffea0004e34d30 RCX: 00003ffffffff000 RDX: 0000000011943800 RSI: 0000000000080001 RDI: 0000000465003e80 RBP: ffffc90004bdbd18 R08: 0000000000000000 R09: ffff880138d34000 R10: ffffea0004650000 R11: 0000000000c363b0 R12: ffffea0011943800 R13: ffff8801b8d34000 R14: ffffea0000000000 R15: 000077ff80000000 FS: 00007fc977710740(0000) GS:ffff88007dc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: ffffea0011943820 CR3: 000000007a746000 CR4: 00000000001406f0 Call Trace: follow_page_mask+0x270/0x550 SYSC_move_pages+0x4ea/0x8f0 SyS_move_pages+0xe/0x10 do_syscall_64+0x67/0x180 entry_SYSCALL64_slow_path+0x25/0x25 RIP: 0033:0x7fc976e03949 RSP: 002b:00007ffe72221d88 EFLAGS: 00000246 ORIG_RAX: 0000000000000117 RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fc976e03949 RDX: 0000000000c22390 RSI: 0000000000001400 RDI: 0000000000005827 RBP: 00007ffe72221e00 R08: 0000000000c2c3a0 R09: 0000000000000004 R10: 0000000000c363b0 R11: 0000000000000246 R12: 0000000000400650 R13: 00007ffe72221ee0 R14: 0000000000000000 R15: 0000000000000000 Code: 81 e4 ff ff 1f 00 48 21 c2 49 c1 ec 0c 48 c1 ea 0c 4c 01 e2 49 bc 00 00 00 00 00 ea ff ff 48 c1 e2 06 49 01 d4 f6 45 bc 04 74 90 <49> 8b 7c 24 20 40 f6 c7 01 75 2b 4c 89 e7 8b 47 1c 85 c0 7e 2a RIP: follow_huge_pmd+0x143/0x190 RSP: ffffc90004bdbcd0 CR2: ffffea0011943820 ---[ end trace e4f81353a2d23232 ]--- Kernel panic - not syncing: Fatal exception Kernel Offset: disabled This bug is triggered when pmd_present() returns true for non-present hugetlb, so fixing the present check in follow_huge_pmd() prevents it. Using pmd_present() to determine present/non-present for hugetlb is not correct, because pmd_present() checks multiple bits (not only _PAGE_PRESENT) for historical reason and it can misjudge hugetlb state. Fixes: `e66f17ff71` ("mm/hugetlb: take page table lock in follow_huge_pmd()") Link: http://lkml.kernel.org/r/1490149898-20231-1-git-send-email-n-horiguchi@ah.jp.nec.com Signed-off-by: Naoya Horiguchi <n-horiguchi@ah.jp.nec.com> Acked-by: Hillf Danton <hillf.zj@alibaba-inc.com> Cc: Hugh Dickins <hughd@google.com> Cc: Michal Hocko <mhocko@kernel.org> Cc: "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com> Cc: Mike Kravetz <mike.kravetz@oracle.com> Cc: Christian Borntraeger <borntraeger@de.ibm.com> Cc: Gerald Schaefer <gerald.schaefer@de.ibm.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Johannes Weiner	b5707920e4	mm: rmap: fix huge file mmap accounting in the memcg stats commit `553af430e7` upstream. Huge pages are accounted as single units in the memcg's "file_mapped" counter. Account the correct number of base pages, like we do in the corresponding node counter. Link: http://lkml.kernel.org/r/20170322005111.3156-1-hannes@cmpxchg.org Signed-off-by: Johannes Weiner <hannes@cmpxchg.org> Reviewed-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com> Acked-by: Michal Hocko <mhocko@suse.com> Cc: Vladimir Davydov <vdavydov.dev@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Kees Cook	673dfb6d1b	lib/syscall: Clear return values when no stack commit `854fbd6e5f` upstream. Commit: `aa1f1a6396` ("lib/syscall: Pin the task stack in collect_syscall()") ... added logic to handle a process stack not existing, but left sp and pc uninitialized, which can be later reported via /proc/$pid/syscall for zombie processes, potentially exposing kernel memory to userspace. Zombie /proc/$pid/syscall before: -1 0xffffffff9a060100 0xffff92f42d6ad900 Zombie /proc/$pid/syscall after: -1 0x0 0x0 Reported-by: Robert Święcki <robert@swiecki.net> Signed-off-by: Kees Cook <keescook@chromium.org> Reviewed-by: Andy Lutomirski <luto@kernel.org> Cc: Borislav Petkov <bp@alien8.de> Cc: Brian Gerst <brgerst@gmail.com> Cc: Denys Vlasenko <dvlasenk@redhat.com> Cc: H. Peter Anvin <hpa@zytor.com> Cc: Josh Poimboeuf <jpoimboe@redhat.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Fixes: `aa1f1a6396` ("lib/syscall: Pin the task stack in collect_syscall()") Link: http://lkml.kernel.org/r/20170323224616.GA92694@beast Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Tony Luck	af5ef6dafe	x86/mce: Fix copy/paste error in exception table entries commit `26a37ab319` upstream. Back in commit: `92b0729c34` ("x86/mm, x86/mce: Add memcpy_mcsafe()") ... I made a copy/paste error setting up the exception table entries and ended up with two for label .L_cache_w3 and none for .L_cache_w2. This means that if we take a machine check on: .L_cache_w2: movq 2*8(%rsi), %r10 then we don't have an exception table entry for this instruction and we can't recover. Fix: s/3/2/ Signed-off-by: Tony Luck <tony.luck@intel.com> Cc: Andy Lutomirski <luto@kernel.org> Cc: Borislav Petkov <bp@alien8.de> Cc: Brian Gerst <brgerst@gmail.com> Cc: Denys Vlasenko <dvlasenk@redhat.com> Cc: H. Peter Anvin <hpa@zytor.com> Cc: Josh Poimboeuf <jpoimboe@redhat.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Fixes: `92b0729c34` ("x86/mm, x86/mce: Add memcpy_mcsafe()") Link: http://lkml.kernel.org/r/1490046030-25862-1-git-send-email-tony.luck@intel.com Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Baoquan He	2211d19ac6	x86/mm/KASLR: Exclude EFI region from KASLR VA space randomization commit `a46f60d760` upstream. Currently KASLR is enabled on three regions: the direct mapping of physical memory, vamlloc and vmemmap. However the EFI region is also mistakenly included for VA space randomization because of misusing EFI_VA_START macro and assuming EFI_VA_START < EFI_VA_END. (This breaks kexec and possibly other things that rely on stable addresses.) The EFI region is reserved for EFI runtime services virtual mapping which should not be included in KASLR ranges. In Documentation/x86/x86_64/mm.txt, we can see: ffffffef00000000 - fffffffeffffffff (=64 GB) EFI region mapping space EFI uses the space from -4G to -64G thus EFI_VA_START > EFI_VA_END, Here EFI_VA_START = -4G, and EFI_VA_END = -64G. Changing EFI_VA_START to EFI_VA_END in mm/kaslr.c fixes this problem. Signed-off-by: Baoquan He <bhe@redhat.com> Reviewed-by: Bhupesh Sharma <bhsharma@redhat.com> Acked-by: Dave Young <dyoung@redhat.com> Acked-by: Thomas Garnier <thgarnie@google.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Andy Lutomirski <luto@kernel.org> Cc: Ard Biesheuvel <ard.biesheuvel@linaro.org> Cc: Borislav Petkov <bp@alien8.de> Cc: Brian Gerst <brgerst@gmail.com> Cc: Denys Vlasenko <dvlasenk@redhat.com> Cc: H. Peter Anvin <hpa@zytor.com> Cc: Josh Poimboeuf <jpoimboe@redhat.com> Cc: Kees Cook <keescook@chromium.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Masahiro Yamada <yamada.masahiro@socionext.com> Cc: Matt Fleming <matt@codeblueprint.co.uk> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: linux-efi@vger.kernel.org Link: http://lkml.kernel.org/r/1490331592-31860-1-git-send-email-bhe@redhat.com Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Lucas Stach	4366c7e346	drm/etnaviv: (re-)protect fence allocation with GPU mutex commit `f3cd1b064f` upstream. The fence allocation needs to be protected by the GPU mutex, otherwise the fence seqnos of concurrent submits might not match the insertion order of the jobs in the kernel ring. This breaks the assumption that jobs complete with monotonically increasing fence seqnos. Fixes: `d985349017` (drm/etnaviv: take GPU lock later in the submit process) Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Eric Anholt	6acf520708	drm/vc4: Allocate the right amount of space for boot-time CRTC state. commit `6d6e500391` upstream. Without this, the first modeset would dereference past the allocation when trying to free the mm node. Signed-off-by: Eric Anholt <eric@anholt.net> Tested-by: Stefan Wahren <stefan.wahren@i2se.com> Link: http://patchwork.freedesktop.org/patch/msgid/20170328201343.4884-1-eric@anholt.net Fixes: `d8dbf44f13` ("drm/vc4: Make the CRTCs cooperate on allocating display lists.") Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
Michel Dänzer	fa68e1d3ce	drm/radeon: Override fpfn for all VRAM placements in radeon_evict_flags commit `ce4b4f228e` upstream. We were accidentally only overriding the first VRAM placement. For BOs with the RADEON_GEM_NO_CPU_ACCESS flag set, radeon_ttm_placement_from_domain creates a second VRAM placment with fpfn == 0. If VRAM is almost full, the first VRAM placement with fpfn > 0 may not work, but the second one with fpfn == 0 always will (the BO's current location trivially satisfies it). Because "moving" the BO to its current location puts it back on the LRU list, this results in an infinite loop. Fixes: `2a85aedd11` ("drm/radeon: Try evicting from CPU accessible to inaccessible VRAM first") Reported-by: Zachary Michaels <zmichaels@oblong.com> Reported-and-Tested-by: Julien Isorce <jisorce@oblong.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:35 +02:00
David Hildenbrand	1563625c71	KVM: kvm_io_bus_unregister_dev() should never fail commit `90db10434b` upstream. No caller currently checks the return value of kvm_io_bus_unregister_dev(). This is evil, as all callers silently go on freeing their device. A stale reference will remain in the io_bus, getting at least used again, when the iobus gets teared down on kvm_destroy_vm() - leading to use after free errors. There is nothing the callers could do, except retrying over and over again. So let's simply remove the bus altogether, print an error and make sure no one can access this broken bus again (returning -ENOMEM on any attempt to access it). Fixes: `e93f8a0f82` ("KVM: convert io_bus to SRCU") Reported-by: Dmitry Vyukov <dvyukov@google.com> Reviewed-by: Cornelia Huck <cornelia.huck@de.ibm.com> Signed-off-by: David Hildenbrand <david@redhat.com> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Peter Xu	ef46a13b9c	KVM: x86: clear bus pointer when destroyed commit `df630b8c1e` upstream. When releasing the bus, let's clear the bus pointers to mark it out. If any further device unregister happens on this bus, we know that we're done if we found the bus being released already. Signed-off-by: Peter Xu <peterx@redhat.com> Signed-off-by: Radim Krčmář <rkrcmar@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Uwe Kleine-König	5289f1ce39	serial: mxs-auart: Fix baudrate calculation commit `a6040bc610` upstream. The reference manual for the i.MX28 recommends to calculate the divisor as divisor = (UARTCLK * 32) / baud rate, rounded to the nearest integer , so let's do this. For a typical setup of UARTCLK = 24 MHz and baud rate = 115200 this changes the divisor from 6666 to 6667 and so the actual baud rate improves from 115211.521 Bd (error ≅ 0.01 %) to 115194.240 Bd (error ≅ 0.005 %). Signed-off-by: Uwe Kleine-König <u.kleine-koenig@pengutronix.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Alan Stern	67e41b1368	USB: fix linked-list corruption in rh_call_control() commit `1633682053` upstream. Using KASAN, Dmitry found a bug in the rh_call_control() routine: If buffer allocation fails, the routine returns immediately without unlinking its URB from the control endpoint, eventually leading to linked-list corruption. This patch fixes the problem by jumping to the end of the routine (where the URB is unlinked) when an allocation failure occurs. Signed-off-by: Alan Stern <stern@rowland.harvard.edu> Reported-and-tested-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Nicolas Ferre	1f1c9e2965	tty/serial: atmel: fix TX path in atmel_console_write() commit `497e1e16f4` upstream. A side effect of `89d8232411` ("tty/serial: atmel_serial: BUG: stop DMA from transmitting in stop_tx") is that the console can be called with TX path disabled. Then the system would hang trying to push charecters out in atmel_console_putchar(). Signed-off-by: Nicolas Ferre <nicolas.ferre@microchip.com> Fixes: `89d8232411` ("tty/serial: atmel_serial: BUG: stop DMA from transmitting in stop_tx") Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Richard Genoud	e087ae68e8	tty/serial: atmel: fix race condition (TX+DMA) commit `31ca2c63fd` upstream. If uart_flush_buffer() is called between atmel_tx_dma() and atmel_complete_tx_dma(), the circular buffer has been cleared, but not atmel_port->tx_len. That leads to a circular buffer overflow (dumping (UART_XMIT_SIZE - atmel_port->tx_len) bytes). Tested-by: Nicolas Ferre <nicolas.ferre@microchip.com> Signed-off-by: Richard Genoud <richard.genoud@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Joerg Roedel	b3641939b1	ACPI: Do not create a platform_device for IOAPIC/IOxAPIC commit `08f63d9774` upstream. No platform-device is required for IO(x)APICs, so don't even create them. [ rjw: This fixes a problem with leaking platform device objects after IOAPIC/IOxAPIC hot-removal events.] Signed-off-by: Joerg Roedel <jroedel@suse.de> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Josh Poimboeuf	000d2bb6c0	ACPI: Fix incompatibility with mcount-based function graph tracing commit `61b79e16c6` upstream. Paul Menzel reported a warning: WARNING: CPU: 0 PID: 774 at /build/linux-ROBWaj/linux-4.9.13/kernel/trace/trace_functions_graph.c:233 ftrace_return_to_handler+0x1aa/0x1e0 Bad frame pointer: expected f6919d98, received f6919db0 from func acpi_pm_device_sleep_wake return to c43b6f9d The warning means that function graph tracing is broken for the acpi_pm_device_sleep_wake() function. That's because the ACPI Makefile unconditionally sets the '-Os' gcc flag to optimize for size. That's an issue because mcount-based function graph tracing is incompatible with '-Os' on x86, thanks to the following gcc bug: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=42109 I have another patch pending which will ensure that mcount-based function graph tracing is never used with CONFIG_CC_OPTIMIZE_FOR_SIZE on x86. But this patch is needed in addition to that one because the ACPI Makefile overrides that config option for no apparent reason. It has had this flag since the beginning of git history, and there's no related comment, so I don't know why it's there. As far as I can tell, there's no reason for it to be there. The appropriate behavior is for it to honor CONFIG_CC_OPTIMIZE_FOR_{SIZE,PERFORMANCE} like the rest of the kernel. Reported-by: Paul Menzel <pmenzel@molgen.mpg.de> Signed-off-by: Josh Poimboeuf <jpoimboe@redhat.com> Acked-by: Steven Rostedt (VMware) <rostedt@goodmis.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Helge Deller	76343bfbca	parisc: Fix access fault handling in pa_memcpy() commit `554bfeceb8` upstream. pa_memcpy() is the major memcpy implementation in the parisc kernel which is used to do any kind of userspace/kernel memory copies. Al Viro noticed various bugs in the implementation of pa_mempcy(), most notably that in case of faults it may report back to have copied more bytes than it actually did. Fixing those bugs is quite hard in the C-implementation, because the compiler is messing around with the registers and we are not guaranteed that specific variables are always in the same processor registers. This makes proper fault handling complicated. This patch implements pa_memcpy() in assembler. That way we have correct fault handling and adding a 64-bit copy routine was quite easy. Runtime tested with 32- and 64bit kernels. Reported-by: Al Viro <viro@ZenIV.linux.org.uk> Signed-off-by: John David Anglin <dave.anglin@bell.net> Signed-off-by: Helge Deller <deller@gmx.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Helge Deller	99e354a59a	parisc: Avoid stalled CPU warnings after system shutdown commit `476e75a44b` upstream. Commit `73580dac76` ("parisc: Fix system shutdown halt") introduced an endless loop for systems which don't provide a software power off function. But the soft lockup detector will detect this and report stalled CPUs after some time. Avoid those unwanted warnings by disabling the soft lockup detector. Fixes: `73580dac76` ("parisc: Fix system shutdown halt") Signed-off-by: Helge Deller <deller@gmx.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Helge Deller	09b931fcb8	parisc: Clean up fixup routines for get_user()/put_user() commit `d19f5e41b3` upstream. Al Viro noticed that userspace accesses via get_user()/put_user() can be simplified a lot with regard to usage of the exception handling. This patch implements a fixup routine for get_user() and put_user() in such that the exception handler will automatically load -EFAULT into the register %r8 (the error value) in case on a fault on userspace. Additionally the fixup routine will zero the target register on fault in case of a get_user() call. The target register is extracted out of the faulting assembly instruction. This patch brings a few benefits over the old implementation: 1. Exception handling gets much cleaner, easier and smaller in size. 2. Helper functions like fixup_get_user_skip_1 (all of fixup.S) can be dropped. 3. No need to hardcode %r9 as target register for get_user() any longer. This helps the compiler register allocator and thus creates less assembler statements. 4. No dependency on the exception_data contents any longer. 5. Nested faults will be handled cleanly. Reported-by: Al Viro <viro@ZenIV.linux.org.uk> Signed-off-by: Helge Deller <deller@gmx.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:34 +02:00
Kinglong Mee	3967cf7e6a	nfsd: map the ENOKEY to nfserr_perm for avoiding warning commit `c952cd4e94` upstream. Now that Ext4 and f2fs filesystems support encrypted directories and files, attempts to access those files may return ENOKEY, resulting in the following WARNING. Map ENOKEY to nfserr_perm instead of nfserr_io. [ 1295.411759] ------------[ cut here ]------------ [ 1295.411787] WARNING: CPU: 0 PID: 12786 at fs/nfsd/nfsproc.c:796 nfserrno+0x74/0x80 [nfsd] [ 1295.411806] nfsd: non-standard errno: -126 [ 1295.411816] Modules linked in: nfsd nfs_acl auth_rpcgss nfsv4 nfs lockd fscache tun bridge stp llc fuse ip_set nfnetlink vmw_vsock_vmci_transport vsock snd_seq_midi snd_seq_midi_event coretemp crct10dif_pclmul crc32_generic crc32_pclmul snd_ens1371 gameport ghash_clmulni_intel snd_ac97_codec f2fs intel_rapl_perf ac97_bus snd_seq ppdev snd_pcm snd_rawmidi snd_timer vmw_balloon snd_seq_device snd joydev soundcore parport_pc parport nfit acpi_cpufreq tpm_tis vmw_vmci tpm_tis_core tpm shpchp i2c_piix4 grace sunrpc xfs libcrc32c vmwgfx drm_kms_helper ttm drm crc32c_intel e1000 mptspi scsi_transport_spi serio_raw mptscsih mptbase ata_generic pata_acpi fjes [last unloaded: nfs_acl] [ 1295.412522] CPU: 0 PID: 12786 Comm: nfsd Tainted: G W 4.11.0-rc1+ #521 [ 1295.412959] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 07/02/2015 [ 1295.413814] Call Trace: [ 1295.414252] dump_stack+0x63/0x86 [ 1295.414666] __warn+0xcb/0xf0 [ 1295.415087] warn_slowpath_fmt+0x5f/0x80 [ 1295.415502] ? put_filp+0x42/0x50 [ 1295.415927] nfserrno+0x74/0x80 [nfsd] [ 1295.416339] nfsd_open+0xd7/0x180 [nfsd] [ 1295.416746] nfs4_get_vfs_file+0x367/0x3c0 [nfsd] [ 1295.417182] ? security_inode_permission+0x41/0x60 [ 1295.417591] nfsd4_process_open2+0x9b2/0x1200 [nfsd] [ 1295.418007] nfsd4_open+0x481/0x790 [nfsd] [ 1295.418409] nfsd4_proc_compound+0x395/0x680 [nfsd] [ 1295.418812] nfsd_dispatch+0xb8/0x1f0 [nfsd] [ 1295.419233] svc_process_common+0x4d9/0x830 [sunrpc] [ 1295.419631] svc_process+0xfe/0x1b0 [sunrpc] [ 1295.420033] nfsd+0xe9/0x150 [nfsd] [ 1295.420420] kthread+0x101/0x140 [ 1295.420802] ? nfsd_destroy+0x60/0x60 [nfsd] [ 1295.421199] ? kthread_park+0x90/0x90 [ 1295.421598] ret_from_fork+0x2c/0x40 [ 1295.421996] ---[ end trace 0d5a969cd7852e1f ]--- Signed-off-by: Kinglong Mee <kinglongmee@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Olga Kornievskaia	461bbb9094	NFSv4.1 fix infinite loop on IO BAD_STATEID error commit `0e3d3e5df0` upstream. Commit `63d63cbf5e` "NFSv4.1: Don't recheck delegations that have already been checked" introduced a regression where when a client received BAD_STATEID error it would not send any TEST_STATEID and instead go into an infinite loop of resending the IO that caused the BAD_STATEID. Fixes: `63d63cbf5e` ("NFSv4.1: Don't recheck delegations that have already been checked") Signed-off-by: Olga Kornievskaia <kolga@netapp.com> Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Ludovic Desroches	80df2b3e18	mmc: sdhci-of-at91: fix MMC_DDR_52 timing selection commit `d0918764c1` upstream. The controller has different timings for MMC_TIMING_UHS_DDR50 and MMC_TIMING_MMC_DDR52. Configuring the controller with SDHCI_CTRL_UHS_DDR50, when MMC_TIMING_MMC_DDR52 timings are requested, is not correct and can lead to unexpected behavior. Signed-off-by: Ludovic Desroches <ludovic.desroches@microchip.com> Fixes: `bb5f8ea4d5` ("mmc: sdhci-of-at91: introduce driver for the Atmel SDMMC") Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Hans de Goede	fa3b4f4f57	mmc: sdhci: Disable runtime pm when the sdio_irq is enabled commit `923713b357` upstream. SDIO cards may need clock to send the card interrupt to the host. On a cherrytrail tablet with a RTL8723BS wifi chip, without this patch pinging the tablet results in: PING 192.168.1.14 (192.168.1.14) 56(84) bytes of data. 64 bytes from 192.168.1.14: icmp_seq=1 ttl=64 time=78.6 ms 64 bytes from 192.168.1.14: icmp_seq=2 ttl=64 time=1760 ms 64 bytes from 192.168.1.14: icmp_seq=3 ttl=64 time=753 ms 64 bytes from 192.168.1.14: icmp_seq=4 ttl=64 time=3.88 ms 64 bytes from 192.168.1.14: icmp_seq=5 ttl=64 time=795 ms 64 bytes from 192.168.1.14: icmp_seq=6 ttl=64 time=1841 ms 64 bytes from 192.168.1.14: icmp_seq=7 ttl=64 time=810 ms 64 bytes from 192.168.1.14: icmp_seq=8 ttl=64 time=1860 ms 64 bytes from 192.168.1.14: icmp_seq=9 ttl=64 time=812 ms 64 bytes from 192.168.1.14: icmp_seq=10 ttl=64 time=48.6 ms Where as with this patch I get: PING 192.168.1.14 (192.168.1.14) 56(84) bytes of data. 64 bytes from 192.168.1.14: icmp_seq=1 ttl=64 time=3.96 ms 64 bytes from 192.168.1.14: icmp_seq=2 ttl=64 time=1.97 ms 64 bytes from 192.168.1.14: icmp_seq=3 ttl=64 time=17.2 ms 64 bytes from 192.168.1.14: icmp_seq=4 ttl=64 time=2.46 ms 64 bytes from 192.168.1.14: icmp_seq=5 ttl=64 time=2.83 ms 64 bytes from 192.168.1.14: icmp_seq=6 ttl=64 time=1.40 ms 64 bytes from 192.168.1.14: icmp_seq=7 ttl=64 time=2.10 ms 64 bytes from 192.168.1.14: icmp_seq=8 ttl=64 time=1.40 ms 64 bytes from 192.168.1.14: icmp_seq=9 ttl=64 time=2.04 ms 64 bytes from 192.168.1.14: icmp_seq=10 ttl=64 time=1.40 ms Cc: Dong Aisheng <b29396@freescale.com> Cc: Ian W MORRISON <ianwmorrison@gmail.com> Signed-off-by: Hans de Goede <hdegoede@redhat.com> Acked-by: Adrian Hunter <adrian.hunter@intel.com> Acked-by: Dong Aisheng <aisheng.dong@nxp.com> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Aaron Armstrong Skomra	8d6c332242	HID: wacom: Don't add ghost interface as shared data commit `8b40735969` upstream. A previous commit (below) adds a check for already probed interfaces to Wacom's matching heuristic. Unfortunately this causes the Bamboo Pen (CTL-460) to match itself to its 'ghost' touch interface. After subsequent changes to the driver this match to the ghost causes the kernel to crash. This patch avoids calling wacom_add_shared_data() for the BAMBOO_PEN's ghost touch interface. Fixes: `41372d5d40` ("HID: wacom: Augment 'oVid' and 'oPid' with heuristics for HID_GENERIC") Signed-off-by: Aaron Armstrong Skomra <aaron.skomra@wacom.com> Signed-off-by: Jiri Kosina <jkosina@suse.cz> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Takashi Sakamoto	e5a1347391	ASoC: Intel: Skylake: fix invalid memory access due to wrong reference of pointer commit `d1a6fe41d3` upstream. In 'skl_tplg_set_module_init_data()', a pointer to 'params' member of 'struct skl_algo_data' is calculated, then casted to (u32 ) and assigned to a member of configuration data. The configuration data is passed to the other functions and used to process intel IPC. In this processing, the value of member is used to get message data, however this can bring invalid memory access in 'skl_set_module_params()' as a result of calculation of a pointer for actual message data. (sound/soc/intel/skylake/skl-topology.c) skl_tplg_init_pipe_modules() ->skl_tplg_set_module_init_data() (has this bug) ->skl_tplg_set_module_params() (sound/soc/intel/skylake/skl-messages.c) ->skl_set_module_params() ((char )param) + data_offset This commit fixes the bug. Fixes: `abb740033b` ("ASoC: Intel: Skylake: Add support to configure module params") Signed-off-by: Takashi Sakamoto <takashi.sakamoto@miraclelinux.com> Acked-by: Vinod Koul <vinod.koul@intel.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Songjun Wu	7a042a4eeb	ASoC: atmel-classd: fix audio clock rate commit `cd3ac9affc` upstream. Fix the audio clock rate according to the datasheet. Reported-by: Dushara Jayasinghe <dushara@successful.com.au> Signed-off-by: Songjun Wu <songjun.wu@microchip.com> Acked-by: Nicolas Ferre <nicolas.ferre@microchip.com> Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Hui Wang	8aabccdc9d	ALSA: hda - fix a problem for lineout on a Dell AIO machine commit `2f726aec19` upstream. On this Dell AIO machine, the lineout jack does not work. We found the pin 0x1a is assigned to lineout on this machine, and in the past, we applied ALC298_FIXUP_DELL1_MIC_NO_PRESENCE to fix the heaset-set mic problem for this machine, this fixup will redefine the pin 0x1a to headphone-mic, as a result the lineout doesn't work anymore. After consulting with Dell, they told us this machine doesn't support microphone via headset jack, so we add a new fixup which only defines the pin 0x18 as the headset-mic. [rearranged the fixup insertion position by tiwai in order to make the merge with other branches easier -- tiwai] Fixes: `59ec4b57bc` ("ALSA: hda - Fix headset mic detection problem for two dell machines") Signed-off-by: Hui Wang <hui.wang@canonical.com> Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Takashi Iwai	74a2c1ff88	ALSA: seq: Fix race during FIFO resize commit `2d7d54002e` upstream. When a new event is queued while processing to resize the FIFO in snd_seq_fifo_clear(), it may lead to a use-after-free, as the old pool that is being queued gets removed. For avoiding this race, we need to close the pool to be deleted and sync its usage before actually deleting it. The issue was spotted by syzkaller. Reported-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Bjorn Helgaas	0dd5b335ed	PCI: iproc: Save host bridge window resource in struct iproc_pcie commit `6e347b5e05` upstream. The host bridge memory window resource is inserted into the iomem_resource tree and cannot be deallocated until the host bridge itself is removed. Previously, the window was on the stack, which meant the iomem_resource entry pointed into the stack and was corrupted as soon as the probe function returned, which caused memory corruption and errors like this: pcie_iproc_bcma bcma0:8: resource collision: [mem 0x40000000-0x47ffffff] conflicts with PCIe MEM space [mem 0x40000000-0x47ffffff] Move the memory window resource from the stack into struct iproc_pcie so its lifetime matches that of the host bridge. Fixes: `c3245a5664` ("PCI: iproc: Request host bridge window resources") Reported-and-tested-by: Rafał Miłecki <zajec5@gmail.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Bart Van Assche	8f9155989f	scsi: scsi_dh_alua: Ensure that alua_activate() calls the completion function commit `7cb689fe42` upstream. Callers of scsi_dh_activate(), e.g. dm-mpath, assume that this function either returns an error code or calls the completion function. Make alua_activate() call the completion function even if scsi_device_get() fails. Signed-off-by: Bart Van Assche <bart.vanassche@sandisk.com> Cc: Hannes Reinecke <hare@suse.de> Cc: Tang Junhui <tang.junhui@zte.com.cn> Reviewed-by: Hannes Reinecke <hare@suse.de> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:33 +02:00
Bart Van Assche	68b275b7cb	scsi: scsi_dh_alua: Check scsi_device_get() return value commit `625fe857e4` upstream. Do not queue ALUA work nor call scsi_device_put() if the scsi_device_get() call fails. This patch fixes the following crash: general protection fault: 0000 [#1] SMP RIP: 0010:scsi_device_put+0xb/0x30 Call Trace: scsi_disk_put+0x2d/0x40 sd_release+0x3d/0xb0 __blkdev_put+0x29e/0x360 blkdev_put+0x49/0x170 dm_put_table_device+0x58/0xc0 [dm_mod] dm_put_device+0x70/0xc0 [dm_mod] free_priority_group+0x92/0xc0 [dm_multipath] free_multipath+0x70/0xc0 [dm_multipath] multipath_dtr+0x19/0x20 [dm_multipath] dm_table_destroy+0x67/0x120 [dm_mod] dev_suspend+0xde/0x240 [dm_mod] ctl_ioctl+0x1f5/0x520 [dm_mod] dm_ctl_ioctl+0xe/0x20 [dm_mod] do_vfs_ioctl+0x8f/0x700 SyS_ioctl+0x3c/0x70 entry_SYSCALL_64_fastpath+0x18/0xad Fixes: commit `03197b61c5` ("scsi_dh_alua: Use workqueue for RTPG") Signed-off-by: Bart Van Assche <bart.vanassche@sandisk.com> Cc: Hannes Reinecke <hare@suse.de> Cc: Tang Junhui <tang.junhui@zte.com.cn> Reviewed-by: Hannes Reinecke <hare@suse.de> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
John Garry	cf31d6d215	scsi: libsas: fix ata xfer length commit `9702c67c60` upstream. The total ata xfer length may not be calculated properly, in that we do not use the proper method to get an sg element dma length. According to the code comment, sg_dma_len() should be used after dma_map_sg() is called. This issue was found by turning on the SMMUv3 in front of the hisi_sas controller in hip07. Multiple sg elements were being combined into a single element, but the original first element length was being use as the total xfer length. Fixes: `ff2aeb1eb6` ("libata: convert to chained sg") Signed-off-by: John Garry <john.garry@huawei.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
peter chang	c2a8695278	scsi: sg: check length passed to SG_NEXT_CMD_LEN commit `bf33f87dd0` upstream. The user can control the size of the next command passed along, but the value passed to the ioctl isn't checked against the usable max command size. Signed-off-by: Peter Chang <dpf@google.com> Acked-by: Douglas Gilbert <dgilbert@interlog.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Christoph Hellwig	d5dbd1c959	xfs: try any AG when allocating the first btree block when reflinking commit `2fcc319d24` upstream. When a reflink operation causes the bmap code to allocate a btree block we're currently doing single-AG allocations due to having ->firstblock set and then try any higher AG due a little reflink quirk we've put in when adding the reflink code. But given that we do not have a minleft reservation of any kind in this AG we can still not have any space in the same or higher AG even if the file system has enough free space. To fix this use a XFS_ALLOCTYPE_FIRST_AG allocation in this fall back path instead. [And yes, we need to redo this properly instead of piling hacks over hacks. I'm working on that, but it's not going to be a small series. In the meantime this fixes the customer reported issue] Also add a warning for failing allocations to make it easier to debug. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Brian Foster	da617af8f0	xfs: use iomap new flag for newly allocated delalloc blocks commit `f65e6fad29` upstream. Commit `fa7f138` ("xfs: clear delalloc and cache on buffered write failure") fixed one regression in the iomap error handling code and exposed another. The fundamental problem is that if a buffered write is a rewrite of preexisting delalloc blocks and the write fails, the failure handling code can punch out preexisting blocks with valid file data. This was reproduced directly by sub-block writes in the LTP kernel/syscalls/write/write03 test. A first 100 byte write allocates a single block in a file. A subsequent 100 byte write fails and punches out the block, including the data successfully written by the previous write. To address this problem, update the ->iomap_begin() handler to distinguish newly allocated delalloc blocks from preexisting delalloc blocks via the IOMAP_F_NEW flag. Use this flag in the ->iomap_end() handler to decide when a failed or short write should punch out delalloc blocks. This introduces the subtle requirement that ->iomap_begin() should never combine newly allocated delalloc blocks with existing blocks in the resulting iomap descriptor. This can occur when a new delalloc reservation merges with a neighboring extent that is part of the current write, for example. Therefore, drop the post-allocation extent lookup from xfs_bmapi_reserve_delalloc() and just return the record inserted into the fork. This ensures only new blocks are returned and thus that preexisting delalloc blocks are always handled as "found" blocks and not punched out on a failed rewrite. Reported-by: Xiong Zhou <xzhou@redhat.com> Signed-off-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Chandan Rajendra	77aedb0cbe	xfs: Use xfs_icluster_size_fsb() to calculate inode alignment mask commit `d5825712ee` upstream. When block size is larger than inode cluster size, the call to XFS_B_TO_FSBT(mp, mp->m_inode_cluster_size) returns 0. Also, mkfs.xfs would have set xfs_sb->sb_inoalignmt to 0. Hence in xfs_set_inoalignment(), xfs_mount->m_inoalign_mask gets initialized to -1 instead of 0. However, xfs_mount->m_sinoalign would get correctly intialized to 0 because for every positive value of xfs_mount->m_dalign, the condition "!(mp->m_dalign & mp->m_inoalign_mask)" would evaluate to false. Also, xfs_imap() worked fine even with xfs_mount->m_inoalign_mask having -1 as the value because blks_per_cluster variable would have the value 1 and hence we would never have a need to use xfs_mount->m_inoalign_mask to compute the inode chunk's agbno and offset within the chunk. Signed-off-by: Chandan Rajendra <chandan@linux.vnet.ibm.com> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Christoph Hellwig	d07b5855ab	xfs: fix and streamline error handling in xfs_end_io commit `787eb48550` upstream. There are two different cases of buffered I/O errors: - first we can have an already shutdown fs. In that case we should skip any on-disk operations and just clean up the appen transaction if present and destroy the ioend - a real I/O error. In that case we should cleanup any lingering COW blocks. This gets skipped in the current code and is fixed by this patch. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Christoph Hellwig	3b83a02af2	xfs: only reclaim unwritten COW extents periodically commit `3802a34532` upstream. We only want to reclaim preallocations from our periodic work item. Currently this is archived by looking for a dirty inode, but that check is rather fragile. Instead add a flag to xfs_reflink_cancel_cow_* so that the caller can ask for just cancelling unwritten extents in the COW fork. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> [darrick: fix typos in commit message] Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Christoph Hellwig	a240293694	xfs: tune down agno asserts in the bmap code commit `410d17f67e` upstream. In various places we currently assert that xfs_bmap_btalloc allocates from the same as the firstblock value passed in, unless it's either NULLAGNO or the dop_low flag is set. But the reflink code does not fully follow this convention as it passes in firstblock purely as a hint for the allocator without actually having previous allocations in the transaction, and without having a minleft check on the current AG, leading to the assert firing on a very full and heavily used file system. As even the reflink code only allocates from equal or higher AGs for now we can simply the check to always allow for equal or higher AGs. Note that we need to eventually split the two meanings of the firstblock value. At that point we can also allow the reflink code to allocate from any AG instead of limiting it in any way. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Chandan Rajendra	9559c48c1a	xfs: Use xfs_icluster_size_fsb() to calculate inode chunk alignment commit `8ee9fdbebc` upstream. On a ppc64 system, executing generic/256 test with 32k block size gives the following call trace, XFS: Assertion failed: args->maxlen > 0, file: /root/repos/linux/fs/xfs/libxfs/xfs_alloc.c, line: 2026 kernel BUG at /root/repos/linux/fs/xfs/xfs_message.c:113! Oops: Exception in kernel mode, sig: 5 [#1] SMP NR_CPUS=2048 DEBUG_PAGEALLOC NUMA pSeries Modules linked in: CPU: 2 PID: 19361 Comm: mkdir Not tainted 4.10.0-rc5 #58 task: c000000102606d80 task.stack: c0000001026b8000 NIP: c0000000004ef798 LR: c0000000004ef798 CTR: c00000000082b290 REGS: c0000001026bb090 TRAP: 0700 Not tainted (4.10.0-rc5) MSR: 8000000000029032 <SF,EE,ME,IR,DR,RI> CR: 28004428 XER: 00000000 CFAR: c0000000004ef180 SOFTE: 1 GPR00: c0000000004ef798 c0000001026bb310 c000000001157300 ffffffffffffffea GPR04: 000000000000000a c0000001026bb130 0000000000000000 ffffffffffffffc0 GPR08: 00000000000000d1 0000000000000021 00000000ffffffd1 c000000000dd4990 GPR12: 0000000022004444 c00000000fe00800 0000000020000000 0000000000000000 GPR16: 0000000000000000 0000000043a606fc 0000000043a76c08 0000000043a1b3d0 GPR20: 000001002a35cd60 c0000001026bbb80 0000000000000000 0000000000000001 GPR24: 0000000000000240 0000000000000004 c00000062dc55000 0000000000000000 GPR28: 0000000000000004 c00000062ecd9200 0000000000000000 c0000001026bb6c0 NIP [c0000000004ef798] .assfail+0x28/0x30 LR [c0000000004ef798] .assfail+0x28/0x30 Call Trace: [c0000001026bb310] [c0000000004ef798] .assfail+0x28/0x30 (unreliable) [c0000001026bb380] [c000000000455d74] .xfs_alloc_space_available+0x194/0x1b0 [c0000001026bb410] [c00000000045b914] .xfs_alloc_fix_freelist+0x144/0x480 [c0000001026bb580] [c00000000045c368] .xfs_alloc_vextent+0x698/0xa90 [c0000001026bb650] [c0000000004a6200] .xfs_ialloc_ag_alloc+0x170/0x820 [c0000001026bb7c0] [c0000000004a9098] .xfs_dialloc+0x158/0x320 [c0000001026bb8a0] [c0000000004e628c] .xfs_ialloc+0x7c/0x610 [c0000001026bb990] [c0000000004e8138] .xfs_dir_ialloc+0xa8/0x2f0 [c0000001026bbaa0] [c0000000004e8814] .xfs_create+0x494/0x790 [c0000001026bbbf0] [c0000000004e5ebc] .xfs_generic_create+0x2bc/0x410 [c0000001026bbce0] [c0000000002b4a34] .vfs_mkdir+0x154/0x230 [c0000001026bbd70] [c0000000002bc444] .SyS_mkdirat+0x94/0x120 [c0000001026bbe30] [c00000000000b760] system_call+0x38/0xfc Instruction dump: 4e800020 60000000 7c0802a6 7c862378 3c82ffca 7ca72b78 38841c18 7c651b78 38600000 f8010010 f821ff91 4bfff94d <0fe00000> 60000000 7c0802a6 7c892378 When block size is larger than inode cluster size, the call to XFS_B_TO_FSBT(mp, mp->m_inode_cluster_size) returns 0. Also, mkfs.xfs would have set xfs_sb->sb_inoalignmt to 0. This causes xfs_ialloc_cluster_alignment() to return 0. Due to this args.minalignslop (in xfs_ialloc_ag_alloc()) gets the unsigned equivalent of -1 assigned to it. This later causes alloc_len in xfs_alloc_space_available() to have a value of 0. In such a scenario when args.total is also 0, the assert statement "ASSERT(args->maxlen > 0);" fails. This commit fixes the bug by replacing the call to XFS_B_TO_FSBT() in xfs_ialloc_cluster_alignment() with a call to xfs_icluster_size_fsb(). Suggested-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Chandan Rajendra <chandan@linux.vnet.ibm.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Brian Foster	5db7b41b60	xfs: don't reserve blocks for right shift transactions commit `48af96ab92` upstream. The block reservation for the transaction allocated in xfs_shift_file_space() is an artifact of the original collapse range support. It exists to handle the case where a collapse range occurs, the initial extent is left shifted into a location that forms a contiguous boundary with the previous extent and thus the extents are merged. This code was subsequently refactored and reused for insert range (right shift) support. If an insert range occurs under low free space conditions, the extent at the starting offset is split before the first shift transaction is allocated. If the block reservation fails, this leaves separate, but contiguous extents around in the inode. While not a fatal problem, this is unexpected and will flag a warning on subsequent insert range operations on the inode. This problem has been reproduce intermittently by generic/270 running against a ramdisk device. Since right shift does not create new extent boundaries in the inode, a block reservation for extent merge is unnecessary. Update xfs_shift_file_space() to conditionally reserve fs blocks for left shift transactions only. This avoids the warning reproduced by generic/270. Reported-by: Ross Zwisler <ross.zwisler@linux.intel.com> Signed-off-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:32 +02:00
Darrick J. Wong	e5e2e56fd4	xfs: fix uninitialized variable in _reflink_convert_cow commit `93aaead52a` upstream. Fix an uninitialize variable. Reported-by: Dan Carpenter <dan.carpenter@oracle.com> Reviewed-by: Brian Foster <bfoster@redhat.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Brian Foster	c251c6c2de	xfs: split indlen reservations fairly when under reserved commit `75d65361cf` upstream. Certain workoads that punch holes into speculative preallocation can cause delalloc indirect reservation splits when the delalloc extent is split in two. If further splits occur, an already short-handed extent can be split into two in a manner that leaves zero indirect blocks for one of the two new extents. This occurs because the shortage is large enough that the xfs_bmap_split_indlen() algorithm completely drains the requested indlen of one of the extents before it honors the existing reservation. This ultimately results in a warning from xfs_bmap_del_extent(). This has been observed during file copies of large, sparse files using 'cp --sparse=always.' To avoid this problem, update xfs_bmap_split_indlen() to explicitly apply the reservation shortage fairly between both extents. This smooths out the overall indlen shortage and defers the situation where we end up with a delalloc extent with zero indlen reservation to extreme circumstances. Reported-by: Patrick Dung <mpatdung@gmail.com> Signed-off-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Brian Foster	2d7c1c7ffa	xfs: handle indlen shortage on delalloc extent merge commit `0e339ef855` upstream. When a delalloc extent is created, it can be merged with pre-existing, contiguous, delalloc extents. When this occurs, xfs_bmap_add_extent_hole_delay() merges the extents along with the associated indirect block reservations. The expectation here is that the combined worst case indlen reservation is always less than or equal to the indlen reservation for the individual extents. This is not always the case, however, as existing extents can less than the expected indlen reservation if the extent was previously split due to a hole punch. If a new extent merges with such an extent, the total indlen requirement may be larger than the sum of the indlen reservations held by both extents. xfs_bmap_add_extent_hole_delay() assumes that the worst case indlen reservation is always available and assigns it to the merged extent without consideration for the indlen held by the pre-existing extent. As a result, the subsequent xfs_mod_fdblocks() call can attempt an unintentional allocation rather than a free (indicated by an ASSERT() failure). Further, if the allocation happens to fail in this context, the failure goes unhandled and creates a filesystem wide block accounting inconsistency. Fix xfs_bmap_add_extent_hole_delay() to function as designed. Cap the indlen reservation assigned to the merged extent to the sum of the indlen reservations held by each of the individual extents. Signed-off-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Christoph Hellwig	47d7d1ea6c	xfs: don't fail xfs_extent_busy allocation commit `5e30c23d13` upstream. We don't just need the structure to track busy extents which can be avoided with a synchronous transaction, but also to keep track of pending discard. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Christoph Hellwig	5bbf5ba693	xfs: reject all unaligned direct writes to reflinked files commit `54a4ef8af4` upstream. We currently fall back from direct to buffered writes if we detect a remaining shared extent in the iomap_begin callback. But by the time iomap_begin is called for the potentially unaligned end block we might have already written most of the data to disk, which we'd now write again using buffered I/O. To avoid this reject all writes to reflinked files before starting I/O so that we are guaranteed to only write the data once. The alternative would be to unshare the unaligned start and/or end block before doing the I/O. I think that's doable, and will actually be required to support reflinks on DAX file system. But it will take a little more time and I'd rather get rid of the double write ASAP. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> [slight changes in context due to the new direct I/O code in 4.10+] Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Christoph Hellwig	67eb7bf836	xfs: update ctime and mtime on clone destinatation inodes commit `c5ecb42342` upstream. We're changing both metadata and data, so we need to update the timestamps for clone operations. Dedupe on the other hand does not change file data, and only changes invisible metadata so the timestamps should not be updated. This follows existing btrfs behavior. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> [darrick: remove redundant is_dedupe test] Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Hou Tao	e060f4884c	xfs: reset b_first_retry_time when clear the retry status of xfs_buf_t commit `4dd2eb6335` upstream. After successful IO or permanent error, b_first_retry_time also needs to be cleared, else the invalid first retry time will be used by the next retry check. Signed-off-by: Hou Tao <houtao1@huawei.com> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Darrick J. Wong	e02f0ff252	xfs: mark speculative prealloc CoW fork extents unwritten commit `5eda430000` upstream. Christoph Hellwig pointed out that there's a potentially nasty race when performing simultaneous nearby directio cow writes: "Thread 1 writes a range from B to c " B --------- C p "a little later thread 2 writes from A to B " A --------- B p [editor's note: the 'p' denote cowextsize boundaries, which I added to make this more clear] "but the code preallocates beyond B into the range where thread "1 has just written, but ->end_io hasn't been called yet. "But once ->end_io is called thread 2 has already allocated "up to the extent size hint into the write range of thread 1, "so the end_io handler will splice the unintialized blocks from "that preallocation back into the file right after B." We can avoid this race by ensuring that thread 1 cannot accidentally remap the blocks that thread 2 allocated (as part of speculative preallocation) as part of t2's write preparation in t1's end_io handler. The way we make this happen is by taking advantage of the unwritten extent flag as an intermediate step. Recall that when we begin the process of writing data to shared blocks, we create a delayed allocation extent in the CoW fork: D: --RRRRRRSSSRRRRRRRR--- C: ------DDDDDDD--------- When a thread prepares to CoW some dirty data out to disk, it will now convert the delalloc reservation into an /unwritten/ allocated extent in the cow fork. The da conversion code tries to opportunistically allocate as much of a (speculatively prealloc'd) extent as possible, so we may end up allocating a larger extent than we're actually writing out: D: --RRRRRRSSSRRRRRRRR--- U: ------UUUUUUU--------- Next, we convert only the part of the extent that we're actively planning to write to normal (i.e. not unwritten) status: D: --RRRRRRSSSRRRRRRRR--- U: ------UURRUUU--------- If the write succeeds, the end_cow function will now scan the relevant range of the CoW fork for real extents and remap only the real extents into the data fork: D: --RRRRRRRRSRRRRRRRR--- U: ------UU--UUU--------- This ensures that we never obliterate valid data fork extents with unwritten blocks from the CoW fork. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Darrick J. Wong	8370826f7d	xfs: allow unwritten extents in the CoW fork commit `05a630d76b` upstream. In the data fork, we only allow extents to perform the following state transitions: delay -> real <-> unwritten There's no way to move directly from a delalloc reservation to an /unwritten/ allocated extent. However, for the CoW fork we want to be able to do the following to each extent: delalloc -> unwritten -> written -> remapped to data fork This will help us to avoid a race in the speculative CoW preallocation code between a first thread that is allocating a CoW extent and a second thread that is remapping part of a file after a write. In order to do this, however, we need two things: first, we have to be able to transition from da to unwritten, and second the function that converts between real and unwritten has to be made aware of the cow fork. Do both of those things. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Darrick J. Wong	3d2bd2fd5c	xfs: verify free block header fields commit `de14c5f541` upstream. Perform basic sanity checking of the directory free block header fields so that we avoid hanging the system on invalid data. (Granted that just means that now we shutdown on directory write, but that seems better than hanging...) Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Darrick J. Wong	4056a74aaf	xfs: check for obviously bad level values in the bmbt root commit `b3bf607d58` upstream. We can't handle a bmbt that's taller than BTREE_MAXLEVELS, and there's no such thing as a zero-level bmbt (for that we have extents format), so if we see this, send back an error code. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:31 +02:00
Darrick J. Wong	efab3ae29c	xfs: filter out obviously bad btree pointers commit `d5a91baeb6` upstream. Don't let anybody load an obviously bad btree pointer. Since the values come from disk, we must return an error, not just ASSERT. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Reviewed-by: Eric Sandeen <sandeen@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Darrick J. Wong	7e2dd1fb71	xfs: fail _dir_open when readahead fails commit `7a652bbe36` upstream. When we open a directory, we try to readahead block 0 of the directory on the assumption that we're going to need it soon. If the bmbt is corrupt, the directory will never be usable and the readahead fails immediately, so we might as well prevent the directory from being opened at all. This prevents a subsequent read or modify operation from hitting it and taking the fs offline. NOTE: We're only checking for early failures in the block mapping, not the readahead directory block itself. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Reviewed-by: Eric Sandeen <sandeen@redhat.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Darrick J. Wong	0a6844abac	xfs: fix toctou race when locking an inode to access the data map commit `4b5bd5bf3f` upstream. We use di_format and if_flags to decide whether we're grabbing the ilock in btree mode (btree extents not loaded) or shared mode (anything else), but the state of those fields can be changed by other threads that are also trying to load the btree extents -- IFEXTENTS gets set before the _bmap_read_extents call and cleared if it fails. We don't actually need to have IFEXTENTS set until after the bmbt records are successfully loaded and validated, which will fix the race between multiple threads trying to read the same directory. The next patch strengthens directory bmbt validation by refusing to open the directory if reading the bmbt to start directory readahead fails. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Brian Foster	4127a5d9fb	xfs: fix eofblocks race with file extending async dio writes commit `e4229d6b0b` upstream. It's possible for post-eof blocks to end up being used for direct I/O writes. dio write performs an upfront unwritten extent allocation, sends the dio and then updates the inode size (if necessary) on write completion. If a file release occurs while a file extending dio write is in flight, it is possible to mistake the post-eof blocks for speculative preallocation and incorrectly truncate them from the inode. This means that the resulting dio write completion can discover a hole and allocate new blocks rather than perform unwritten extent conversion. This requires a strange mix of I/O and is thus not likely to reproduce in real world workloads. It is intermittently reproduced by generic/299. The error manifests as an assert failure due to transaction overrun because the aforementioned write completion transaction has only reserved enough blocks for btree operations: XFS: Assertion failed: tp->t_blk_res_used <= tp->t_blk_res, \ file: fs/xfs//xfs_trans.c, line: 309 The root cause is that xfs_free_eofblocks() uses i_size to truncate post-eof blocks from the inode, but async, file extending direct writes do not update i_size until write completion, long after inode locks are dropped. Therefore, xfs_free_eofblocks() effectively truncates the inode to the incorrect size. Update xfs_free_eofblocks() to serialize against dio similar to how extending writes are serialized against i_size updates before post-eof block zeroing. Specifically, wait on dio while under the iolock. This ensures that dio write completions have updated i_size before post-eof blocks are processed. Signed-off-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Brian Foster	4d725d7474	xfs: sync eofblocks scans under iolock are livelock prone commit `c3155097ad` upstream. The xfs_eofblocks.eof_scan_owner field is an internal field to facilitate invoking eofb scans from the kernel while under the iolock. This is necessary because the eofb scan acquires the iolock of each inode. Synchronous scans are invoked on certain buffered write failures while under iolock. In such cases, the scan owner indicates that the context for the scan already owns the particular iolock and prevents a double lock deadlock. eofblocks scans while under iolock are still livelock prone in the event of multiple parallel scans, however. If multiple buffered writes to different inodes fail and invoke eofblocks scans at the same time, each scan avoids a deadlock with its own inode by virtue of the eof_scan_owner field, but will never be able to acquire the iolock of the inode from the parallel scan. Because the low free space scans are invoked with SYNC_WAIT, the scan will not return until it has processed every tagged inode and thus both scans will spin indefinitely on the iolock being held across the opposite scan. This problem can be reproduced reliably by generic/224 on systems with higher cpu counts (x16). To avoid this problem, simplify the semantics of eofblocks scans to never invoke a scan while under iolock. This means that the buffered write context must drop the iolock before the scan. It must reacquire the lock before the write retry and also repeat the initial write checks, as the original state might no longer be valid once the iolock was dropped. Signed-off-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Brian Foster	798b1dc5cb	xfs: pull up iolock from xfs_free_eofblocks() commit `a36b926180` upstream. xfs_free_eofblocks() requires the IOLOCK_EXCL lock, but is called from different contexts where the lock may or may not be held. The need_iolock parameter exists for this reason, to indicate whether xfs_free_eofblocks() must acquire the iolock itself before it can proceed. This is ugly and confusing. Simplify the semantics of xfs_free_eofblocks() to require the caller to acquire the iolock appropriately and kill the need_iolock parameter. While here, the mp param can be removed as well as the xfs_mount is accessible from the xfs_inode structure. This patch does not change behavior. Signed-off-by: Brian Foster <bfoster@redhat.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Christoph Hellwig	08a2a26816	xfs: use per-AG reservations for the finobt commit `76d771b4cb` upstream. Currently we try to rely on the global reserved block pool for block allocations for the free inode btree, but I have customer reports (fairly complex workload, need to find an easier reproducer) where that is not enough as the AG where we free an inode that requires a new finobt block is entirely full. This causes us to cancel a dirty transaction and thus a file system shutdown. I think the right way to guard against this is to treat the finot the same way as the refcount btree and have a per-AG reservations for the possible worst case size of it, and the patch below implements that. Note that this could increase mount times with large finobt trees. In an ideal world we would have added a field for the number of finobt fields to the AGI, similar to what we did for the refcount blocks. We should do add it next time we rev the AGI or AGF format by adding new fields. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Christoph Hellwig	9be1c33d4a	xfs: only update mount/resv fields on success in __xfs_ag_resv_init commit `4dfa2b8411` upstream. Try to reserve the blocks first and only then update the fields in or hanging off the mount structure. This way we can call __xfs_ag_resv_init again after a previous failure. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Ross Lagerwall	8b08aec62c	xen/setup: Don't relocate p2m over existing one commit `7ecec8503a` upstream. When relocating the p2m, take special care not to relocate it so that is overlaps with the current location of the p2m/initrd. This is needed since the full extent of the current location is not marked as a reserved region in the e820. This was seen to happen to a dom0 with a large initial p2m and a small reserved region in the middle of the initial p2m. Signed-off-by: Ross Lagerwall <ross.lagerwall@citrix.com> Reviewed-by: Juergen Gross <jgross@suse.com> Signed-off-by: Juergen Gross <jgross@suse.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Ilya Dryomov	8601537724	libceph: force GFP_NOIO for socket allocations commit `633ee407b9` upstream. sock_alloc_inode() allocates socket+inode and socket_wq with GFP_KERNEL, which is not allowed on the writeback path: Workqueue: ceph-msgr con_work [libceph] ffff8810871cb018 0000000000000046 0000000000000000 ffff881085d40000 0000000000012b00 ffff881025cad428 ffff8810871cbfd8 0000000000012b00 ffff880102fc1000 ffff881085d40000 ffff8810871cb038 ffff8810871cb148 Call Trace: [<ffffffff816dd629>] schedule+0x29/0x70 [<ffffffff816e066d>] schedule_timeout+0x1bd/0x200 [<ffffffff81093ffc>] ? ttwu_do_wakeup+0x2c/0x120 [<ffffffff81094266>] ? ttwu_do_activate.constprop.135+0x66/0x70 [<ffffffff816deb5f>] wait_for_completion+0xbf/0x180 [<ffffffff81097cd0>] ? try_to_wake_up+0x390/0x390 [<ffffffff81086335>] flush_work+0x165/0x250 [<ffffffff81082940>] ? worker_detach_from_pool+0xd0/0xd0 [<ffffffffa03b65b1>] xlog_cil_force_lsn+0x81/0x200 [xfs] [<ffffffff816d6b42>] ? __slab_free+0xee/0x234 [<ffffffffa03b4b1d>] _xfs_log_force_lsn+0x4d/0x2c0 [xfs] [<ffffffff811adc1e>] ? lookup_page_cgroup_used+0xe/0x30 [<ffffffffa039a723>] ? xfs_reclaim_inode+0xa3/0x330 [xfs] [<ffffffffa03b4dcf>] xfs_log_force_lsn+0x3f/0xf0 [xfs] [<ffffffffa039a723>] ? xfs_reclaim_inode+0xa3/0x330 [xfs] [<ffffffffa03a62c6>] xfs_iunpin_wait+0xc6/0x1a0 [xfs] [<ffffffff810aa250>] ? wake_atomic_t_function+0x40/0x40 [<ffffffffa039a723>] xfs_reclaim_inode+0xa3/0x330 [xfs] [<ffffffffa039ac07>] xfs_reclaim_inodes_ag+0x257/0x3d0 [xfs] [<ffffffffa039bb13>] xfs_reclaim_inodes_nr+0x33/0x40 [xfs] [<ffffffffa03ab745>] xfs_fs_free_cached_objects+0x15/0x20 [xfs] [<ffffffff811c0c18>] super_cache_scan+0x178/0x180 [<ffffffff8115912e>] shrink_slab_node+0x14e/0x340 [<ffffffff811afc3b>] ? mem_cgroup_iter+0x16b/0x450 [<ffffffff8115af70>] shrink_slab+0x100/0x140 [<ffffffff8115e425>] do_try_to_free_pages+0x335/0x490 [<ffffffff8115e7f9>] try_to_free_pages+0xb9/0x1f0 [<ffffffff816d56e4>] ? __alloc_pages_direct_compact+0x69/0x1be [<ffffffff81150cba>] __alloc_pages_nodemask+0x69a/0xb40 [<ffffffff8119743e>] alloc_pages_current+0x9e/0x110 [<ffffffff811a0ac5>] new_slab+0x2c5/0x390 [<ffffffff816d71c4>] __slab_alloc+0x33b/0x459 [<ffffffff815b906d>] ? sock_alloc_inode+0x2d/0xd0 [<ffffffff8164bda1>] ? inet_sendmsg+0x71/0xc0 [<ffffffff815b906d>] ? sock_alloc_inode+0x2d/0xd0 [<ffffffff811a21f2>] kmem_cache_alloc+0x1a2/0x1b0 [<ffffffff815b906d>] sock_alloc_inode+0x2d/0xd0 [<ffffffff811d8566>] alloc_inode+0x26/0xa0 [<ffffffff811da04a>] new_inode_pseudo+0x1a/0x70 [<ffffffff815b933e>] sock_alloc+0x1e/0x80 [<ffffffff815ba855>] __sock_create+0x95/0x220 [<ffffffff815baa04>] sock_create_kern+0x24/0x30 [<ffffffffa04794d9>] con_work+0xef9/0x2050 [libceph] [<ffffffffa04aa9ec>] ? rbd_img_request_submit+0x4c/0x60 [rbd] [<ffffffff81084c19>] process_one_work+0x159/0x4f0 [<ffffffff8108561b>] worker_thread+0x11b/0x530 [<ffffffff81085500>] ? create_worker+0x1d0/0x1d0 [<ffffffff8108b6f9>] kthread+0xc9/0xe0 [<ffffffff8108b630>] ? flush_kthread_worker+0x90/0x90 [<ffffffff816e1b98>] ret_from_fork+0x58/0x90 [<ffffffff8108b630>] ? flush_kthread_worker+0x90/0x90 Use memalloc_noio_{save,restore}() to temporarily force GFP_NOIO here. Link: http://tracker.ceph.com/issues/19309 Reported-by: Sergey Jerusalimov <wintchester@gmail.com> Signed-off-by: Ilya Dryomov <idryomov@gmail.com> Reviewed-by: Jeff Layton <jlayton@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-04-08 09:30:30 +02:00
Greg Kroah-Hartman	f676772746	Linux 4.9.20	2017-03-31 10:32:02 +02:00
Bin Liu	1dc3a068cc	usb: musb: fix possible spinlock deadlock commit `bc1e215454` upstream. The DSPS glue calls del_timer_sync() in its musb_platform_disable() implementation, which requires the caller to not hold a lock. But musb_remove() calls musb_platform_disable() will musb->lock held. This could causes spinlock deadlock. So change musb_remove() to call musb_platform_disable() without holds musb->lock. This doesn't impact the musb_platform_disable implementation in other glue drivers. root@am335x-evm:~# modprobe -r musb-dsps [ 126.134879] musb-hdrc musb-hdrc.1: remove, state 1 [ 126.140465] usb usb2: USB disconnect, device number 1 [ 126.146178] usb 2-1: USB disconnect, device number 2 [ 126.416985] musb-hdrc musb-hdrc.1: USB bus 2 deregistered [ 126.423943] [ 126.425525] ====================================================== [ 126.431997] [ INFO: possible circular locking dependency detected ] [ 126.438564] 4.11.0-rc1-00003-g1557f13bca04-dirty #77 Not tainted [ 126.444852] ------------------------------------------------------- [ 126.451414] modprobe/778 is trying to acquire lock: [ 126.456523] (((&glue->timer))){+.-...}, at: [<c01b8788>] del_timer_sync+0x0/0xd0 [ 126.464403] [ 126.464403] but task is already holding lock: [ 126.470511] (&(&musb->lock)->rlock){-.-...}, at: [<bf30b7f8>] musb_remove+0x50/0x1 30 [musb_hdrc] [ 126.479965] [ 126.479965] which lock already depends on the new lock. [ 126.479965] [ 126.488531] [ 126.488531] the existing dependency chain (in reverse order) is: [ 126.496368] [ 126.496368] -> #1 (&(&musb->lock)->rlock){-.-...}: [ 126.502968] otg_timer+0x80/0xec [musb_dsps] [ 126.507990] call_timer_fn+0xb4/0x390 [ 126.512372] expire_timers+0xf0/0x1fc [ 126.516754] run_timer_softirq+0x80/0x178 [ 126.521511] __do_softirq+0xc4/0x554 [ 126.525802] irq_exit+0xe8/0x158 [ 126.529735] __handle_domain_irq+0x58/0xb8 [ 126.534583] __irq_usr+0x54/0x80 [ 126.538507] [ 126.538507] -> #0 (((&glue->timer))){+.-...}: [ 126.544636] del_timer_sync+0x40/0xd0 [ 126.549066] musb_remove+0x6c/0x130 [musb_hdrc] [ 126.554370] platform_drv_remove+0x24/0x3c [ 126.559206] device_release_driver_internal+0x14c/0x1e0 [ 126.565225] bus_remove_device+0xd8/0x108 [ 126.569970] device_del+0x1e4/0x308 [ 126.574170] platform_device_del+0x24/0x8c [ 126.579006] platform_device_unregister+0xc/0x20 [ 126.584394] dsps_remove+0x14/0x30 [musb_dsps] [ 126.589595] platform_drv_remove+0x24/0x3c [ 126.594432] device_release_driver_internal+0x14c/0x1e0 [ 126.600450] driver_detach+0x38/0x6c [ 126.604740] bus_remove_driver+0x4c/0xa0 [ 126.609407] SyS_delete_module+0x11c/0x1e4 [ 126.614252] __sys_trace_return+0x0/0x10 Fixes: `ea2f35c01d` ("usb: musb: Fix sleeping function called from invalid context for hdrc glue") Acked-by: Tony Lindgren <tony@atomide.com> Signed-off-by: Bin Liu <b-liu@ti.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Sebastian Andrzej Siewior	916c5cfeab	sched/rt: Add a missing rescheduling point commit `619bd4a718` upstream. Since the change in commit: `fd7a4bed18` ("sched, rt: Convert switched_{from, to}_rt() / prio_changed_rt() to balance callbacks") ... we don't reschedule a task under certain circumstances: Lets say task-A, SCHED_OTHER, is running on CPU0 (and it may run only on CPU0) and holds a PI lock. This task is removed from the CPU because it used up its time slice and another SCHED_OTHER task is running. Task-B on CPU1 runs at RT priority and asks for the lock owned by task-A. This results in a priority boost for task-A. Task-B goes to sleep until the lock has been made available. Task-A is already runnable (but not active), so it receives no wake up. The reality now is that task-A gets on the CPU once the scheduler decides to remove the current task despite the fact that a high priority task is enqueued and waiting. This may take a long time. The desired behaviour is that CPU0 immediately reschedules after the priority boost which made task-A the task with the lowest priority. Suggested-by: Peter Zijlstra <peterz@infradead.org> Signed-off-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Mike Galbraith <efault@gmx.de> Cc: Thomas Gleixner <tglx@linutronix.de> Fixes: `fd7a4bed18` ("sched, rt: Convert switched_{from, to}_rt() prio_changed_rt() to balance callbacks") Link: http://lkml.kernel.org/r/20170124144006.29821-1-bigeasy@linutronix.de Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Eric Biggers	2984e52c75	fscrypt: remove broken support for detecting keyring key revocation commit `1b53cf9815` upstream. Filesystem encryption ostensibly supported revoking a keyring key that had been used to "unlock" encrypted files, causing those files to become "locked" again. This was, however, buggy for several reasons, the most severe of which was that when key revocation happened to be detected for an inode, its fscrypt_info was immediately freed, even while other threads could be using it for encryption or decryption concurrently. This could be exploited to crash the kernel or worse. This patch fixes the use-after-free by removing the code which detects the keyring key having been revoked, invalidated, or expired. Instead, an encrypted inode that is "unlocked" now simply remains unlocked until it is evicted from memory. Note that this is no worse than the case for block device-level encryption, e.g. dm-crypt, and it still remains possible for a privileged user to evict unused pages, inodes, and dentries by running 'sync; echo 3 > /proc/sys/vm/drop_caches', or by simply unmounting the filesystem. In fact, one of those actions was already needed anyway for key revocation to work even somewhat sanely. This change is not expected to break any applications. In the future I'd like to implement a real API for fscrypt key revocation that interacts sanely with ongoing filesystem operations --- waiting for existing operations to complete and blocking new operations, and invalidating and sanitizing key material and plaintext from the VFS caches. But this is a hard problem, and for now this bug must be fixed. This bug affected almost all versions of ext4, f2fs, and ubifs encryption, and it was potentially reachable in any kernel configured with encryption support (CONFIG_EXT4_ENCRYPTION=y, CONFIG_EXT4_FS_ENCRYPTION=y, CONFIG_F2FS_FS_ENCRYPTION=y, or CONFIG_UBIFS_FS_ENCRYPTION=y). Note that older kernels did not use the shared fs/crypto/ code, but due to the potential security implications of this bug, it may still be worthwhile to backport this fix to them. Fixes: `b7236e21d5` ("ext4 crypto: reorganize how we store keys in the inode") Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu> Acked-by: Michael Halcrow <mhalcrow@google.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Dave Martin	21c95eca62	metag/ptrace: Reject partial NT_METAG_RPIPE writes commit `7195ee3120` upstream. It's not clear what behaviour is sensible when doing partial write of NT_METAG_RPIPE, so just don't bother. This patch assumes that userspace will never rely on a partial SETREGSET in this case, since it's not clear what should happen anyway. Signed-off-by: Dave Martin <Dave.Martin@arm.com> Acked-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Dave Martin	2d6532ceb4	metag/ptrace: Provide default TXSTATUS for short NT_PRSTATUS commit `5fe81fe981` upstream. Ensure that if userspace supplies insufficient data to PTRACE_SETREGSET to fill TXSTATUS, a well-defined default value is used, based on the task's current value. Suggested-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Dave Martin <Dave.Martin@arm.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Dave Martin	2739b48741	metag/ptrace: Preserve previous registers for short regset write commit `a78ce80d2c` upstream. Ensure that if userspace supplies insufficient data to PTRACE_SETREGSET to fill all the registers, the thread's old registers are preserved. Signed-off-by: Dave Martin <Dave.Martin@arm.com> Acked-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Dave Martin	84b94c4356	sparc/ptrace: Preserve previous registers for short regset write commit `d3805c546b` upstream. Ensure that if userspace supplies insufficient data to PTRACE_SETREGSET to fill all the registers, the thread's old registers are preserved. Signed-off-by: Dave Martin <Dave.Martin@arm.com> Acked-by: David S. Miller <davem@davemloft.net> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Dave Martin	0ba34c87f7	mips/ptrace: Preserve previous registers for short regset write commit `d614fd58a2` upstream. Ensure that if userspace supplies insufficient data to PTRACE_SETREGSET to fill all the registers, the thread's old registers are preserved. Signed-off-by: Dave Martin <Dave.Martin@arm.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Dave Martin	616fe993f4	h8300/ptrace: Fix incorrect register transfer count commit `502585c755` upstream. regs_set() and regs_get() are vulnerable to an off-by-1 buffer overrun if CONFIG_CPU_H8S is set, since this adds an extra entry to register_offset[] but not to user_regs_struct. So, iterate over user_regs_struct based on its actual size, not based on the length of register_offset[]. Signed-off-by: Dave Martin <Dave.Martin@arm.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Dave Martin	d77bd2d6a4	c6x/ptrace: Remove useless PTRACE_SETREGSET implementation commit `fb411b837b` upstream. gpr_set won't work correctly and can never have been tested, and the correct behaviour is not clear due to the endianness-dependent task layout. So, just remove it. The core code will now return -EOPNOTSUPPORT when trying to set NT_PRSTATUS on this architecture until/unless a correct implementation is supplied. Signed-off-by: Dave Martin <Dave.Martin@arm.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:46 +02:00
Bjorn Andersson	dd6054234b	pinctrl: qcom: Don't clear status bit on irq_unmask commit `a6566710ad` upstream. Clearing the status bit on irq_unmask will discard any pending interrupt that did arrive after the irq_ack, i.e. while the IRQ handler function was executing. Fixes: `f365be0925` ("pinctrl: Add Qualcomm TLMM driver") Cc: Stephen Boyd <sboyd@codeaurora.org> Reported-by: Timur Tabi <timur@codeaurora.org> Signed-off-by: Bjorn Andersson <bjorn.andersson@linaro.org> Signed-off-by: Linus Walleij <linus.walleij@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:45 +02:00
Ladi Prosek	b82b2f930c	virtio_balloon: init 1st buffer in stats vq commit `fc8653228c` upstream. When init_vqs runs, virtio_balloon.stats is either uninitialized or contains stale values. The host updates its state with garbage data because it has no way of knowing that this is just a marker buffer used for signaling. This patch updates the stats before pushing the initial buffer. Alternative fixes: * Push an empty buffer in init_vqs. Not easily done with the current virtio implementation and violates the spec "Driver MUST supply the same subset of statistics in all buffers submitted to the statsq". * Push a buffer with invalid tags in init_vqs. Violates the same spec clause, plus "invalid tag" is not really defined. Note: the spec says: When using the legacy interface, the device SHOULD ignore all values in the first buffer in the statsq supplied by the driver after device initialization. Note: Historically, drivers supplied an uninitialized buffer in the first buffer. Unfortunately QEMU does not seem to implement the recommendation even for the legacy interface. Signed-off-by: Ladi Prosek <lprosek@redhat.com> Signed-off-by: Michael S. Tsirkin <mst@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:45 +02:00
Paolo Bonzini	3e8c6bd9db	KVM: x86: cleanup the page tracking SRCU instance commit `2beb6dad2e` upstream. SRCU uses a delayed work item. Skip cleaning it up, and the result is use-after-free in the work item callbacks. Reported-by: Dmitry Vyukov <dvyukov@google.com> Suggested-by: Dmitry Vyukov <dvyukov@google.com> Fixes: `0eb05bf290` Reviewed-by: Xiao Guangrong <xiaoguangrong.eric@gmail.com> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:45 +02:00
Andy Whitcroft	79191ea36d	xfrm_user: validate XFRM_MSG_NEWAE incoming ESN size harder commit `f843ee6dd0` upstream. Kees Cook has pointed out that xfrm_replay_state_esn_len() is subject to wrapping issues. To ensure we are correctly ensuring that the two ESN structures are the same size compare both the overall size as reported by xfrm_replay_state_esn_len() and the internal length are the same. CVE-2017-7184 Signed-off-by: Andy Whitcroft <apw@canonical.com> Acked-by: Steffen Klassert <steffen.klassert@secunet.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:45 +02:00
Andy Whitcroft	64a5465799	xfrm_user: validate XFRM_MSG_NEWAE XFRMA_REPLAY_ESN_VAL replay_window commit `677e806da4` upstream. When a new xfrm state is created during an XFRM_MSG_NEWSA call we validate the user supplied replay_esn to ensure that the size is valid and to ensure that the replay_window size is within the allocated buffer. However later it is possible to update this replay_esn via a XFRM_MSG_NEWAE call. There we again validate the size of the supplied buffer matches the existing state and if so inject the contents. We do not at this point check that the replay_window is within the allocated memory. This leads to out-of-bounds reads and writes triggered by netlink packets. This leads to memory corruption and the potential for priviledge escalation. We already attempt to validate the incoming replay information in xfrm_new_ae() via xfrm_replay_verify_len(). This confirms that the user is not trying to change the size of the replay state buffer which includes the replay_esn. It however does not check the replay_window remains within that buffer. Add validation of the contained replay_window. CVE-2017-7184 Signed-off-by: Andy Whitcroft <apw@canonical.com> Acked-by: Steffen Klassert <steffen.klassert@secunet.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:45 +02:00
Florian Westphal	f68a09c794	xfrm: policy: init locks early commit `c282222a45` upstream. Dmitry reports following splat: INFO: trying to register non-static key. the code is fine but needs lockdep annotation. turning off the locking correctness validator. CPU: 0 PID: 13059 Comm: syz-executor1 Not tainted 4.10.0-rc7-next-20170207 #1 [..] spin_lock_bh include/linux/spinlock.h:304 [inline] xfrm_policy_flush+0x32/0x470 net/xfrm/xfrm_policy.c:963 xfrm_policy_fini+0xbf/0x560 net/xfrm/xfrm_policy.c:3041 xfrm_net_init+0x79f/0x9e0 net/xfrm/xfrm_policy.c:3091 ops_init+0x10a/0x530 net/core/net_namespace.c:115 setup_net+0x2ed/0x690 net/core/net_namespace.c:291 copy_net_ns+0x26c/0x530 net/core/net_namespace.c:396 create_new_namespaces+0x409/0x860 kernel/nsproxy.c:106 unshare_nsproxy_namespaces+0xae/0x1e0 kernel/nsproxy.c:205 SYSC_unshare kernel/fork.c:2281 [inline] Problem is that when we get error during xfrm_net_init we will call xfrm_policy_fini which will acquire xfrm_policy_lock before it was initialized. Just move it around so locks get set up first. Reported-by: Dmitry Vyukov <dvyukov@google.com> Fixes: `283bc9f35b` ("xfrm: Namespacify xfrm state/policy locks") Signed-off-by: Florian Westphal <fw@strlen.de> Signed-off-by: Steffen Klassert <steffen.klassert@secunet.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-31 10:31:45 +02:00
Greg Kroah-Hartman	c8e131605d	Linux 4.9.19	2017-03-30 09:41:57 +02:00
Jiri Slaby	bc959a402d	crypto: algif_hash - avoid zero-sized array commit `6207119444` upstream. With this reproducer: struct sockaddr_alg alg = { .salg_family = 0x26, .salg_type = "hash", .salg_feat = 0xf, .salg_mask = 0x5, .salg_name = "digest_null", }; int sock, sock2; sock = socket(AF_ALG, SOCK_SEQPACKET, 0); bind(sock, (struct sockaddr *)&alg, sizeof(alg)); sock2 = accept(sock, NULL, NULL); setsockopt(sock, SOL_ALG, ALG_SET_KEY, "\x9b\xca", 2); accept(sock2, NULL, NULL); ==== 8< ======== 8< ======== 8< ======== 8< ==== one can immediatelly see an UBSAN warning: UBSAN: Undefined behaviour in crypto/algif_hash.c:187:7 variable length array bound value 0 <= 0 CPU: 0 PID: 15949 Comm: syz-executor Tainted: G E 4.4.30-0-default #1 ... Call Trace: ... [<ffffffff81d598fd>] ? __ubsan_handle_vla_bound_not_positive+0x13d/0x188 [<ffffffff81d597c0>] ? __ubsan_handle_out_of_bounds+0x1bc/0x1bc [<ffffffffa0e2204d>] ? hash_accept+0x5bd/0x7d0 [algif_hash] [<ffffffffa0e2293f>] ? hash_accept_nokey+0x3f/0x51 [algif_hash] [<ffffffffa0e206b0>] ? hash_accept_parent_nokey+0x4a0/0x4a0 [algif_hash] [<ffffffff8235c42b>] ? SyS_accept+0x2b/0x40 It is a correct warning, as hash state is propagated to accept as zero, but creating a zero-length variable array is not allowed in C. Fix this as proposed by Herbert -- do "?: 1" on that site. No sizeof or similar happens in the code there, so we just allocate one byte even though we do not use the array. Signed-off-by: Jiri Slaby <jslaby@suse.cz> Cc: Herbert Xu <herbert@gondor.apana.org.au> Cc: "David S. Miller" <davem@davemloft.net> (maintainer:CRYPTO API) Reported-by: Sasha Levin <sasha.levin@oracle.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Cc: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:28 +02:00
Takashi Iwai	3fd37725c4	fbcon: Fix vc attr at deinit commit `8aac7f3436` upstream. fbcon can deal with vc_hi_font_mask (the upper 256 chars) and adjust the vc attrs dynamically when vc_hi_font_mask is changed at fbcon_init(). When the vc_hi_font_mask is set, it remaps the attrs in the existing console buffer with one bit shift up (for 9 bits), while it remaps with one bit shift down (for 8 bits) when the value is cleared. It works fine as long as the font gets updated after fbcon was initialized. However, we hit a bizarre problem when the console is switched to another fb driver (typically from vesafb or efifb to drmfb). At switching to the new fb driver, we temporarily rebind the console to the dummy console, then rebind to the new driver. During the switching, we leave the modified attrs as is. Thus, the new fbcon takes over the old buffer as if it were to contain 8 bits chars (although the attrs are still shifted for 9 bits), and effectively this results in the yellow color texts instead of the original white color, as found in the bugzilla entry below. An easy fix for this is to re-adjust the attrs before leaving the fbcon at con_deinit callback. Since the code to adjust the attrs is already present in the current fbcon code, in this patch, we simply factor out the relevant code, and call it from fbcon_deinit(). Bugzilla: https://bugzilla.suse.com/show_bug.cgi?id=1000619 Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com> Cc: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:28 +02:00
Daniel Vetter	c75fe78995	drm: reference count event->completion commit `24835e442f` upstream. When writing the generic nonblocking commit code I assumed that through clever lifetime management I can assure that the completion (stored in drm_crtc_commit) only gets freed after it is completed. And that worked. I also wanted to make nonblocking helpers resilient against driver bugs, by having timeouts everywhere. And that worked too. Unfortunately taking boths things together results in oopses :( Well, at least sometimes: What seems to happen is that the drm event hangs around forever stuck in limbo land. The nonblocking helpers eventually time out, move on and release it. Now the bug I tested all this against is drivers that just entirely fail to deliver the vblank events like they should, and in those cases the event is simply leaked. But what seems to happen, at least sometimes, on i915 is that the event is set up correctly, but somohow the vblank fails to fire in time. Which means the event isn't leaked, it's still there waiting for eventually a vblank to fire. That tends to happen when re-enabling the pipe, and then the trap springs and the kernel oopses. The correct fix here is simply to refcount the crtc commit to make sure that the event sticks around even for drivers which only sometimes fail to deliver vblanks for some arbitrary reasons. Since crtc commits are already refcounted that's easy to do. References: https://bugs.freedesktop.org/show_bug.cgi?id=96781 Cc: Jim Rees <rees@umich.edu> Cc: Chris Wilson <chris@chris-wilson.co.uk> Cc: Maarten Lankhorst <maarten.lankhorst@linux.intel.com> Cc: Jani Nikula <jani.nikula@linux.intel.com> Reviewed-by: Maarten Lankhorst <maarten.lankhorst@linux.intel.com> Signed-off-by: Daniel Vetter <daniel.vetter@intel.com> Link: http://patchwork.freedesktop.org/patch/msgid/20161221102331.31033-1-daniel.vetter@ffwll.ch Cc: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:28 +02:00
Johannes Berg	56769e7a05	nl80211: fix dumpit error path RTNL deadlocks commit `ea90e0dc8c` upstream. Sowmini pointed out Dmitry's RTNL deadlock report to me, and it turns out to be perfectly accurate - there are various error paths that miss unlock of the RTNL. To fix those, change the locking a bit to not be conditional in all those nl80211_prepare_*_dump() functions, but make those require the RTNL to start with, and fix the buggy error paths. This also let me use sparse (by appropriately overriding the rtnl_lock/rtnl_unlock functions) to validate the changes. Reported-by: Sowmini Varadhan <sowmini.varadhan@oracle.com> Reported-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: Johannes Berg <johannes.berg@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:28 +02:00
Marek Szyprowski	7b3c8b2a2e	drm/bridge: analogix dp: Fix runtime PM state on driver bind commit `f0a8b49c03` upstream. Analogix_dp_bind() can be called from component framework, which doesn't guarantee proper runtime PM state of the device during bind operation, so ensure that device is runtime active before doing any register access. This ensures that the power domain, to which DP module belongs, is turned on. While at it, also fix the unbalanced call to phy_power_on() in analogix_dp_bind() function. This patch solves the following kernel oops on Samsung Exynos5250 Snow board: Unhandled fault: imprecise external abort (0x406) at 0x00000000 pgd = c0004000 [00000000] *pgd=00000000 Internal error: : 406 [#1] PREEMPT SMP ARM Modules linked in: CPU: 0 PID: 75 Comm: kworker/0:2 Not tainted 4.9.0 #1046 Hardware name: SAMSUNG EXYNOS (Flattened Device Tree) Workqueue: events deferred_probe_work_func task: ee272300 task.stack: ee312000 PC is at analogix_dp_enable_sw_function+0x18/0x2c LR is at analogix_dp_init_dp+0x2c/0x50 ... [<c03fcb38>] (analogix_dp_enable_sw_function) from [<c03fa9c4>] (analogix_dp_init_dp+0x2c/0x50) [<c03fa9c4>] (analogix_dp_init_dp) from [<c03fab6c>] (analogix_dp_bind+0x184/0x42c) [<c03fab6c>] (analogix_dp_bind) from [<c03fdb84>] (component_bind_all+0xf0/0x218) [<c03fdb84>] (component_bind_all) from [<c03ed64c>] (exynos_drm_load+0x134/0x200) [<c03ed64c>] (exynos_drm_load) from [<c03d5058>] (drm_dev_register+0xa0/0xd0) [<c03d5058>] (drm_dev_register) from [<c03d66b8>] (drm_platform_init+0x58/0xb0) [<c03d66b8>] (drm_platform_init) from [<c03fe0c4>] (try_to_bring_up_master+0x14c/0x188) [<c03fe0c4>] (try_to_bring_up_master) from [<c03fe188>] (component_add+0x88/0x138) [<c03fe188>] (component_add) from [<c0403a38>] (platform_drv_probe+0x50/0xb0) [<c0403a38>] (platform_drv_probe) from [<c0402470>] (driver_probe_device+0x1f0/0x2a8) [<c0402470>] (driver_probe_device) from [<c0400a54>] (bus_for_each_drv+0x44/0x8c) [<c0400a54>] (bus_for_each_drv) from [<c04021f8>] (__device_attach+0x9c/0x100) [<c04021f8>] (__device_attach) from [<c04018e8>] (bus_probe_device+0x84/0x8c) [<c04018e8>] (bus_probe_device) from [<c0401d1c>] (deferred_probe_work_func+0x60/0x8c) [<c0401d1c>] (deferred_probe_work_func) from [<c012fc14>] (process_one_work+0x120/0x318) [<c012fc14>] (process_one_work) from [<c012fe34>] (process_scheduled_works+0x28/0x38) [<c012fe34>] (process_scheduled_works) from [<c0130048>] (worker_thread+0x204/0x4ac) [<c0130048>] (worker_thread) from [<c01352c4>] (kthread+0xd8/0xf4) [<c01352c4>] (kthread) from [<c0107978>] (ret_from_fork+0x14/0x3c) Code: e59035f0 e5935018 f57ff04f e3c55001 (f57ff04e) ---[ end trace 3d1d0d87796de344 ]--- Reviewed-by: Sean Paul <seanpaul@chromium.org> Signed-off-by: Marek Szyprowski <m.szyprowski@samsung.com> Signed-off-by: Archit Taneja <architt@codeaurora.org> Link: http://patchwork.freedesktop.org/patch/msgid/1483091866-1088-1-git-send-email-m.szyprowski@samsung.com Cc: Javier Martinez Canillas <javier@osg.samsung.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:28 +02:00
Dave Jiang	eae72468c4	device-dax: fix pmd/pte fault fallback handling commit `0134ed4fb9` upstream. Jeff Moyer reports: With a device dax alignment of 4KB or 2MB, I get sigbus when running the attached fio job file for the current kernel (4.11.0-rc1+). If I specify an alignment of 1GB, it works. I turned on debug output, and saw that it was failing in the huge fault code. dax dax1.0: dax_open dax dax1.0: dax_mmap dax dax1.0: dax_dev_huge_fault: fio: write (0x7f08f0a00000 - dax dax1.0: __dax_dev_pud_fault: phys_to_pgoff(0xffffffffcf60 dax dax1.0: dax_release fio config for reproduce: [global] ioengine=dev-dax direct=0 filename=/dev/dax0.0 bs=2m [write] rw=write [read] stonewall rw=read The driver fails to fallback when taking a fault that is larger than the device alignment, or handling a larger fault when a smaller mapping is already established. While we could support larger mappings for a device with a smaller alignment, that change is too large for the immediate fix. The simplest change is to force fallback until the fault size matches the alignment. Fixes: `dee4107924` ("/dev/dax, core: file operations and dax-mmap") Cc: <stable@vger.kernel.org> Reported-by: Jeff Moyer <jmoyer@redhat.com> Signed-off-by: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Ilya Dryomov	81ec3dc1de	libceph: don't set weight to IN when OSD is destroyed commit `b581a5854e` upstream. Since ceph.git commit 4e28f9e63644 ("osd/OSDMap: clear osd_info, osd_xinfo on osd deletion"), weight is set to IN when OSD is deleted. This changes the result of applying an incremental for clients, not just OSDs. Because CRUSH computations are obviously affected, pre-4e28f9e63644 servers disagree with post-4e28f9e63644 clients on object placement, resulting in misdirected requests. Mirrors ceph.git commit a6009d1039a55e2c77f431662b3d6cc5a8e8e63f. Fixes: `930c532869` ("libceph: apply new_state before new_up_client on incrementals") Link: http://tracker.ceph.com/issues/19122 Signed-off-by: Ilya Dryomov <idryomov@gmail.com> Reviewed-by: Sage Weil <sage@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
K. Y. Srinivasan	df1fe6c9ad	Drivers: hv: vmbus: Don't leak memory when a channel is rescinded commit `5e030d5ce9` upstream. When we close a channel that has been rescinded, we will leak memory since vmbus_teardown_gpadl() returns an error. Fix this so that we can properly cleanup the memory allocated to the ring buffers. Fixes: `ccb61f8a99` ("Drivers: hv: vmbus: Fix a rescind handling bug") Signed-off-by: K. Y. Srinivasan <kys@microsoft.com> Cc: Dexuan Cui <decui@microsoft.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
K. Y. Srinivasan	b1f6b0a5a0	Drivers: hv: vmbus: Don't leak channel ids commit `9a5476020a` upstream. If we cannot allocate memory for the channel, free the relid associated with the channel. Signed-off-by: K. Y. Srinivasan <kys@microsoft.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Alexander Shishkin	3076066bb5	intel_th: Don't leak module refcount on failure to activate commit `e609ccef52` upstream. Output 'activation' may fail for the reasons of the output driver, for example, if msc's buffer is not allocated. We forget, however, to drop the module reference in this case. So each attempt at activation in this case leaks a reference, preventing the module from ever unloading. This patch adds the missing module_put() in the activation error path. Signed-off-by: Alexander Shishkin <alexander.shishkin@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Eric Biggers	b176a6eed3	jbd2: don't leak memory if setting up journal fails commit `cd9cb405e0` upstream. In journal_init_common(), if we failed to allocate the j_wbuf array, or if we failed to create the buffer_head for the journal superblock, we leaked the memory allocated for the revocation tables. Fix this. Fixes: `f0c9fd5458` Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu> Reviewed-by: Jan Kara <jack@suse.cz> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Dmitry Torokhov	90f39ad2ce	auxdisplay: img-ascii-lcd: add missing sentinel entry in img_ascii_lcd_matches commit `abda288bb2` upstream. The OF device table must be terminated, otherwise we'll be walking past it and into areas unknown. Fixes: `0cad855fbd` ("auxdisplay: img-ascii-lcd: driver for simple ASCII...") Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Tested-by: Fengguang Wu <fengguang.wu@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Alex Deucher	9740abe0fd	drm/amdgpu: reinstate oland workaround for sclk commit `e11ddff68a` upstream. Higher sclks seem to be unstable on some boards. bug: https://bugs.freedesktop.org/show_bug.cgi?id=100222 Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Ming Lei	21d17f1b53	blk-mq: don't complete un-started request in timeout handler commit `95a4960370` upstream. When iterating busy requests in timeout handler, if the STARTED flag of one request isn't set, that means the request is being processed in block layer or driver, and isn't submitted to hardware yet. In current implementation of blk_mq_check_expired(), if the request queue becomes dying, un-started requests are handled as being completed/freed immediately. This way is wrong, and can cause rq corruption or double allocation[1][2], when doing I/O and removing&resetting NVMe device at the sametime. This patch fixes several issues reported by Yi Zhang. [1]. oops log 1 [ 581.789754] ------------[ cut here ]------------ [ 581.789758] kernel BUG at block/blk-mq.c:374! [ 581.789760] invalid opcode: 0000 [#1] SMP [ 581.789761] Modules linked in: vfat fat ipmi_ssif intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm nvme irqbypass crct10dif_pclmul nvme_core crc32_pclmul ghash_clmulni_intel intel_cstate ipmi_si mei_me ipmi_devintf intel_uncore sg ipmi_msghandler intel_rapl_perf iTCO_wdt mei iTCO_vendor_support mxm_wmi lpc_ich dcdbas shpchp pcspkr acpi_power_meter wmi nfsd auth_rpcgss nfs_acl lockd dm_multipath grace sunrpc ip_tables xfs libcrc32c sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm ahci libahci crc32c_intel tg3 libata megaraid_sas i2c_core ptp fjes pps_core dm_mirror dm_region_hash dm_log dm_mod [ 581.789796] CPU: 1 PID: 1617 Comm: kworker/1:1H Not tainted 4.10.0.bz1420297+ #4 [ 581.789797] Hardware name: Dell Inc. PowerEdge R730xd/072T6D, BIOS 2.2.5 09/06/2016 [ 581.789804] Workqueue: kblockd blk_mq_timeout_work [ 581.789806] task: ffff8804721c8000 task.stack: ffffc90006ee4000 [ 581.789809] RIP: 0010:blk_mq_end_request+0x58/0x70 [ 581.789810] RSP: 0018:ffffc90006ee7d50 EFLAGS: 00010202 [ 581.789811] RAX: 0000000000000001 RBX: ffff8802e4195340 RCX: ffff88028e2f4b88 [ 581.789812] RDX: 0000000000001000 RSI: 0000000000001000 RDI: 0000000000000000 [ 581.789813] RBP: ffffc90006ee7d60 R08: 0000000000000003 R09: ffff88028e2f4b00 [ 581.789814] R10: 0000000000001000 R11: 0000000000000001 R12: 00000000fffffffb [ 581.789815] R13: ffff88042abe5780 R14: 000000000000002d R15: ffff88046fbdff80 [ 581.789817] FS: 0000000000000000(0000) GS:ffff88047fc00000(0000) knlGS:0000000000000000 [ 581.789818] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 581.789819] CR2: 00007f64f403a008 CR3: 000000014d078000 CR4: 00000000001406e0 [ 581.789820] Call Trace: [ 581.789825] blk_mq_check_expired+0x76/0x80 [ 581.789828] bt_iter+0x45/0x50 [ 581.789830] blk_mq_queue_tag_busy_iter+0xdd/0x1f0 [ 581.789832] ? blk_mq_rq_timed_out+0x70/0x70 [ 581.789833] ? blk_mq_rq_timed_out+0x70/0x70 [ 581.789840] ? __switch_to+0x140/0x450 [ 581.789841] blk_mq_timeout_work+0x88/0x170 [ 581.789845] process_one_work+0x165/0x410 [ 581.789847] worker_thread+0x137/0x4c0 [ 581.789851] kthread+0x101/0x140 [ 581.789853] ? rescuer_thread+0x3b0/0x3b0 [ 581.789855] ? kthread_park+0x90/0x90 [ 581.789860] ret_from_fork+0x2c/0x40 [ 581.789861] Code: 48 85 c0 74 0d 44 89 e6 48 89 df ff d0 5b 41 5c 5d c3 48 8b bb 70 01 00 00 48 85 ff 75 0f 48 89 df e8 7d f0 ff ff 5b 41 5c 5d c3 <0f> 0b e8 71 f0 ff ff 90 eb e9 0f 1f 40 00 66 2e 0f 1f 84 00 00 [ 581.789882] RIP: blk_mq_end_request+0x58/0x70 RSP: ffffc90006ee7d50 [ 581.789889] ---[ end trace bcaf03d9a14a0a70 ]--- [2]. oops log2 [ 6984.857362] BUG: unable to handle kernel NULL pointer dereference at 0000000000000010 [ 6984.857372] IP: nvme_queue_rq+0x6e6/0x8cd [nvme] [ 6984.857373] PGD 0 [ 6984.857374] [ 6984.857376] Oops: 0000 [#1] SMP [ 6984.857379] Modules linked in: ipmi_ssif vfat fat intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel ipmi_si iTCO_wdt iTCO_vendor_support mxm_wmi ipmi_devintf intel_cstate sg dcdbas intel_uncore mei_me intel_rapl_perf mei pcspkr lpc_ich ipmi_msghandler shpchp acpi_power_meter wmi nfsd auth_rpcgss dm_multipath nfs_acl lockd grace sunrpc ip_tables xfs libcrc32c sd_mod mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect crc32c_intel sysimgblt fb_sys_fops ttm nvme drm nvme_core ahci libahci i2c_core tg3 libata ptp megaraid_sas pps_core fjes dm_mirror dm_region_hash dm_log dm_mod [ 6984.857416] CPU: 7 PID: 1635 Comm: kworker/7:1H Not tainted 4.10.0-2.el7.bz1420297.x86_64 #1 [ 6984.857417] Hardware name: Dell Inc. PowerEdge R730xd/072T6D, BIOS 2.2.5 09/06/2016 [ 6984.857427] Workqueue: kblockd blk_mq_run_work_fn [ 6984.857429] task: ffff880476e3da00 task.stack: ffffc90002e90000 [ 6984.857432] RIP: 0010:nvme_queue_rq+0x6e6/0x8cd [nvme] [ 6984.857433] RSP: 0018:ffffc90002e93c50 EFLAGS: 00010246 [ 6984.857434] RAX: 0000000000000000 RBX: ffff880275646600 RCX: 0000000000001000 [ 6984.857435] RDX: 0000000000000fff RSI: 00000002fba2a000 RDI: ffff8804734e6950 [ 6984.857436] RBP: ffffc90002e93d30 R08: 0000000000002000 R09: 0000000000001000 [ 6984.857437] R10: 0000000000001000 R11: 0000000000000000 R12: ffff8804741d8000 [ 6984.857438] R13: 0000000000000040 R14: ffff880475649f80 R15: ffff8804734e6780 [ 6984.857439] FS: 0000000000000000(0000) GS:ffff88047fcc0000(0000) knlGS:0000000000000000 [ 6984.857440] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 6984.857442] CR2: 0000000000000010 CR3: 0000000001c09000 CR4: 00000000001406e0 [ 6984.857443] Call Trace: [ 6984.857451] ? mempool_free+0x2b/0x80 [ 6984.857455] ? bio_free+0x4e/0x60 [ 6984.857459] blk_mq_dispatch_rq_list+0xf5/0x230 [ 6984.857462] blk_mq_process_rq_list+0x133/0x170 [ 6984.857465] __blk_mq_run_hw_queue+0x8c/0xa0 [ 6984.857467] blk_mq_run_work_fn+0x12/0x20 [ 6984.857473] process_one_work+0x165/0x410 [ 6984.857475] worker_thread+0x137/0x4c0 [ 6984.857478] kthread+0x101/0x140 [ 6984.857480] ? rescuer_thread+0x3b0/0x3b0 [ 6984.857481] ? kthread_park+0x90/0x90 [ 6984.857489] ret_from_fork+0x2c/0x40 [ 6984.857490] Code: 8b bd 70 ff ff ff 89 95 50 ff ff ff 89 8d 58 ff ff ff 44 89 95 60 ff ff ff e8 b7 dd 12 e1 8b 95 50 ff ff ff 48 89 85 68 ff ff ff <4c> 8b 48 10 44 8b 58 18 8b 8d 58 ff ff ff 44 8b 95 60 ff ff ff [ 6984.857511] RIP: nvme_queue_rq+0x6e6/0x8cd [nvme] RSP: ffffc90002e93c50 [ 6984.857512] CR2: 0000000000000010 [ 6984.895359] ---[ end trace 2d7ceb528432bf83 ]--- Reported-by: Yi Zhang <yizhan@redhat.com> Tested-by: Yi Zhang <yizhan@redhat.com> Reviewed-by: Bart Van Assche <bart.vanassche@sandisk.com> Reviewed-by: Hannes Reinecke <hare@suse.com> Signed-off-by: Ming Lei <tom.leiming@gmail.com> Signed-off-by: Jens Axboe <axboe@fb.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Tejun Heo	62f6341c85	cgroup, net_cls: iterate the fds of only the tasks which are being migrated commit `a05d4fd917` upstream. The net_cls controller controls the classid field of each socket which is associated with the cgroup. Because the classid is per-socket attribute, when a task migrates to another cgroup or the configured classid of the cgroup changes, the controller needs to walk all sockets and update the classid value, which was implemented by `3b13758f51` ("cgroups: Allow dynamically changing net_classid"). While the approach is not scalable, migrating tasks which have a lot of fds attached to them is rare and the cost is born by the ones initiating the operations. However, for simplicity, both the migration and classid config change paths call update_classid() which scans all fds of all tasks in the target css. This is an overkill for the migration path which only needs to cover a much smaller subset of tasks which are actually getting migrated in. On cgroup v1, this can lead to unexpected scalability issues when one tries to migrate a task or process into a net_cls cgroup which already contains a lot of fds. Even if the migration traget doesn't have many to get scanned, update_classid() ends up scanning all fds in the target cgroup which can be extremely numerous. Unfortunately, on cgroup v2 which doesn't use net_cls, the problem is even worse. Before `bfc2cf6f61` ("cgroup: call subsys->attach() only for subsystems which are actually affected by migration"), cgroup core would call the ->css_attach callback even for controllers which don't see actual migration to a different css. As net_cls is always disabled but still mounted on cgroup v2, whenever a process is migrated on the cgroup v2 hierarchy, net_cls sees identity migration from root to root and cgroup core used to call ->css_attach callback for those. The net_cls ->css_attach ends up calling update_classid() on the root net_cls css to which all processes on the system belong to as the controller isn't used. This makes any cgroup v2 migration O(total_number_of_fds_on_the_system) which is horrible and easily leads to noticeable stalls triggering RCU stall warnings and so on. The worst symptom is already fixed in upstream by `bfc2cf6f61` ("cgroup: call subsys->attach() only for subsystems which are actually affected by migration"); however, backporting that commit is too invasive and we want to avoid other cases too. This patch updates net_cls's cgrp_attach() to iterate fds of only the processes which are actually getting migrated. This removes the surprising migration cost which is dependent on the total number of fds in the target cgroup. As this leaves write_classid() the only user of update_classid(), open-code the helper into write_classid(). Reported-by: David Goode <dgoode@fb.com> Fixes: `3b13758f51` ("cgroups: Allow dynamically changing net_classid") Cc: Nina Schiff <ninasc@fb.com> Cc: David S. Miller <davem@davemloft.net> Signed-off-by: Tejun Heo <tj@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Viresh Kumar	f565084692	cpufreq: Restore policy min/max limits on CPU online commit `ff010472fb` upstream. On CPU online the cpufreq core restores the previous governor (or the previous "policy" setting for ->setpolicy drivers), but it does not restore the min/max limits at the same time, which is confusing, inconsistent and real pain for users who set the limits and then suspend/resume the system (using full suspend), in which case the limits are reset on all CPUs except for the boot one. Fix this by making cpufreq_online() restore the limits when an inactive policy is brought online. The commit log and patch are inspired from Rafael's earlier work. Reported-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:27 +02:00
Neeraj Upadhyay	b9ed800f71	arm64: kaslr: Fix up the kernel image alignment commit `afd0e5a876` upstream. If kernel image extends across alignment boundary, existing code increases the KASLR offset by size of kernel image. The offset is masked after resizing. There are cases, where after masking, we may still have kernel image extending across boundary. This eventually results in only 2MB block getting mapped while creating the page tables. This results in data aborts while accessing unmapped regions during second relocation (with kaslr offset) in __primary_switch. To fix this problem, round up the kernel image size, by swapper block size, before adding it for correction. For example consider below case, where kernel image still crosses 1GB alignment boundary, after masking the offset, which is fixed by rounding up kernel image size. SWAPPER_TABLE_SHIFT = 30 Swapper using section maps with section size 2MB. CONFIG_PGTABLE_LEVELS = 3 VA_BITS = 39 _text : 0xffffff8008080000 _end : 0xffffff800aa1b000 offset : 0x1f35600000 mask = ((1UL << (VA_BITS - 2)) - 1) & ~(SZ_2M - 1) (_text + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7c (_end + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7d offset after existing correction (before mask) = 0x1f37f9b000 (_text + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7d (_end + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7d offset (after mask) = 0x1f37e00000 (_text + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7c (_end + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7d new offset w/ rounding up = 0x1f38000000 (_text + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7d (_end + offset) >> SWAPPER_TABLE_SHIFT = 0x3fffffe7d Fixes: `f80fb3a3d5` ("arm64: add support for kernel ASLR") Reviewed-by: Ard Biesheuvel <ard.biesheuvel@linaro.org> Signed-off-by: Neeraj Upadhyay <neeraju@codeaurora.org> Signed-off-by: Srinivas Ramana <sramana@codeaurora.org> Signed-off-by: Will Deacon <will.deacon@arm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Nicolas Ferre	2ab97521ce	ARM: at91: pm: cpu_idle: switch DDR to power-down mode commit `60b89f1928` upstream. On some DDR controllers, compatible with the sama5d3 one, the sequence to enter/exit/re-enter the self-refresh mode adds more constrains than what is currently written in the at91_idle driver. An actual access to the DDR chip is needed between exit and re-enter of this mode which is somehow difficult to implement. This sequence can completely hang the SoC. It is particularly experienced on parts which embed a L2 cache if the code run between IDLE calls fits in it... Moreover, as the intention is to enter and exit pretty rapidly from IDLE, the power-down mode is a good candidate. So now we use power-down instead of self-refresh. As we can simplify the code for sama5d3 compatible DDR controllers, we instantiate a new sama5d3_ddr_standby() function. Signed-off-by: Nicolas Ferre <nicolas.ferre@microchip.com> Fixes: `017b5522d5` ("ARM: at91: Add new binding for sama5d3-ddramc") Signed-off-by: Alexandre Belloni <alexandre.belloni@free-electrons.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Romain Izard	ca5477ad19	Revert "ARM: at91/dt: sama5d2: Use new compatible for ohci node" commit `9e10889a31` upstream. This reverts commit `cab4328268` ("ARM: at91/dt: sama5d2: Use new compatible for ohci node") It depends from commit `7150bc9b4d` ("usb: ohci-at91: Forcibly suspend ports while USB suspend") which was reverted and implemented differently. With the new implementation, the compatible string must remain the same. The compatible string introduced by this commit has been used in the default SAMA5D2 dtsi starting from Linux 4.8. As it has never been working correctly in an official release, removing it should not be breaking the stability rules. Fixes: `cab4328268` ("ARM: at91/dt: sama5d2: Use new compatible for ohci node") Signed-off-by: Romain Izard <romain.izard.pro@gmail.com> Signed-off-by: Alexandre Belloni <alexandre.belloni@free-electrons.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Koos Vriezen	352c0214b7	iommu/vt-d: Fix NULL pointer dereference in device_to_iommu commit `5003ae1e73` upstream. The function device_to_iommu() in the Intel VT-d driver lacks a NULL-ptr check, resulting in this oops at boot on some platforms: BUG: unable to handle kernel NULL pointer dereference at 00000000000007ab IP: [<ffffffff8132234a>] device_to_iommu+0x11a/0x1a0 PGD 0 [...] Call Trace: ? find_or_alloc_domain.constprop.29+0x1a/0x300 ? dw_dma_probe+0x561/0x580 [dw_dmac_core] ? __get_valid_domain_for_dev+0x39/0x120 ? __intel_map_single+0x138/0x180 ? intel_alloc_coherent+0xb6/0x120 ? sst_hsw_dsp_init+0x173/0x420 [snd_soc_sst_haswell_pcm] ? mutex_lock+0x9/0x30 ? kernfs_add_one+0xdb/0x130 ? devres_add+0x19/0x60 ? hsw_pcm_dev_probe+0x46/0xd0 [snd_soc_sst_haswell_pcm] ? platform_drv_probe+0x30/0x90 ? driver_probe_device+0x1ed/0x2b0 ? __driver_attach+0x8f/0xa0 ? driver_probe_device+0x2b0/0x2b0 ? bus_for_each_dev+0x55/0x90 ? bus_add_driver+0x110/0x210 ? 0xffffffffa11ea000 ? driver_register+0x52/0xc0 ? 0xffffffffa11ea000 ? do_one_initcall+0x32/0x130 ? free_vmap_area_noflush+0x37/0x70 ? kmem_cache_alloc+0x88/0xd0 ? do_init_module+0x51/0x1c4 ? load_module+0x1ee9/0x2430 ? show_taint+0x20/0x20 ? kernel_read_file+0xfd/0x190 ? SyS_finit_module+0xa3/0xb0 ? do_syscall_64+0x4a/0xb0 ? entry_SYSCALL64_slow_path+0x25/0x25 Code: 78 ff ff ff 4d 85 c0 74 ee 49 8b 5a 10 0f b6 9b e0 00 00 00 41 38 98 e0 00 00 00 77 da 0f b6 eb 49 39 a8 88 00 00 00 72 ce eb 8f <41> f6 82 ab 07 00 00 04 0f 85 76 ff ff ff 0f b6 4d 08 88 0e 49 RIP [<ffffffff8132234a>] device_to_iommu+0x11a/0x1a0 RSP <ffffc90001457a78> CR2: 00000000000007ab ---[ end trace 16f974b6d58d0aad ]--- Add the missing pointer check. Fixes: `1c387188c6` ("iommu/vt-d: Fix IOMMU lookup for SR-IOV Virtual Functions") Signed-off-by: Koos Vriezen <koos.vriezen@gmail.com> Signed-off-by: Joerg Roedel <jroedel@suse.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Ankur Arora	bc63212d22	xen/acpi: upload PM state from init-domain to Xen commit `1914f0cd20` upstream. This was broken in commit `cd979883b9` ("xen/acpi-processor: fix enabling interrupts on syscore_resume"). do_suspend (from xen/manage.c) and thus xen_resume_notifier never get called on the initial-domain at resume (it is if running as guest.) The rationale for the breaking change was that upload_pm_data() potentially does blocking work in syscore_resume(). This patch addresses the original issue by scheduling upload_pm_data() to execute in workqueue context. Cc: Stanislaw Gruszka <sgruszka@redhat.com> Based-on-patch-by: Konrad Wilk <konrad.wilk@oracle.com> Reviewed-by: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com> Reviewed-by: Stanislaw Gruszka <sgruszka@redhat.com> Signed-off-by: Ankur Arora <ankur.a.arora@oracle.com> Signed-off-by: Boris Ostrovsky <boris.ostrovsky@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Gary R Hook	b1ef9daa50	crypto: ccp - Assign DMA commands to the channel's CCP commit `7c468447f4` upstream. The CCP driver generally uses a round-robin approach when assigning operations to available CCPs. For the DMA engine, however, the DMA mappings of the SGs are associated with a specific CCP. When an IOMMU is enabled, the IOMMU is programmed based on this specific device. If the DMA operations are not performed by that specific CCP then addressing errors and I/O page faults will occur. Update the CCP driver to allow a specific CCP device to be requested for an operation and use this in the DMA engine support. Signed-off-by: Gary R Hook <gary.hook@amd.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Brian Norris	cb6e07ba58	mwifiex: pcie: don't leak DMA buffers when removing commit `4e841d3eb9` upstream. When PCIe FLR support was added, much of the remove/release code for PCIe was migrated to ->down_dev(), but ->down_dev() is never called for device removal. Let's refactor the cleanup to be done in both cases. Also, drop the comments above mwifiex_cleanup_pcie(), because they were clearly wrong, and it's better to have clear and obvious code than to detail the code steps in comments anyway. Fixes: `4c5dae59d2` ("mwifiex: add PCIe function level reset support") Signed-off-by: Brian Norris <briannorris@chromium.org> Signed-off-by: Kalle Valo <kvalo@codeaurora.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Chen-Yu Tsai	35ef543eb3	clk: sunxi-ng: mp: Adjust parent rate for pre-dividers commit `ac8616e4c8` upstream. The MP style clocks support an mux with pre-dividers. While the driver correctly accounted for them in the .determine_rate callback, it did not in the .recalc_rate and .set_rate callbacks. This means when calculating the factors in the .set_rate callback, they would be off by a factor of the active pre-divider. Same goes for reading back the clock rate after it is set. Fixes: `2ab836db50` ("clk: sunxi-ng: Add M-P factor clock support") Signed-off-by: Chen-Yu Tsai <wens@csie.org> Signed-off-by: Maxime Ripard <maxime.ripard@free-electrons.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Chen-Yu Tsai	867f7804ce	clk: sunxi-ng: sun6i: Fix enable bit offset for hdmi-ddc module clock commit `9ad0bb39fc` upstream. The enable bit offset for the hdmi-ddc module clock is wrong. It is pointing to the main hdmi module clock enable bit. Reported-by: Bob Ham <rah@settrans.net> Fixes: `c6e6c96d8f` ("clk: sunxi-ng: Add A31/A31s clocks") Signed-off-by: Chen-Yu Tsai <wens@csie.org> Signed-off-by: Maxime Ripard <maxime.ripard@free-electrons.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Prarit Bhargava	c445f99696	hwrng: geode - Revert managed API changes commit `8c75704ebc` upstream. After commit `e9afc74629` ("hwrng: geode - Use linux/io.h instead of asm/io.h") the geode-rng driver uses devres with pci_dev->dev to keep track of resources, but does not actually register a PCI driver. This results in the following issues: 1. The driver leaks memory because the driver does not attach to a device. The driver only uses the PCI device as a reference. devm_*() functions will release resources on driver detach, which the geode-rng driver will never do. As a result, 2. The driver cannot be reloaded because there is always a use of the ioport and region after the first load of the driver. Revert the changes made by `e9afc74629` ("hwrng: geode - Use linux/io.h instead of asm/io.h"). Signed-off-by: Prarit Bhargava <prarit@redhat.com> Fixes: `6e9b5e7688` ("hwrng: geode - Migrate to managed API") Cc: Matt Mackall <mpm@selenic.com> Cc: Corentin LABBE <clabbe.montjoie@gmail.com> Cc: PrasannaKumar Muralidharan <prasannatsmkumar@gmail.com> Cc: Wei Yongjun <weiyongjun1@huawei.com> Cc: linux-crypto@vger.kernel.org Cc: linux-geode@lists.infradead.org Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Prarit Bhargava	5d6f7b3611	hwrng: amd - Revert managed API changes commit `69db700931` upstream. After commit `31b2a73c9c` ("hwrng: amd - Migrate to managed API"), the amd-rng driver uses devres with pci_dev->dev to keep track of resources, but does not actually register a PCI driver. This results in the following issues: 1. The message WARNING: CPU: 2 PID: 621 at drivers/base/dd.c:349 driver_probe_device+0x38c is output when the i2c_amd756 driver loads and attempts to register a PCI driver. The PCI & device subsystems assume that no resources have been registered for the device, and the WARN_ON() triggers since amd-rng has already do so. 2. The driver leaks memory because the driver does not attach to a device. The driver only uses the PCI device as a reference. devm_*() functions will release resources on driver detach, which the amd-rng driver will never do. As a result, 3. The driver cannot be reloaded because there is always a use of the ioport and region after the first load of the driver. Revert the changes made by `31b2a73c9c` ("hwrng: amd - Migrate to managed API"). Signed-off-by: Prarit Bhargava <prarit@redhat.com> Fixes: `31b2a73c9c` ("hwrng: amd - Migrate to managed API"). Cc: Matt Mackall <mpm@selenic.com> Cc: Corentin LABBE <clabbe.montjoie@gmail.com> Cc: PrasannaKumar Muralidharan <prasannatsmkumar@gmail.com> Cc: Wei Yongjun <weiyongjun1@huawei.com> Cc: linux-crypto@vger.kernel.org Cc: linux-geode@lists.infradead.org Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Adrian Hunter	0d69a88041	mmc: sdhci-pci: Do not disable interrupts in sdhci_intel_set_power commit `027fb89e61` upstream. Disabling interrupts for even a millisecond can cause problems for some devices. That can happen when Intel host controllers wait for the present state to propagate. The spin lock is not necessary here. Anything that is racing with changes to the I/O state is already broken. The mmc core already provides synchronization via "claiming" the host. Although the spin lock probably should be removed from the code paths that lead to this point, such a patch would touch too much code to be suitable for stable trees. Consequently, for this patch, just drop the spin lock while waiting. Signed-off-by: Adrian Hunter <adrian.hunter@intel.com> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Tested-by: Ludovic Desroches <ludovic.desroches@microchip.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:26 +02:00
Adrian Hunter	b43ba21b35	mmc: sdhci: Do not disable interrupts while waiting for clock commit `e2ebfb2142` upstream. Disabling interrupts for even a millisecond can cause problems for some devices. That can happen when sdhci changes clock frequency because it waits for the clock to become stable under a spin lock. The spin lock is not necessary here. Anything that is racing with changes to the I/O state is already broken. The mmc core already provides synchronization via "claiming" the host. Although the spin lock probably should be removed from the code paths that lead to this point, such a patch would touch too much code to be suitable for stable trees. Consequently, for this patch, just drop the spin lock while waiting. Signed-off-by: Adrian Hunter <adrian.hunter@intel.com> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Tested-by: Ludovic Desroches <ludovic.desroches@microchip.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Anssi Hannula	2ab2e87911	mmc: sdhci-of-arasan: fix incorrect timeout clock commit `16681037e7` upstream. sdhci_arasan_get_timeout_clock() divides the frequency it has with (1 << (13 + divisor)). However, the divisor is not some Arasan-specific value, but instead is just the Data Timeout Counter Value from the SDHCI Timeout Control Register. Applying it here like this is wrong as the sdhci driver already takes that value into account when calculating timeouts, and in fact it sets that register value based on how long a timeout is wanted. Additionally, sdhci core interprets the .get_timeout_clock callback return value as if it were read from hardware registers, i.e. the unit should be kHz or MHz depending on SDHCI_TIMEOUT_CLK_UNIT capability bit. This bit is set at least on the tested Zynq-7000 SoC. With the tested hardware (SDHCI_TIMEOUT_CLK_UNIT set) this results in too high a timeout clock rate being reported, causing the core to use longer-than-needed timeouts. Additionally, on a partitioned MMC (therefore having erase_group_def bit set) mmc_calc_max_discard() disables discard support as it looks like controller does not support the long timeouts needed for that. Do not apply the extra divisor and return the timeout clock in the expected unit. Tested with a Zynq-7000 SoC and a partitioned Toshiba THGBMAG5A1JBAWR eMMC card. Signed-off-by: Anssi Hannula <anssi.hannula@bitwise.fi> Fixes: `e3ec3a3d11` ("mmc: arasan: Add driver for Arasan SDHCI") Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Romain Izard	ace22e6fdb	mmc: sdhci-of-at91: Support external regulators commit `2ce0c7b655` upstream. The SDHCI controller in the SAMA5D2 chip requires a valid voltage set in the power control register, otherwise commands will fail with a timeout error. When using the regulator framework to specify the regulator used by the mmc device, the voltage is not configured, and it is not possible to use the connected device. Implement a custom 'set_power' function for this specific hardware, that configures the voltage in the register in all cases. Signed-off-by: Romain Izard <romain.izard.pro@gmail.com> Acked-by: Ludovic Desroches <ludovic.desroches@microchip.com> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Nicholas Piggin	1e360905f6	powerpc/64s: Fix idle wakeup potential to clobber registers commit `6d98ce0be5` upstream. We concluded there may be a window where the idle wakeup code could get to pnv_wakeup_tb_loss() (which clobbers non-volatile GPRs), but the hardware may set SRR1[46:47] to 01b (no state loss) which would result in the wakeup code failing to restore non-volatile GPRs. I was not able to trigger this condition with trivial tests on real hardware or simulator, but the ISA (at least 2.07) seems to allow for it, and Gautham says that it can happen if there is an exception pending when the sleep/winkle instruction is executed. Fixes: `1706567117` ("powerpc/kvm: make hypervisor state restore a function") Signed-off-by: Nicholas Piggin <npiggin@gmail.com> Acked-by: Gautham R. Shenoy <ego@linux.vnet.ibm.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Theodore Ts'o	9a9dc3ec7c	ext4: lock the xattr block before checksuming it commit `dac7a4b4b1` upstream. We must lock the xattr block before calculating or verifying the checksum in order to avoid spurious checksum failures. https://bugzilla.kernel.org/show_bug.cgi?id=193661 Reported-by: Colin Ian King <colin.king@canonical.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Eric Biggers	f572ba9a71	ext4: mark inode dirty after converting inline directory commit `b9cf625d6e` upstream. If ext4_convert_inline_data() was called on a directory with inline data, the filesystem was left in an inconsistent state (as considered by e2fsck) because the file size was not increased to cover the new block. This happened because the inode was not marked dirty after i_disksize was updated. Fix this by marking the inode dirty at the end of ext4_finish_convert_inline_dir(). This bug was probably not noticed before because most users mark the inode dirty afterwards for other reasons. But if userspace executed FS_IOC_SET_ENCRYPTION_POLICY with invalid parameters, as exercised by 'kvm-xfstests -c adv generic/396', then the inode was never marked dirty after updating i_disksize. Fixes: `3c47d54170` Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Theodore Ts'o <tytso@mit.edu> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Sudip Mukherjee	0f29bcfdcf	parport: fix attempt to write duplicate procfiles commit `03270c6ac6` upstream. Usually every parallel port will have a single pardev registered with it. But ppdev driver is an exception. This userspace parallel port driver allows to create multiple parrallel port devices for a single parallel port. And as a result we were having a nice warning like: "sysctl table check failed: /dev/parport/parport0/devices/ppdev0/timeslice Sysctl already exists" Use the same logic as used in parport_register_device() and register the proc files only once for each parallel port. Fixes: `6fa45a2268` ("parport: add device-model to parport subsystem") Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1414656 Bugzilla: https://bugs.archlinux.org/task/52322 Tested-by: James Feeney <james@nurealm.net> Signed-off-by: Sudip Mukherjee <sudip.mukherjee@codethink.co.uk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Dmitry Torokhov	1e6b503350	iio: magnetometer: ak8974: remove incorrect __exit markups commit `3ff861f59f` upstream. Even if bus is not hot-pluggable, devices can be unbound from the driver via sysfs, so we should not be using __exit annotations on remove() methods. The only exception is drivers registered with platform_driver_probe() which specifically disables sysfs bind/unbind attributes. Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Reviewed-by: Linus Walleij <linus.walleij@linaro.org> Signed-off-by: Jonathan Cameron <jic23@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Song Hongyan	9808ff1517	iio: hid-sensor-trigger: Change get poll value function order to avoid sensor properties losing after resume from S3 commit `3bec247474` upstream. In function _hid_sensor_power_state(), when hid_sensor_read_poll_value() is called, sensor's all properties will be updated by the value from sensor hardware/firmware. In some implementation, sensor hardware/firmware will do a power cycle during S3. In this case, after resume, once hid_sensor_read_poll_value() is called, sensor's all properties which are kept by driver during S3 will be changed to default value. But instead, if a set feature function is called first, sensor hardware/firmware will be recovered to the last status. So change the sensor_hub_set_feature() calling order to behind of set feature function to avoid sensor properties lose. Signed-off-by: Song Hongyan <hongyan.song@intel.com> Acked-by: Srinivas Pandruvada <srinivas.pandruvada@linux.intel.com> Signed-off-by: Jonathan Cameron <jic23@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Lars-Peter Clausen	cf43d679cc	iio: sw-device: Fix config group initialization commit `c42f821861` upstream. Use the IS_ENABLED() helper macro to ensure that the configfs group is initialized either when configfs is built-in or when configfs is built as a module. Otherwise software device creation will result in undefined behaviour when configfs is built as a module since the configfs group for the device not properly initialized. Similar to commit `b2f0c09664` ("iio: sw-trigger: Fix config group initialization"). Fixes: `0f3a8c3f34` ("iio: Add support for creating IIO devices via configfs") Reported-by: Miguel Robles <miguel.robles@farole.net> Signed-off-by: Lars-Peter Clausen <lars@metafoo.de> Acked-by: Daniel Baluta <daniel.baluta@gmail.com> Signed-off-by: Jonathan Cameron <jic23@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Michael Engl	ec271b0e05	iio: adc: ti_am335x_adc: fix fifo overrun recovery commit `e83bb3e6f3` upstream. The tiadc_irq_h(int irq, void *private) function is handling FIFO overruns by clearing flags, disabling and enabling the ADC to recover. If the ADC is running in continuous mode a FIFO overrun happens regularly. If the disabling of the ADC happens concurrently with a new conversion. It might happen that the enabling of the ADC is ignored by the hardware. This stops the ADC permanently. No more interrupts are triggered. According to the AM335x Reference Manual (SPRUH73H October 2011 - Revised April 2013 - Chapter 12.4 and 12.5) it is necessary to check the ADC FSM bits in REG_ADCFSM before enabling the ADC again. Because the disabling of the ADC is done right after the current conversion has been finished. To trigger this bug it is necessary to run the ADC in continuous mode. The ADC values of all channels need to be read in an endless loop. The bug appears within the first 6 hours (~5.4 million handled FIFO overruns). The user space application will hang on reading new values from the character device. Fixes: `ca9a563805` ("iio: ti_am335x_adc: Add continuous sampling support") Signed-off-by: Michael Engl <michael.engl@wjw-solutions.com> Signed-off-by: Jonathan Cameron <jic23@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:25 +02:00
Johan Hovold	dfcacd07bf	mmc: ushc: fix NULL-deref at probe commit `181302dc72` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Fixes: `53f3a9e26e` ("mmc: USB SD Host Controller (USHC) driver") Cc: David Vrabel <david.vrabel@csr.com> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	2e7486a05b	uwb: hwa-rc: fix NULL-deref at probe commit `daf229b159` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Note that the dereference happens in the start callback which is called during probe. Fixes: `de520b8bd5` ("uwb: add HWA radio controller driver") Cc: Inaky Perez-Gonzalez <inaky.perez-gonzalez@intel.com> Cc: David Vrabel <david.vrabel@csr.com> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	f01c54fe0a	uwb: i1480-dfu: fix NULL-deref at probe commit `4ce362711d` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Note that the dereference happens in the cmd and wait_init_done callbacks which are called during probe. Fixes: `1ba47da527` ("uwb: add the i1480 DFU driver") Cc: Inaky Perez-Gonzalez <inaky.perez-gonzalez@intel.com> Cc: David Vrabel <david.vrabel@csr.com> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	5e50afb227	USB: usbtmc: fix probe error path commit `2e47c53503` upstream. Make sure to initialise the return value to avoid having allocation failures going unnoticed when allocating interrupt-endpoint resources. This prevents use-after-free or worse when the device is later unbound. Fixes: `dbf3e7f654` ("Implement an ioctl to support the USMTMC-USB488 READ_STATUS_BYTE operation.") Cc: Dave Penkler <dpenkler@gmail.com> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	73c4e2ede5	USB: usbtmc: add missing endpoint sanity check commit `687e0687f7` upstream. USBTMC devices are required to have a bulk-in and a bulk-out endpoint, but the driver failed to verify this, something which could lead to the endpoint addresses being taken from uninitialised memory. Make sure to zero all private data as part of allocation, and add the missing endpoint sanity check. Note that this also addresses a more recently introduced issue, where the interrupt-in-presence flag would also be uninitialised whenever the optional interrupt-in endpoint is not present. This in turn could lead to an interrupt urb being allocated, initialised and submitted based on uninitialised values. Fixes: `dbf3e7f654` ("Implement an ioctl to support the USMTMC-USB488 READ_STATUS_BYTE operation.") Fixes: `5b775f672c` ("USB: add USB test and measurement class driver") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Guenter Roeck	4e61588677	usb: hub: Fix crash after failure to read BOS descriptor commit `7b2db29fbb` upstream. If usb_get_bos_descriptor() returns an error, usb->bos will be NULL. Nevertheless, it is dereferenced unconditionally in hub_set_initial_usb2_lpm_policy() if usb2_hw_lpm_capable is set. This results in a crash. usb 5-1: unable to get BOS descriptor ... Unable to handle kernel NULL pointer dereference at virtual address 00000008 pgd = ffffffc00165f000 [00000008] pgd=000000000174f003, pud=000000000174f003, pmd=0000000001750003, pte=00e8000001751713 Internal error: Oops: 96000005 [#1] PREEMPT SMP Modules linked in: uinput uvcvideo videobuf2_vmalloc cmac [ ... ] CPU: 5 PID: 3353 Comm: kworker/5:3 Tainted: G B 4.4.52 #480 Hardware name: Google Kevin (DT) Workqueue: events driver_set_config_work task: ffffffc0c3690000 ti: ffffffc0ae9a8000 task.ti: ffffffc0ae9a8000 PC is at hub_port_init+0xc3c/0xd10 LR is at hub_port_init+0xc3c/0xd10 ... Call trace: [<ffffffc0007fbbfc>] hub_port_init+0xc3c/0xd10 [<ffffffc0007fbe2c>] usb_reset_and_verify_device+0x15c/0x82c [<ffffffc0007fc5e0>] usb_reset_device+0xe4/0x298 [<ffffffbffc0e3fcc>] rtl8152_probe+0x84/0x9b0 [r8152] [<ffffffc00080ca8c>] usb_probe_interface+0x244/0x2f8 [<ffffffc000774a24>] driver_probe_device+0x180/0x3b4 [<ffffffc000774e48>] __device_attach_driver+0xb4/0xe0 [<ffffffc000772168>] bus_for_each_drv+0xb4/0xe4 [<ffffffc0007747ec>] __device_attach+0xd0/0x158 [<ffffffc000775080>] device_initial_probe+0x24/0x30 [<ffffffc0007739d4>] bus_probe_device+0x50/0xe4 [<ffffffc000770bd0>] device_add+0x414/0x738 [<ffffffc000809fe8>] usb_set_configuration+0x89c/0x914 [<ffffffc00080a120>] driver_set_config_work+0xc0/0xf0 [<ffffffc000249bb8>] process_one_work+0x390/0x6b8 [<ffffffc00024abcc>] worker_thread+0x480/0x610 [<ffffffc000251a80>] kthread+0x164/0x178 [<ffffffc0002045d0>] ret_from_fork+0x10/0x40 Since we don't know anything about LPM capabilities without BOS descriptor, don't attempt to enable LPM if it is not available. Fixes: `890dae8867` ("xhci: Enable LPM support only for hardwired ...") Cc: Mathias Nyman <mathias.nyman@linux.intel.com> Signed-off-by: Guenter Roeck <linux@roeck-us.net> Acked-by: Mathias Nyman <mathias.nyman@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Bin Liu	51a3f3ec53	usb: musb: cppi41: don't check early-TX-interrupt for Isoch transfer commit `0090114d33` upstream. The CPPI 4.1 driver polls register to workaround the premature TX interrupt issue, but it causes audio playback underrun when triggered in Isoch transfers. Isoch doesn't do back-to-back transfers, the TX should be done by the time the next transfer is scheduled. So skip this polling workaround for Isoch transfer. Fixes: `a655f481d8` ("usb: musb: musb_cppi41: handle pre-mature TX complete interrupt") Reported-by: Alexandre Bailon <abailon@baylibre.com> Acked-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de> Tested-by: Alexandre Bailon <abailon@baylibre.com> Signed-off-by: Bin Liu <b-liu@ti.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	46aeeb2ac1	USB: wusbcore: fix NULL-deref at probe commit `03ace948a4` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer or accessing memory beyond the endpoint array should a malicious device lack the expected endpoints. This specifically fixes the NULL-pointer dereference when probing HWA HC devices. Fixes: `df3654236e` ("wusb: add the Wire Adapter (WA) core") Cc: Inaky Perez-Gonzalez <inaky.perez-gonzalez@intel.com> Cc: David Vrabel <david.vrabel@csr.com> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	a4887ea588	USB: idmouse: fix NULL-deref at probe commit `b0addd3fa6` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Fixes: `1da177e4c3` ("Linux-2.6.12-rc2") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	21e0621e3c	USB: lvtest: fix NULL-deref at probe commit `1dc56c52d2` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should the probed device lack endpoints. Note that this driver does not bind to any devices by default. Fixes: `ce21bfe603` ("USB: Add LVS Test device driver") Cc: Pratyush Anand <pratyush.anand@gmail.com> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Johan Hovold	e4942716bb	USB: uss720: fix NULL-deref at probe commit `f259ca3eed` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer or accessing memory beyond the endpoint array should a malicious device lack the expected endpoints. Note that the endpoint access that causes the NULL-deref is currently only used for debugging purposes during probe so the oops only happens when dynamic debugging is enabled. This means the driver could be rewritten to continue to accept device with only two endpoints, should such devices exist. Fixes: `1da177e4c3` ("Linux-2.6.12-rc2") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Samuel Thibault	222ccd4f83	usb-core: Add LINEAR_FRAME_INTR_BINTERVAL USB quirk commit `3243367b20` upstream. Some USB 2.0 devices erroneously report millisecond values in bInterval. The generic config code manages to catch most of them, but in some cases it's not completely enough. The case at stake here is a USB 2.0 braille device, which wants to announce 10ms and thus sets bInterval to 10, but with the USB 2.0 computation that yields to 64ms. It happens that one can type fast enough to reach this interval and get the device buffers overflown, leading to problematic latencies. The generic config code does not catch this case because the 64ms is considered a sane enough value. This change thus adds a USB_QUIRK_LINEAR_FRAME_INTR_BINTERVAL quirk to mark devices which actually report milliseconds in bInterval, and marks Vario Ultra devices as needing it. Signed-off-by: Samuel Thibault <samuel.thibault@ens-lyon.org> Acked-by: Alan Stern <stern@rowland.harvard.edu> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:24 +02:00
Roger Quadros	d8ec35ad8b	usb: gadget: f_uvc: Fix SuperSpeed companion descriptor's wBytesPerInterval commit `09424c50b7` upstream. The streaming_maxburst module parameter is 0 offset (0..15) so we must add 1 while using it for wBytesPerInterval calculation for the SuperSpeed companion descriptor. Without this host uvcvideo driver will always see the wrong wBytesPerInterval for SuperSpeed uvc gadget and may not find a suitable video interface endpoint. e.g. for streaming_maxburst = 0 case it will always fail as wBytePerInterval was evaluating to 0. Reviewed-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com> Signed-off-by: Roger Quadros <rogerq@ti.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Oliver Neukum	ff8d97a822	ACM gadget: fix endianness in notifications commit `cdd7928df0` upstream. The gadget code exports the bitfield for serial status changes over the wire in its internal endianness. The fix is to convert to little endian before sending it over the wire. Signed-off-by: Oliver Neukum <oneukum@suse.com> Tested-by: 家瑋 <momo1208@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Bjørn Mork	381420a46a	USB: serial: qcserial: add Dell DW5811e commit `436ecf5519` upstream. This is a Dell branded Sierra Wireless EM7455. Signed-off-by: Bjørn Mork <bjorn@mork.no> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Dan Williams	21586e4dcc	USB: serial: option: add Quectel UC15, UC20, EC21, and EC25 modems commit `6e9f44eaae` upstream. Add Quectel UC15, UC20, EC21, and EC25. The EC20 is handled by qcserial due to a USB VID/PID conflict with an existing Acer device. Signed-off-by: Dan Williams <dcbw@redhat.com> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Hui Wang	0fe8771373	ALSA: hda - Adding a group of pin definition to fix headset problem commit `3f307834e6` upstream. A new Dell laptop needs to apply ALC269_FIXUP_DELL1_MIC_NO_PRESENCE to fix the headset problem, and the pin definiton of this machine is not in the pin quirk table yet, now adding it to the table. Signed-off-by: Hui Wang <hui.wang@canonical.com> Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Takashi Iwai	e800ff1fad	ALSA: ctxfi: Fix the incorrect check of dma_set_mask() call commit `f363a06642` upstream. In the commit [`15c75b09f8`: ALSA: ctxfi: Fallback DMA mask to 32bit], I forgot to put "!" at dam_set_mask() call check in cthw20k1.c (while cthw20k2.c is OK). This patch fixes that obvious bug. (As a side note: although the original commit was completely wrong, it's still working for most of machines, as it sets to 32bit DMA mask in the end. So the bug severity is low.) Fixes: `15c75b09f8` ("ALSA: ctxfi: Fallback DMA mask to 32bit") Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Takashi Iwai	ca79952dfc	ALSA: seq: Fix racy cell insertions during snd_seq_pool_done() commit `c520ff3d03` upstream. When snd_seq_pool_done() is called, it marks the closing flag to refuse the further cell insertions. But snd_seq_pool_done() itself doesn't clear the cells but just waits until all cells are cleared by the caller side. That is, it's racy, and this leads to the endless stall as syzkaller spotted. This patch addresses the racy by splitting the setup of pool->closing flag out of snd_seq_pool_done(), and calling it properly before snd_seq_pool_done(). BugLink: http://lkml.kernel.org/r/CACT4Y+aqqy8bZA1fFieifNxR2fAfFQQABcBHj801+u5ePV0URw@mail.gmail.com Reported-and-tested-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: Takashi Iwai <tiwai@suse.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Johan Hovold	425823646e	Input: sur40 - validate number of endpoints before using them commit `92461f5d72` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer or accessing memory that lie beyond the end of the endpoint array should a malicious device lack the expected endpoints. Fixes: `bdb5c57f20` ("Input: add sur40 driver for Samsung SUR40... ") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Johan Hovold	b1feb4e5aa	Input: kbtab - validate number of endpoints before using them commit `cb1b494663` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Johan Hovold	83d920e464	Input: cm109 - validate number of endpoints before using them commit `ac2ee9ba95` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Fixes: `c04148f915` ("Input: add driver for USB VoIP phones with CM109...") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Johan Hovold	9f6a8dca98	Input: yealink - validate number of endpoints before using them commit `5cc4a1a9f5` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Fixes: `aca951a22a` ("[PATCH] input-driver-yealink-P1K-usb-phone") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Johan Hovold	d9a3e6424e	Input: hanwang - validate number of endpoints before using them commit `ba340d7b83` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Fixes: `bba5394ad3` ("Input: add support for Hanwang tablets") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:23 +02:00
Johan Hovold	1f5a512844	Input: ims-pcu - validate number of endpoints before using them commit `1916d31927` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack control-interface endpoints. Fixes: `628329d524` ("Input: add IMS Passenger Control Unit driver") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Johan Hovold	0afac79f66	Input: iforce - validate number of endpoints before using them commit `59cf8bed44` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer or accessing memory that lie beyond the end of the endpoint array should a malicious device lack the expected endpoints. Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Kai-Heng Feng	de2ba8081e	Input: i8042 - add noloop quirk for Dell Embedded Box PC 3000 commit `45838660e3` upstream. The aux port does not get detected without noloop quirk, so external PS/2 mouse cannot work as result. The PS/2 mouse can work with this quirk. BugLink: https://bugs.launchpad.net/bugs/1591053 Signed-off-by: Kai-Heng Feng <kai.heng.feng@canonical.com> Reviewed-by: Marcos Paulo de Souza <marcos.souza.org@gmail.com> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Matjaz Hegedic	3c9da7cfb7	Input: elan_i2c - add ASUS EeeBook X205TA special touchpad fw commit `92ef6f97a6` upstream. EeeBook X205TA is yet another ASUS device with a special touchpad firmware that needs to be accounted for during initialization, or else the touchpad will go into an invalid state upon suspend/resume. Adding the appropriate ic_type and product_id check fixes the problem. Signed-off-by: Matjaz Hegedic <matjaz.hegedic@gmail.com> Acked-by: KT Liao <kt.liao@emc.com.tw> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Masaki Ota	4ccef6bd3a	Input: ALPS - fix trackstick button handling on V8 devices commit `47e6fb4212` upstream. Alps stick devices always have physical buttons, so we should not check ALPS_BUTTONPAD flag to decide whether we should report them. Fixes: `4777ac220c` ("Input: ALPS - add touchstick support for SS5 hardware") Signed-off-by: Masaki Ota <masaki.ota@jp.alps.com> Acked-by: Pali Rohar <pali.rohar@gmail.com> Tested-by: Paul Donohue <linux-kernel@PaulSD.com> Tested-by: Nick Fletcher <nick.m.fletcher@gmail.com> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Masaki Ota	81643d0003	Input: ALPS - fix V8+ protocol handling (73 03 28) commit `e7348396c6` upstream. Devices identified as E7="73 03 28" use slightly modified version of V8 protocol, with lower count per electrode, different offsets, and different feature bits in OTP data. Fixes: `aeaa881f9b` ("Input: ALPS - set DualPoint flag for 74 03 28 devices") Signed-off-by: Masaki Ota <masaki.ota@jp.alps.com> Acked-by: Pali Rohar <pali.rohar@gmail.com> Tested-by: Paul Donohue <linux-kernel@PaulSD.com> Tested-by: Nick Fletcher <nick.m.fletcher@gmail.com> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Eric Dumazet	9e13bcef24	tcp: initialize icsk_ack.lrcvtime at session start time [ Upstream commit `15bb7745e9` ] icsk_ack.lrcvtime has a 0 value at socket creation time. tcpi_last_data_recv can have bogus value if no payload is ever received. This patch initializes icsk_ack.lrcvtime for active sessions in tcp_finish_connect(), and for passive sessions in tcp_create_openreq_child() Signed-off-by: Eric Dumazet <edumazet@google.com> Acked-by: Neal Cardwell <ncardwell@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Daniel Borkmann	a53ea6051c	socket, bpf: fix sk_filter use after free in sk_clone_lock [ Upstream commit `a97e50cc4c` ] In sk_clone_lock(), we create a new socket and inherit most of the parent's members via sock_copy() which memcpy()'s various sections. Now, in case the parent socket had a BPF socket filter attached, then newsk->sk_filter points to the same instance as the original sk->sk_filter. sk_filter_charge() is then called on the newsk->sk_filter to take a reference and should that fail due to hitting max optmem, we bail out and release the newsk instance. The issue is that commit `278571baca` ("net: filter: simplify socket charging") wrongly combined the dismantle path with the failure path of xfrm_sk_clone_policy(). This means, even when charging failed, we call sk_free_unlock_clone() on the newsk, which then still points to the same sk_filter as the original sk. Thus, sk_free_unlock_clone() calls into __sk_destruct() eventually where it tests for present sk_filter and calls sk_filter_uncharge() on it, which potentially lets sk_omem_alloc wrap around and releases the eBPF prog and sk_filter structure from the (still intact) parent. Fix it by making sure that when sk_filter_charge() failed, we reset newsk->sk_filter back to NULL before passing to sk_free_unlock_clone(), so that we don't mess with the parents sk_filter. Only if xfrm_sk_clone_policy() fails, we did reach the point where either the parent's filter was NULL and as a result newsk's as well or where we previously had a successful sk_filter_charge(), thus for that case, we do need sk_filter_uncharge() to release the prior taken reference on sk_filter. Fixes: `278571baca` ("net: filter: simplify socket charging") Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Eric Dumazet	62e85fe590	ipv4: provide stronger user input validation in nl_fib_input() [ Upstream commit `c64c0b3cac` ] Alexander reported a KMSAN splat caused by reads of uninitialized field (tb_id_in) from user provided struct fib_result_nl It turns out nl_fib_input() sanity tests on user input is a bit wrong : User can pretend nlh->nlmsg_len is big enough, but provide at sendmsg() time a too small buffer. Reported-by: Alexander Potapenko <glider@google.com> Signed-off-by: Eric Dumazet <edumazet@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Doug Berger	2f94dd54cf	net: bcmgenet: remove bcmgenet_internal_phy_setup() [ Upstream commit `31739eae73` ] Commit `6ac3ce8295` ("net: bcmgenet: Remove excessive PHY reset") removed the bcmgenet_mii_reset() function from bcmgenet_power_up() and bcmgenet_internal_phy_setup() functions. In so doing it broke the reset of the internal PHY devices used by the GENETv1-GENETv3 which required this reset before the UniMAC was enabled. It also broke the internal GPHY devices used by the GENETv4 because the config_init that installed the AFE workaround was no longer occurring after the reset of the GPHY performed by bcmgenet_phy_power_set() in bcmgenet_internal_phy_setup(). In addition the code in bcmgenet_internal_phy_setup() related to the "enable APD" comment goes with the bcmgenet_mii_reset() so it should have also been removed. Commit `bd4060a610` ("net: bcmgenet: Power on integrated GPHY in bcmgenet_power_up()") moved the bcmgenet_phy_power_set() call to the bcmgenet_power_up() function, but failed to remove it from the bcmgenet_internal_phy_setup() function. Had it done so, the bcmgenet_internal_phy_setup() function would have been empty and could have been removed at that time. Commit `5dbebbb44a` ("net: bcmgenet: Software reset EPHY after power on") was submitted to correct the functional problems introduced by commit `6ac3ce8295` ("net: bcmgenet: Remove excessive PHY reset"). It was included in v4.4 and made available on 4.3-stable. Unfortunately, it didn't fully revert the commit because this bcmgenet_mii_reset() doesn't apply the soft reset to the internal GPHY used by GENETv4 like the previous one did. This prevents the restoration of the AFE work- arounds for internal GPHY devices after the bcmgenet_phy_power_set() in bcmgenet_internal_phy_setup(). This commit takes the alternate approach of removing the unnecessary bcmgenet_internal_phy_setup() function which shouldn't have been in v4.3 so that when bcmgenet_mii_reset() was restored it should have only gone into bcmgenet_power_up(). This will avoid the problems while also removing the redundancy (and hopefully some of the confusion). Fixes: `6ac3ce8295` ("net: bcmgenet: Remove excessive PHY reset") Signed-off-by: Doug Berger <opendmb@gmail.com> Reviewed-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Alexander Potapenko	d80caeb9b8	ipv6: make sure to initialize sockc.tsflags before first use [ Upstream commit `d515684d78` ] In the case udp_sk(sk)->pending is AF_INET6, udpv6_sendmsg() would jump to do_append_data, skipping the initialization of sockc.tsflags. Fix the problem by moving sockc.tsflags initialization earlier. The bug was detected with KMSAN. Fixes: `c14ac9451c` ("sock: enable timestamping using control messages") Signed-off-by: Alexander Potapenko <glider@google.com> Acked-by: Soheil Hassas Yeganeh <soheil@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:22 +02:00
Gal Pressman	c87ef8734d	net/mlx5e: Count LRO packets correctly [ Upstream commit `8ab7e2ae15` ] RX packets statistics ('rx_packets' counter) used to count LRO packets as one, even though it contains multiple segments. This patch will increment the counter by the number of segments, and align the driver with the behavior of other drivers in the stack. Note that no information is lost in this patch due to 'rx_lro_packets' counter existence. Before, ethtool showed: $ ethtool -S ens6 \| egrep "rx_packets\|rx_lro_packets" rx_packets: 435277 rx_lro_packets: 35847 rx_packets_phy: 1935066 Now, we will see the more logical statistics: $ ethtool -S ens6 \| egrep "rx_packets\|rx_lro_packets" rx_packets: 1935066 rx_lro_packets: 35847 rx_packets_phy: 1935066 Fixes: `e586b3b0ba` ("net/mlx5: Ethernet Datapath files") Signed-off-by: Gal Pressman <galp@mellanox.com> Cc: kernel-team@fb.com Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Acked-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Gal Pressman	36bb24fadb	net/mlx5e: Count GSO packets correctly [ Upstream commit `d3a4e4da54` ] TX packets statistics ('tx_packets' counter) used to count GSO packets as one, even though it contains multiple segments. This patch will increment the counter by the number of segments, and align the driver with the behavior of other drivers in the stack. Note that no information is lost in this patch due to 'tx_tso_packets' counter existence. Before, ethtool showed: $ ethtool -S ens6 \| egrep "tx_packets\|tx_tso_packets" tx_packets: 61340 tx_tso_packets: 60954 tx_packets_phy: 2451115 Now, we will see the more logical statistics: $ ethtool -S ens6 \| egrep "tx_packets\|tx_tso_packets" tx_packets: 2451115 tx_tso_packets: 60954 tx_packets_phy: 2451115 Fixes: `e586b3b0ba` ("net/mlx5: Ethernet Datapath files") Signed-off-by: Gal Pressman <galp@mellanox.com> Cc: kernel-team@fb.com Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Acked-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Maor Gottlieb	0ceb7181e8	net/mlx5: Increase number of max QPs in default profile [ Upstream commit `5f40b4ed97` ] With ConnectX-4 sharing SRQs from the same space as QPs, we hit a limit preventing some applications to allocate needed QPs amount. Double the size to 256K. Fixes: `e126ba97db` ('mlx5: Add driver for Mellanox Connect-IB adapters') Signed-off-by: Maor Gottlieb <maorg@mellanox.com> Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Or Gerlitz	38d23e13ba	net/mlx5e: Use the proper UAPI values when offloading TC vlan actions [ Upstream commit `09c91ddf2c` ] Currently we use the non UAPI values and we miss erring on the modify action which is not supported, fix that. Fixes: `8b32580df1` ('net/mlx5e: Add TC vlan action for SRIOV offloads') Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reported-by: Petr Machata <petrm@mellanox.com> Reviewed-by: Jiri Pirko <jiri@mellanox.com> Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Or Gerlitz	f375f73c72	net/mlx5: Add missing entries for set/query rate limit commands [ Upstream commit `1f30a86c58` ] The switch cases for the rate limit set and query commands were missing, which could get us wrong under fw error or driver reset flow, fix that. Fixes: `1466cc5b23` ('net/mlx5: Rate limit tables support') Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-by: Hadar Hen Zion <hadarh@mellanox.com> Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
David Ahern	4e7c821b68	net: vrf: Reset rt6i_idev in local dst after put [ Upstream commit `3dc857f0e8` ] The VRF driver takes a reference to the inet6_dev on the VRF device for its rt6_local dst when handling local traffic through the VRF device as a loopback. When the device is deleted the driver does a put on the idev but does not reset rt6i_idev in the rt6_info struct. When the dst is destroyed, dst_destroy calls ip6_dst_destroy which does a second put for what is essentially the same reference causing it to be prematurely freed. Reset rt6i_idev after the put in the vrf driver. Fixes: `b4869aa2f8` ("net: vrf: ipv6 support for local traffic to local addresses") Signed-off-by: David Ahern <dsa@cumulusnetworks.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Bjørn Mork	4b9b7f17ba	qmi_wwan: add Dell DW5811e [ Upstream commit `6bd845d1cf` ] This is a Dell branded Sierra Wireless EM7455. It is operating in MBIM mode by default, but can be configured to provide two QMI/RMNET functions. Signed-off-by: Bjørn Mork <bjorn@mork.no> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Andrey Ulanov	91ad0c0885	net: unix: properly re-increment inflight counter of GC discarded candidates [ Upstream commit `7df9c24625` ] Dmitry has reported that a BUG_ON() condition in unix_notinflight() may be triggered by a simple code that forwards unix socket in an SCM_RIGHTS message. That is caused by incorrect unix socket GC implementation in unix_gc(). The GC first collects list of candidates, then (a) decrements their "children's" inflight counter, (b) checks which inflight counters are now 0, and then (c) increments all inflight counters back. (a) and (c) are done by calling scan_children() with inc_inflight or dec_inflight as the second argument. Commit `6209344f5a` ("net: unix: fix inflight counting bug in garbage collector") changed scan_children() such that it no longer considers sockets that do not have UNIX_GC_CANDIDATE flag. It also added a block of code that that unsets this flag _before_ invoking scan_children(, dec_iflight, ). This may lead to incorrect inflight counters for some sockets. This change fixes this bug by changing order of operations: UNIX_GC_CANDIDATE is now unset only after all inflight counters are restored to the original state. kernel BUG at net/unix/garbage.c:149! RIP: 0010:[<ffffffff8717ebf4>] [<ffffffff8717ebf4>] unix_notinflight+0x3b4/0x490 net/unix/garbage.c:149 Call Trace: [<ffffffff8716cfbf>] unix_detach_fds.isra.19+0xff/0x170 net/unix/af_unix.c:1487 [<ffffffff8716f6a9>] unix_destruct_scm+0xf9/0x210 net/unix/af_unix.c:1496 [<ffffffff86a90a01>] skb_release_head_state+0x101/0x200 net/core/skbuff.c:655 [<ffffffff86a9808a>] skb_release_all+0x1a/0x60 net/core/skbuff.c:668 [<ffffffff86a980ea>] __kfree_skb+0x1a/0x30 net/core/skbuff.c:684 [<ffffffff86a98284>] kfree_skb+0x184/0x570 net/core/skbuff.c:705 [<ffffffff871789d5>] unix_release_sock+0x5b5/0xbd0 net/unix/af_unix.c:559 [<ffffffff87179039>] unix_release+0x49/0x90 net/unix/af_unix.c:836 [<ffffffff86a694b2>] sock_release+0x92/0x1f0 net/socket.c:570 [<ffffffff86a6962b>] sock_close+0x1b/0x20 net/socket.c:1017 [<ffffffff81a76b8e>] __fput+0x34e/0x910 fs/file_table.c:208 [<ffffffff81a771da>] ____fput+0x1a/0x20 fs/file_table.c:244 [<ffffffff81483ab0>] task_work_run+0x1a0/0x280 kernel/task_work.c:116 [< inline >] exit_task_work include/linux/task_work.h:21 [<ffffffff8141287a>] do_exit+0x183a/0x2640 kernel/exit.c:828 [<ffffffff8141383e>] do_group_exit+0x14e/0x420 kernel/exit.c:931 [<ffffffff814429d3>] get_signal+0x663/0x1880 kernel/signal.c:2307 [<ffffffff81239b45>] do_signal+0xc5/0x2190 arch/x86/kernel/signal.c:807 [<ffffffff8100666a>] exit_to_usermode_loop+0x1ea/0x2d0 arch/x86/entry/common.c:156 [< inline >] prepare_exit_to_usermode arch/x86/entry/common.c:190 [<ffffffff81009693>] syscall_return_slowpath+0x4d3/0x570 arch/x86/entry/common.c:259 [<ffffffff881478e6>] entry_SYSCALL_64_fastpath+0xc4/0xc6 Link: https://lkml.org/lkml/2017/3/6/252 Signed-off-by: Andrey Ulanov <andreyu@google.com> Reported-by: Dmitry Vyukov <dvyukov@google.com> Fixes: `6209344` ("net: unix: fix inflight counting bug in garbage collector") Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Kris Murphy	ef7c1e297d	openvswitch: Add missing case OVS_TUNNEL_KEY_ATTR_PAD [ Upstream commit `8f3dbfd79e` ] Added a case for OVS_TUNNEL_KEY_ATTR_PAD to the switch statement in ip_tun_from_nlattr in order to prevent the default case returning an error. Fixes: `b46f6ded90` ("libnl: nla_put_be64(): align on a 64-bit area") Signed-off-by: Kris Murphy <kriskend@linux.vnet.ibm.com> Acked-by: Joe Stringer <joe@ovn.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Lendacky, Thomas	fafc9555d8	amd-xgbe: Fix jumbo MTU processing on newer hardware [ Upstream commit `622c36f143` ] Newer hardware does not provide a cumulative payload length when multiple descriptors are needed to handle the data. Once the MTU increases beyond the size that can be handled by a single descriptor, the SKB does not get built properly by the driver. The driver will now calculate the size of the data buffers used by the hardware. The first buffer of the first descriptor is for packet headers or packet headers and data when the headers can't be split. Subsequent descriptors in a multi-descriptor chain will not use the first buffer. The second buffer is used by all the descriptors in the chain for payload data. Based on whether the driver is processing the first, intermediate, or last descriptor it can calculate the buffer usage and build the SKB properly. Tested and verified on both old and new hardware. Signed-off-by: Tom Lendacky <thomas.lendacky@amd.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Eric Dumazet	e9c1b1ab12	net: properly release sk_frag.page [ Upstream commit `22a0e18eac` ] I mistakenly added the code to release sk->sk_frag in sk_common_release() instead of sk_destruct() TCP sockets using sk->sk_allocation == GFP_ATOMIC do no call sk_common_release() at close time, thus leaking one (order-3) page. iSCSI is using such sockets. Fixes: `5640f76858` ("net: use a per task frag allocator") Signed-off-by: Eric Dumazet <edumazet@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:21 +02:00
Florian Fainelli	4d5bc781eb	net: bcmgenet: Do not suspend PHY if Wake-on-LAN is enabled [ Upstream commit `5371bbf4b2` ] Suspending the PHY would be putting it in a low power state where it may no longer allow us to do Wake-on-LAN. Fixes: `cc013fb488` ("net: bcmgenet: correctly suspend and resume PHY device") Signed-off-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:20 +02:00
Or Gerlitz	03e9554ac4	net/openvswitch: Set the ipv6 source tunnel key address attribute correctly [ Upstream commit `3d20f1f7bd` ] When dealing with ipv6 source tunnel key address attribute (OVS_TUNNEL_KEY_ATTR_IPV6_SRC) we are wrongly setting the tunnel dst ip, fix that. Fixes: `6b26ba3a7d` ('openvswitch: netlink attributes for IPv6 tunneling') Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reported-by: Paul Blakey <paulb@mellanox.com> Acked-by: Jiri Benc <jbenc@redhat.com> Acked-by: Joe Stringer <joe@ovn.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-30 09:41:20 +02:00
Greg Kroah-Hartman	2a486264b7	Linux 4.9.18	2017-03-26 13:06:11 +02:00
Theodore Ts'o	c2eeabe941	ext4: fix fencepost in s_first_meta_bg validation commit `2ba3e6e8af` upstream. It is OK for s_first_meta_bg to be equal to the number of block group descriptor blocks. (It rarely happens, but it shouldn't cause any problems.) https://bugzilla.kernel.org/show_bug.cgi?id=194567 Fixes: `3a4b77cd47` Signed-off-by: Theodore Ts'o <tytso@mit.edu> Cc: Jiri Slaby <jslaby@suse.cz> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Alex Deucher	dea2f1e0c5	drm/amdgpu/si: add dpm quirk for Oland commit `18a8de1bc3` upstream. OLAND 0x1002:0x6604 0x1028:0x066F 0x00 seems to have problems with higher sclks. Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Tejun Heo	228514bf24	cgroup/pids: remove spurious suspicious RCU usage warning commit `1d18c2747f` upstream. pids_can_fork() is special in that the css association is guaranteed to be stable throughout the function and thus doesn't need RCU protection around task_css access. When determining the css to charge the pid, task_css_check() is used to override the RCU sanity check. While adding a warning message on fork rejection from pids limit, `135b8b37bd` ("cgroup: Add pids controller event when fork fails because of pid limit") incorrectly added a task_css access which is neither RCU protected or explicitly annotated. This triggers the following suspicious RCU usage warning when RCU debugging is enabled. cgroup: fork rejected by pids controller in =============================== [ ERR: suspicious RCU usage. ] 4.10.0-work+ #1 Not tainted ------------------------------- ./include/linux/cgroup.h:435 suspicious rcu_dereference_check() usage! other info that might help us debug this: rcu_scheduler_active = 2, debug_locks = 0 1 lock held by bash/1748: #0: (&cgroup_threadgroup_rwsem){+++++.}, at: [<ffffffff81052c96>] _do_fork+0xe6/0x6e0 stack backtrace: CPU: 3 PID: 1748 Comm: bash Not tainted 4.10.0-work+ #1 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.9.3-1.fc25 04/01/2014 Call Trace: dump_stack+0x68/0x93 lockdep_rcu_suspicious+0xd7/0x110 pids_can_fork+0x1c7/0x1d0 cgroup_can_fork+0x67/0xc0 copy_process.part.58+0x1709/0x1e90 _do_fork+0xe6/0x6e0 SyS_clone+0x19/0x20 do_syscall_64+0x5c/0x140 entry_SYSCALL64_slow_path+0x25/0x25 RIP: 0033:0x7f7853fab93a RSP: 002b:00007ffc12d05c90 EFLAGS: 00000246 ORIG_RAX: 0000000000000038 RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f7853fab93a RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000001200011 RBP: 00007ffc12d05cc0 R08: 0000000000000000 R09: 00007f78548db700 R10: 00007f78548db9d0 R11: 0000000000000246 R12: 00000000000006d4 R13: 0000000000000001 R14: 0000000000000000 R15: 000055e3ebe2c04d /asdf There's no reason to dereference task_css again here when the associated css is already available. Fix it by replacing the task_cgroup() call with css->cgroup. Signed-off-by: Tejun Heo <tj@kernel.org> Reported-by: Mike Galbraith <efault@gmx.de> Fixes: `135b8b37bd` ("cgroup: Add pids controller event when fork fails because of pid limit") Cc: Kenny Yu <kennyyu@fb.com> Signed-off-by: Tejun Heo <tj@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Tahsin Erdogan	3f406ecddf	percpu: acquire pcpu_lock when updating pcpu_nr_empty_pop_pages commit `320661b08d` upstream. Update to pcpu_nr_empty_pop_pages in pcpu_alloc() is currently done without holding pcpu_lock. This can lead to bad updates to the variable. Add missing lock calls. Fixes: `b539b87fed` ("percpu: implmeent pcpu_nr_empty_pop_pages and chunk->nr_populated") Signed-off-by: Tahsin Erdogan <tahsin@google.com> Signed-off-by: Tejun Heo <tj@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Andreas Gruenbacher	a33e71c5f0	gfs2: Avoid alignment hole in struct lm_lockname commit `28ea06c46f` upstream. Commit `88ffbf3e03` switches to using rhashtables for glocks, hashing over the entire struct lm_lockname instead of its individual fields. On some architectures, struct lm_lockname contains a hole of uninitialized memory due to alignment rules, which now leads to incorrect hash values. Get rid of that hole. Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com> Signed-off-by: Bob Peterson <rpeterso@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Johan Hovold	c795d8780c	isdn/gigaset: fix NULL-deref at probe commit `68c32f9c2a` upstream. Make sure to check the number of endpoints to avoid dereferencing a NULL-pointer should a malicious device lack endpoints. Fixes: `cf7776dc05` ("[PATCH] isdn4linux: Siemens Gigaset drivers - direct USB connection") Cc: Hansjoerg Lipp <hjlipp@web.de> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Max Lohrmann	033850b953	target: Fix VERIFY_16 handling in sbc_parse_cdb commit `13603685c1` upstream. As reported by Max, the Windows 2008 R2 chkdsk utility expects VERIFY_16 to be supported, and does not handle the returned CHECK_CONDITION properly, resulting in an infinite loop. The kernel will log huge amounts of this error: kernel: TARGET_CORE[iSCSI]: Unsupported SCSI Opcode 0x8f, sending CHECK_CONDITION. Signed-off-by: Max Lohrmann <post@wickenrode.com> Signed-off-by: Nicholas Bellinger <nab@linux-iscsi.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Chris Leech	de1ff848c7	scsi: libiscsi: add lock around task lists to fix list corruption regression commit `6f8830f5bb` upstream. There's a rather long standing regression from the commit "libiscsi: Reduce locking contention in fast path" Depending on iSCSI target behavior, it's possible to hit the case in iscsi_complete_task where the task is still on a pending list (!list_empty(&task->running)). When that happens the task is removed from the list while holding the session back_lock, but other task list modification occur under the frwd_lock. That leads to linked list corruption and eventually a panicked system. Rather than back out the session lock split entirely, in order to try and keep some of the performance gains this patch adds another lock to maintain the task lists integrity. Major enterprise supported kernels have been backing out the lock split for while now, thanks to the efforts at IBM where a lab setup has the most reliable reproducer I've seen on this issue. This patch has been tested there successfully. Signed-off-by: Chris Leech <cleech@redhat.com> Fixes: `659743b02c` ("[SCSI] libiscsi: Reduce locking contention in fast path") Reported-by: Prashantha Subbarao <psubbara@us.ibm.com> Reviewed-by: Guilherme G. Piccoli <gpiccoli@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Anton Blanchard	42ba2c265b	scsi: lpfc: Add shutdown method for kexec commit `85e8a23936` upstream. We see lpfc devices regularly fail during kexec. Fix this by adding a shutdown method which mirrors the remove method. Signed-off-by: Anton Blanchard <anton@samba.org> Reviewed-by: Mauricio Faria de Oliveira <mauricfo@linux.vnet.ibm.com> Tested-by: Mauricio Faria de Oliveira <mauricfo@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:58 +02:00
Nicholas Bellinger	f8e24eab9b	target/pscsi: Fix TYPE_TAPE + TYPE_MEDIMUM_CHANGER export commit `a04e54f2c3` upstream. The following fixes a divide by zero OOPs with TYPE_TAPE due to pscsi_tape_read_blocksize() failing causing a zero sd->sector_size being propigated up via dev_attrib.hw_block_size. It also fixes another long-standing bug where TYPE_TAPE and TYPE_MEDIMUM_CHANGER where using pscsi_create_type_other(), which does not call scsi_device_get() to take the device reference. Instead, rename pscsi_create_type_rom() to pscsi_create_type_nondisk() and use it for all cases. Finally, also drop a dump_stack() in pscsi_get_blocks() for non TYPE_DISK, which in modern target-core can get invoked via target_sense_desc_format() during CHECK_CONDITION. Reported-by: Malcolm Haak <insanemal@gmail.com> Signed-off-by: Nicholas Bellinger <nab@linux-iscsi.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Shaohua Li	ad5166415f	md/raid1/10: fix potential deadlock commit `61eb2b43b9` upstream. Neil Brown pointed out a potential deadlock in raid 10 code with bio_split/chain. The raid1 code could have the same issue, but recent barrier rework makes it less likely to happen. The deadlock happens in below sequence: 1. generic_make_request(bio), this will set current->bio_list 2. raid10_make_request will split bio to bio1 and bio2 3. __make_request(bio1), wait_barrer, add underlayer disk bio to current->bio_list 4. __make_request(bio2), wait_barrer If raise_barrier happens between 3 & 4, since wait_barrier runs at 3, raise_barrier waits for IO completion from 3. And since raise_barrier sets barrier, 4 waits for raise_barrier. But IO from 3 can't be dispatched because raid10_make_request() doesn't finished yet. The solution is to adjust the IO ordering. Quotes from Neil: " It is much safer to: if (need to split) { split = bio_split(bio, ...) bio_chain(...) make_request_fn(split); generic_make_request(bio); } else make_request_fn(mddev, bio); This way we first process the initial section of the bio (in 'split') which will queue some requests to the underlying devices. These requests will be queued in generic_make_request. Then we queue the remainder of the bio, which will be added to the end of the generic_make_request queue. Then we return. generic_make_request() will pop the lower-level device requests off the queue and handle them first. Then it will process the remainder of the original bio once the first section has been fully processed. " Note, this only happens in read path. In write path, the bio is flushed to underlaying disks either by blk flush (from schedule) or offladed to raid1/10d. It's queued in current->bio_list. Cc: Coly Li <colyli@suse.de> Suggested-by: NeilBrown <neilb@suse.com> Reviewed-by: Jack Wang <jinpu.wang@profitbricks.com> Signed-off-by: Shaohua Li <shli@fb.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Michael Ellerman	4265e0b487	powerpc/boot: Fix zImage TOC alignment commit `97ee351b50` upstream. Recent toolchains force the TOC to be 256 byte aligned. We need to enforce this alignment in the zImage linker script, otherwise pointers to our TOC variables (__toc_start) could be incorrect. If the actual start of the TOC and __toc_start don't have the same value we crash early in the zImage wrapper. Suggested-by: Alan Modra <amodra@gmail.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Rafael J. Wysocki	bb8c61ad78	cpufreq: Fix and clean up show_cpuinfo_cur_freq() commit `9b4f603e7a` upstream. There is a missing newline in show_cpuinfo_cur_freq(), so add it, but while at it clean that function up somewhat too. Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Olga Kornievskaia	b76d4fb2d9	NFS prevent double free in async nfs4_exchange_id commit `63513232f8` upstream. Since rpc_task is async, the release function should be called which will free the impl_id, scope, and owner. Trond pointed at 2 more problems: -- use of client pointer after free in the nfs4_exchangeid_release() function -- cl_count mismatch if rpc_run_task() isn't run Fixes: `8d89bd70bc` ("NFS setup async exchange_id") Signed-off-by: Olga Kornievskaia <kolga@netapp.com> Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Chuck Lever	87144ec250	xprtrdma: Squelch kbuild sparse complaint commit `eed50879d6` upstream. New complaint from kbuild for 4.9.y: net/sunrpc/xprtrdma/verbs.c:489:19: sparse: incompatible types in comparison expression (different type sizes) verbs.c: 489 max_sge = min(ia->ri_device->attrs.max_sge, RPCRDMA_MAX_SEND_SGES); I can't reproduce this running sparse here. Likewise, "make W=1 net/sunrpc/xprtrdma/verbs.o" never indicated any issue. A little poking suggests that because the range of its values is small, gcc can make the actual width of RPCRDMA_MAX_SEND_SGES smaller than the width of an unsigned integer. Fixes: `16f906d66c` ("xprtrdma: Reduce required number of send SGEs") Signed-off-by: Chuck Lever <chuck.lever@oracle.com> Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Peter Zijlstra	69efd8e212	perf/core: Fix event inheritance on fork() commit `e7cc4865f0` upstream. While hunting for clues to a use-after-free, Oleg spotted that perf_event_init_context() can loose an error value with the result that fork() can succeed even though we did not fully inherit the perf event context. Spotted-by: Oleg Nesterov <oleg@redhat.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com> Cc: Arnaldo Carvalho de Melo <acme@kernel.org> Cc: Arnaldo Carvalho de Melo <acme@redhat.com> Cc: Dmitry Vyukov <dvyukov@google.com> Cc: Frederic Weisbecker <fweisbec@gmail.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Stephane Eranian <eranian@google.com> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Vince Weaver <vincent.weaver@maine.edu> Cc: oleg@redhat.com Fixes: `889ff01506` ("perf/core: Split context's event group list into pinned and non-pinned lists") Link: http://lkml.kernel.org/r/20170316125823.190342547@infradead.org Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Peter Zijlstra	c04a938229	perf/core: Fix use-after-free in perf_release() commit `e552a8389a` upstream. Dmitry reported syzcaller tripped a use-after-free in perf_release(). After much puzzlement Oleg spotted the below scenario: Task1 Task2 fork() perf_event_init_task() /* ... / goto bad_fork_$foo; / ... / perf_event_free_task() mutex_lock(ctx->lock) perf_free_event(B) perf_event_release_kernel(A) mutex_lock(A->child_mutex) list_for_each_entry(child, ...) { / child == B / ctx = B->ctx; get_ctx(ctx); mutex_unlock(A->child_mutex); mutex_lock(A->child_mutex) list_del_init(B->child_list) mutex_unlock(A->child_mutex) / ... / mutex_unlock(ctx->lock); put_ctx() / >0 / free_task(); mutex_lock(ctx->lock); mutex_lock(A->child_mutex); / ... / mutex_unlock(A->child_mutex); mutex_unlock(ctx->lock) put_ctx() / 0 / ctx->task && !TOMBSTONE put_task_struct() / UAF */ This patch closes the hole by making perf_event_free_task() destroy the task <-> ctx relation such that perf_event_release_kernel() will no longer observe the now dead task. Spotted-by: Oleg Nesterov <oleg@redhat.com> Reported-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com> Cc: Arnaldo Carvalho de Melo <acme@kernel.org> Cc: Arnaldo Carvalho de Melo <acme@redhat.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Stephane Eranian <eranian@google.com> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Vince Weaver <vincent.weaver@maine.edu> Cc: fweisbec@gmail.com Cc: oleg@redhat.com Fixes: `c6e5b73242` ("perf: Synchronously clean up child events") Link: http://lkml.kernel.org/r/20170314155949.GE32474@worktop Link: http://lkml.kernel.org/r/20170316125823.140295131@infradead.org Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Helge Deller	13695ce5b1	parisc: Fix system shutdown halt commit `73580dac76` upstream. On those parisc machines which don't provide a software power off function, the system currently kills the init process at the end of a shutdown and unexpectedly restarts insteads of halting. Fix it by adding a loop which will not return. Signed-off-by: Helge Deller <deller@gmx.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
John David Anglin	a690a42ae7	parisc: Optimize flush_kernel_vmap_range and invalidate_kernel_vmap_range commit `316ec0624f` upstream. The previously submitted patch did not resolve the random segmentation faults observed on the phantom buildd system. There are still unresolved problems with the Debian 4.8 and 4.9 kernels on C8000. The attached patch removes the flush of the offset map pages and does a whole data cache flush for large ranges. No other arch flushes the offset map in these routines as far as I can tell. I have not observed any random segmentation faults on rp3440 in two weeks of testing with 4.10.0 and 4.10.1. Signed-off-by: John David Anglin <dave.anglin@bell.net> Signed-off-by: Helge Deller <deller@gmx.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:57 +02:00
Quinn Tran	66e70bdca5	qla2xxx: Fix request queue corruption. commit `8b666809e1` upstream. When FW notify driver or driver detects low FW resource, driver tries to send out Busy SCSI Status to tell Initiator side to back off. During the send process, the lock was not held. Signed-off-by: Quinn Tran <quinn.tran@qlogic.com> Signed-off-by: Himanshu Madhani <himanshu.madhani@cavium.com> Signed-off-by: Nicholas Bellinger <nab@linux-iscsi.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:56 +02:00
Quinn Tran	c5ad350d61	qla2xxx: Fix memory leak for abts processing commit `ae940f2c47` upstream. Signed-off-by: Quinn Tran <quinn.tran@cavium.com> Signed-off-by: Himanshu Madhani <himanshu.madhani@cavium.com> Signed-off-by: Nicholas Bellinger <nab@linux-iscsi.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:56 +02:00
Linus Torvalds	85f687708c	give up on gcc ilog2() constant optimizations commit `474c90156c` upstream. gcc-7 has an "optimization" pass that completely screws up, and generates the code expansion for the (impossible) case of calling ilog2() with a zero constant, even when the code gcc compiles does not actually have a zero constant. And we try to generate a compile-time error for anybody doing ilog2() on a constant where that doesn't make sense (be it zero or negative). So now gcc7 will fail the build due to our sanity checking, because it created that constant-zero case that didn't actually exist in the source code. There's a whole long discussion on the kernel mailing about how to work around this gcc bug. The gcc people themselevs have discussed their "feature" in https://gcc.gnu.org/bugzilla/show_bug.cgi?id=72785 but it's all water under the bridge, because while it looked at one point like it would be solved by the time gcc7 was released, that was not to be. So now we have to deal with this compiler braindamage. And the only simple approach seems to be to just delete the code that tries to warn about bad uses of ilog2(). So now "ilog2()" will just return 0 not just for the value 1, but for any non-positive value too. It's not like I can recall anybody having ever actually tried to use this function on any invalid value, but maybe the sanity check just meant that such code never made it out in public. Reported-by: Laura Abbott <labbott@redhat.com> Cc: John Stultz <john.stultz@linaro.org>, Cc: Thomas Gleixner <tglx@linutronix.de> Cc: Ard Biesheuvel <ard.biesheuvel@linaro.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Cc: Jiri Slaby <jslaby@suse.cz> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:56 +02:00
Eric Anholt	2bbcbc2447	drm/vc4: Use runtime autosuspend to avoid thrashing V3D power state. commit `3a62234680` upstream. The pm_runtime_put() we were using immediately released power on the device, which meant that we were generally turning the device off and on once per frame. In many profiles I've looked at, that added up to about 1% of CPU time, but this could get worse in the case of frequent rendering and readback (as may happen in X rendering). By keeping the device on until we've been idle for a couple of frames, we drop the overhead of runtime PM down to sub-.1%. Signed-off-by: Eric Anholt <eric@anholt.net> Cc: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:56 +02:00
Eric Anholt	dd0b0e22a5	drm/vc4: Fix termination of the initial scan for branch targets. commit `457e67a728` upstream. The loop is scanning until the original max_ip (size of the BO), but we want to not examine any code after the PROG_END's delay slots. There was a block trying to do that, except that we had some early continue statements if the signal wasn't a PROG_END or a BRANCH. The failure mode would be that a valid shader is rejected because some undefined memory after the PROG_END slots is parsed as a branch and the rest of its setup is illegal. I haven't seen this in the wild, but valgrind was complaining when about this up in the userland simulator mode. Signed-off-by: Eric Anholt <eric@anholt.net> Cc: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-26 13:05:56 +02:00
Greg Kroah-Hartman	c3825da143	Linux 4.9.17	2017-03-22 12:44:07 +01:00
Daniel Axtens	ecdc5b12d7	crypto: powerpc - Fix initialisation of crc32c context commit `aa2be9b3d6` upstream. Turning on crypto self-tests on a POWER8 shows: alg: hash: Test 1 failed for crc32c-vpmsum 00000000: ff ff ff ff Comparing the code with the Intel CRC32c implementation on which ours is based shows that we are doing an init with 0, not ~0 as CRC32c requires. This probably wasn't caught because btrfs does its own weird open-coded initialisation. Initialise our internal context to ~0 on init. This makes the self-tests pass, and btrfs continues to work. Fixes: `6dd7a82cc5` ("crypto: powerpc - Add POWER8 optimised crc32c") Cc: Anton Blanchard <anton@samba.org> Signed-off-by: Daniel Axtens <dja@axtens.net> Acked-by: Anton Blanchard <anton@samba.org> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:40 +01:00
Niklas Cassel	c406096522	locking/rwsem: Fix down_write_killable() for CONFIG_RWSEM_GENERIC_SPINLOCK=y commit `17fcbd590d` upstream. We hang if SIGKILL has been sent, but the task is stuck in down_read() (after do_exit()), even though no task is doing down_write() on the rwsem in question: INFO: task libupnp:21868 blocked for more than 120 seconds. libupnp D 0 21868 1 0x08100008 ... Call Trace: __schedule() schedule() __down_read() do_exit() do_group_exit() __wake_up_parent() This bug has already been fixed for CONFIG_RWSEM_XCHGADD_ALGORITHM=y in the following commit: `04cafed7fc` ("locking/rwsem: Fix down_write_killable()") ... however, this bug also exists for CONFIG_RWSEM_GENERIC_SPINLOCK=y. Signed-off-by: Niklas Cassel <niklas.cassel@axis.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: <mhocko@suse.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Niklas Cassel <niklass@axis.com> Cc: Paul E. McKenney <paulmck@linux.vnet.ibm.com> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Fixes: `d47996082f` ("locking/rwsem: Introduce basis for down_write_killable()") Link: http://lkml.kernel.org/r/1487981873-12649-1-git-send-email-niklass@axis.com Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:40 +01:00
Peter Zijlstra	1522181f4b	futex: Add missing error handling to FUTEX_REQUEUE_PI commit `9bbb25afeb` upstream. Thomas spotted that fixup_pi_state_owner() can return errors and we fail to unlock the rt_mutex in that case. Reported-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Darren Hart <dvhart@linux.intel.com> Cc: juri.lelli@arm.com Cc: bigeasy@linutronix.de Cc: xlpang@redhat.com Cc: rostedt@goodmis.org Cc: mathieu.desnoyers@efficios.com Cc: jdesfossez@efficios.com Cc: dvhart@infradead.org Cc: bristot@redhat.com Link: http://lkml.kernel.org/r/20170304093558.867401760@infradead.org Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:40 +01:00
Peter Zijlstra	6244ffc5a1	futex: Fix potential use-after-free in FUTEX_REQUEUE_PI commit `c236c8e95a` upstream. While working on the futex code, I stumbled over this potential use-after-free scenario. Dmitry triggered it later with syzkaller. pi_mutex is a pointer into pi_state, which we drop the reference on in unqueue_me_pi(). So any access to that pointer after that is bad. Since other sites already do rt_mutex_unlock() with hb->lock held, see for example futex_lock_pi(), simply move the unlock before unqueue_me_pi(). Reported-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Reviewed-by: Darren Hart <dvhart@linux.intel.com> Cc: juri.lelli@arm.com Cc: bigeasy@linutronix.de Cc: xlpang@redhat.com Cc: rostedt@goodmis.org Cc: mathieu.desnoyers@efficios.com Cc: jdesfossez@efficios.com Cc: dvhart@infradead.org Cc: bristot@redhat.com Link: http://lkml.kernel.org/r/20170304093558.801744246@infradead.org Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:40 +01:00
Andy Lutomirski	9f9115b67a	x86/perf: Fix CR4.PCE propagation to use active_mm instead of mm commit `5dc855d44c` upstream. If one thread mmaps a perf event while another thread in the same mm is in some context where active_mm != mm (which can happen in the scheduler, for example), refresh_pce() would write the wrong value to CR4.PCE. This broke some PAPI tests. Reported-and-tested-by: Vince Weaver <vincent.weaver@maine.edu> Signed-off-by: Andy Lutomirski <luto@kernel.org> Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com> Cc: Arnaldo Carvalho de Melo <acme@redhat.com> Cc: Borislav Petkov <bpetkov@suse.de> Cc: H. Peter Anvin <hpa@zytor.com> Cc: Jiri Olsa <jolsa@redhat.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Stephane Eranian <eranian@google.com> Cc: Thomas Gleixner <tglx@linutronix.de> Fixes: `7911d3f7af` ("perf/x86: Only allow rdpmc if a perf_event is mapped") Link: http://lkml.kernel.org/r/0c5b38a76ea50e405f9abe07a13dfaef87c173a1.1489694270.git.luto@kernel.org Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:40 +01:00
Andrey Ryabinin	24ba2842a4	x86/kasan: Fix boot with KASAN=y and PROFILE_ANNOTATED_BRANCHES=y commit `be3606ff73` upstream. The kernel doesn't boot with both PROFILE_ANNOTATED_BRANCHES=y and KASAN=y options selected. With branch profiling enabled we end up calling ftrace_likely_update() before kasan_early_init(). ftrace_likely_update() is built with KASAN instrumentation, so calling it before kasan has been initialized leads to crash. Use DISABLE_BRANCH_PROFILING define to make sure that we don't call ftrace_likely_update() from early code before kasan_early_init(). Fixes: `ef7f0d6a6c` ("x86_64: add KASan support") Reported-by: Fengguang Wu <fengguang.wu@intel.com> Signed-off-by: Andrey Ryabinin <aryabinin@virtuozzo.com> Cc: kasan-dev@googlegroups.com Cc: Alexander Potapenko <glider@google.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: lkp@01.org Cc: Dmitry Vyukov <dvyukov@google.com> Link: http://lkml.kernel.org/r/20170313163337.1704-1-aryabinin@virtuozzo.com Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:40 +01:00
Peter Zijlstra	5ec98e6ffd	x86/tsc: Fix ART for TSC_KNOWN_FREQ commit `44fee88cea` upstream. Subhransu reported that convert_art_to_tsc() isn't working for him. The ART to TSC relation is only set up for systems which use the refined TSC calibration. Systems with known TSC frequency (available via CPUID 15) are not using the refined calibration and therefor the ART to TSC relation is never established. Add the setup to the known frequency init path which skips ART calibration. The init code needs to be duplicated as for systems which use refined calibration the ART setup must be delayed until calibration has been done. The problem has been there since the ART support was introdduced, but only detected now because Subhransu tested the first time on hardware which has TSC frequency enumerated via CPUID 15. Note for stable: The conditional has changed from TSC_RELIABLE to TSC_KNOWN_FREQUENCY. [ tglx: Rewrote changelog and identified the proper 'Fixes' commit ] Fixes: `f9677e0f83` ("x86/tsc: Always Running Timer (ART) correlated clocksource") Reported-by: "Prusty, Subhransu S" <subhransu.s.prusty@intel.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: stable@vger.kernel.org Cc: christopher.s.hall@intel.com Cc: kevin.b.stanton@intel.com Cc: john.stultz@linaro.org Cc: akataria@vmware.com Link: http://lkml.kernel.org/r/20170313145712.GI3312@twins.programming.kicks-ass.net Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Shanker Donthineni	095635be80	irqchip/gicv3-its: Add workaround for QDF2400 ITS erratum 0065 commit `90922a2d03` upstream. On Qualcomm Datacenter Technologies QDF2400 SoCs, the ITS hardware implementation uses 16Bytes for Interrupt Translation Entry (ITE), but reports an incorrect value of 8Bytes in GITS_TYPER.ITTE_size. It might cause kernel memory corruption depending on the number of MSI(x) that are configured and the amount of memory that has been allocated for ITEs in its_create_device(). This patch fixes the potential memory corruption by setting the correct ITE size to 16Bytes. Cc: stable@vger.kernel.org Signed-off-by: Shanker Donthineni <shankerd@codeaurora.org> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Marc Zyngier	61e79860b4	arm64: KVM: VHE: Clear HCR_TGE when invalidating guest TLBs commit `6892517629` upstream. When invalidating guest TLBs, special care must be taken to actually shoot the guest TLBs and not the host ones if we're running on a VHE system. This is controlled by the HCR_EL2.TGE bit, which we forget to clear before invalidating TLBs. Address the issue by introducing two wrappers (__tlb_switch_to_guest and __tlb_switch_to_host) that take care of both the VTTBR_EL2 and HCR_EL2.TGE switching. Reported-by: Tomasz Nowicki <tnowicki@caviumnetworks.com> Tested-by: Tomasz Nowicki <tnowicki@caviumnetworks.com> Reviewed-by: Christoffer Dall <cdall@linaro.org> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Boris Brezillon	8ca7ef0d9a	drm/vc4: Fix ->clock_select setting for the VEC encoder commit `ab8df60e3a` upstream. PV_CONTROL_CLK_SELECT_VEC is actually 2 and not 0. Fix the definition and rework the vc4_set_crtc_possible_masks() to cover the full range of the PV_CONTROL_CLK_SELECT field. Signed-off-by: Boris Brezillon <boris.brezillon@free-electrons.com> Signed-off-by: Eric Anholt <eric@anholt.net> Cc: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Derek Foreman	6b33067067	drm/vc4: Fix race between page flip completion event and clean-up commit `26fc78f6fe` upstream. There was a small window where a userspace program could submit a pageflip after receiving a pageflip completion event yet still receive EBUSY. Signed-off-by: Derek Foreman <derekf@osg.samsung.com> Signed-off-by: Eric Anholt <eric@anholt.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Daniel Stone <daniels@collabora.com> Cc: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Boris Brezillon	f4d40cfd61	clk: bcm2835: Fix ->fixed_divider of pllh_aux commit `f2a46926ab` upstream. There is no fixed divider on pllh_aux. Signed-off-by: Boris Brezillon <boris.brezillon@free-electrons.com> Signed-off-by: Eric Anholt <eric@anholt.net> Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Stephen Boyd <sboyd@codeaurora.org> Cc: Amit Pundir <amit.pundir@linaro.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Michael Ellerman	7885195b91	powerpc/mm: Fix build break when CMA=n && SPAPR_TCE_IOMMU=y [ Upstream commit `a05ef161cd` ] Currently the build breaks if CMA=n and SPAPR_TCE_IOMMU=y: arch/powerpc/mm/mmu_context_iommu.c: In function ‘mm_iommu_get’: arch/powerpc/mm/mmu_context_iommu.c:193:42: error: ‘MIGRATE_CMA’ undeclared (first use in this function) if (get_pageblock_migratetype(page) == MIGRATE_CMA) { ^~~~~~~~~~~ Fix it by using the existing is_migrate_cma_page(), which evaulates to false when CMA=n. Fixes: `2e5bbb5461` ("KVM: PPC: Book3S HV: Migrate pinned pages out of CMA") Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Alexandre Belloni	ce7aeffe75	usb: gadget: udc: atmel: remove memory leak [ Upstream commit `32856eea7b` ] Commit `bbe097f092` ("usb: gadget: udc: atmel: fix endpoint name") introduced a memory leak when unbinding the driver. The endpoint names would not be freed. Solve that by including the name as a string in struct usba_ep so it is freed when the endpoint is. Signed-off-by: Alexandre Belloni <alexandre.belloni@free-electrons.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Gabriel Krisman Bertazi	4fa1c65cf0	serial: 8250_pci: Detach low-level driver during PCI error recovery [ Upstream commit `f209fa03fc` ] During a PCI error recovery, like the ones provoked by EEH in the ppc64 platform, all IO to the device must be blocked while the recovery is completed. Current 8250_pci implementation only suspends the port instead of detaching it, which doesn't prevent incoming accesses like TIOCMGET and TIOCMSET calls from reaching the device. Those end up racing with the EEH recovery, crashing it. Similar races were also observed when opening the device and when shutting it down during recovery. This patch implements a more robust IO blockage for the 8250_pci recovery by unregistering the port at the beginning of the procedure and re-adding it afterwards. Since the port is detached from the uart layer, we can be sure that no request will make through to the device during recovery. This is similar to the solution used by the JSM serial driver. I thank Peter Hurley <peter@hurleysoftware.com> for valuable input on this one over one year ago. Signed-off-by: Gabriel Krisman Bertazi <krisman@linux.vnet.ibm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:39 +01:00
Michael Pobega	56c28e7983	ACPI / blacklist: Make Dell Latitude 3350 ethernet work [ Upstream commit `708f5dcc21` ] The Dell Latitude 3350's ethernet card attempts to use a reserved IRQ (18), resulting in ACPI being unable to enable the ethernet. Adding it to acpi_rev_dmi_table[] helps to work around this problem. Signed-off-by: Michael Pobega <mpobega@neverware.com> [ rjw: Changelog ] Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Alex Hung	5c19e9070d	ACPI / blacklist: add _REV quirks for Dell Precision 5520 and 3520 [ Upstream commit `9523b9bf6d` ] Precision 5520 and 3520 either hang at login and during suspend or reboot. It turns out that that adding them to acpi_rev_dmi_table[] helps to work around those issues. Signed-off-by: Alex Hung <alex.hung@canonical.com> [ rjw: Changelog ] Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Vladimir Davydov	bc01eb9398	slub: move synchronize_sched out of slab_mutex on shrink [ Upstream commit `89e364db71` ] synchronize_sched() is a heavy operation and calling it per each cache owned by a memory cgroup being destroyed may take quite some time. What is worse, it's currently called under the slab_mutex, stalling all works doing cache creation/destruction. Actually, there isn't much point in calling synchronize_sched() for each cache - it's enough to call it just once - after setting cpu_partial for all caches and before shrinking them. This way, we can also move it out of the slab_mutex, which we have to hold for iterating over the slab cache list. Link: https://bugzilla.kernel.org/show_bug.cgi?id=172991 Link: http://lkml.kernel.org/r/0a10d71ecae3db00fb4421bcd3f82bcc911f4be4.1475329751.git.vdavydov.dev@gmail.com Signed-off-by: Vladimir Davydov <vdavydov.dev@gmail.com> Reported-by: Doug Smythies <dsmythies@telus.net> Acked-by: Joonsoo Kim <iamjoonsoo.kim@lge.com> Cc: Christoph Lameter <cl@linux.com> Cc: David Rientjes <rientjes@google.com> Cc: Johannes Weiner <hannes@cmpxchg.org> Cc: Michal Hocko <mhocko@kernel.org> Cc: Pekka Enberg <penberg@kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Henrik Ingo	bd2de45031	uvcvideo: uvc_scan_fallback() for webcams with broken chain [ Upstream commit `e950267ab8` ] Some devices have invalid baSourceID references, causing uvc_scan_chain() to fail, but if we just take the entities we can find and put them together in the most sensible chain we can think of, turns out they do work anyway. Note: This heuristic assumes there is a single chain. At the time of writing, devices known to have such a broken chain are - Acer Integrated Camera (5986:055a) - Realtek rtl157a7 (0bda:57a7) Signed-off-by: Henrik Ingo <henrik.ingo@avoinelama.fi> Signed-off-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com> Signed-off-by: Mauro Carvalho Chehab <mchehab@s-opensource.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Harald Freudenberger	e627116c0c	s390/zcrypt: Introduce CEX6 toleration [ Upstream commit `b3e8652bcb` ] Signed-off-by: Harald Freudenberger <freude@linux.vnet.ibm.com> Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Mauricio Faria de Oliveira	61a153d06e	block: allow WRITE_SAME commands with the SG_IO ioctl [ Upstream commit `25cdb64510` ] The WRITE_SAME commands are not present in the blk_default_cmd_filter write_ok list, and thus are failed with -EPERM when the SG_IO ioctl() is executed without CAP_SYS_RAWIO capability (e.g., unprivileged users). [ sg_io() -> blk_fill_sghdr_rq() > blk_verify_command() -> -EPERM ] The problem can be reproduced with the sg_write_same command # sg_write_same --num 1 --xferlen 512 /dev/sda # # capsh --drop=cap_sys_rawio -- -c \ 'sg_write_same --num 1 --xferlen 512 /dev/sda' Write same: pass through os error: Operation not permitted # For comparison, the WRITE_VERIFY command does not observe this problem, since it is in that list: # capsh --drop=cap_sys_rawio -- -c \ 'sg_write_verify --num 1 --ilen 512 --lba 0 /dev/sda' # So, this patch adds the WRITE_SAME commands to the list, in order for the SG_IO ioctl to finish successfully: # capsh --drop=cap_sys_rawio -- -c \ 'sg_write_same --num 1 --xferlen 512 /dev/sda' # That case happens to be exercised by QEMU KVM guests with 'scsi-block' devices (qemu "-device scsi-block" [1], libvirt "<disk type='block' device='lun'>" [2]), which employs the SG_IO ioctl() and runs as an unprivileged user (libvirt-qemu). In that scenario, when a filesystem (e.g., ext4) performs its zero-out calls, which are translated to write-same calls in the guest kernel, and then into SG_IO ioctls to the host kernel, SCSI I/O errors may be observed in the guest: [...] sd 0:0:0:0: [sda] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE [...] sd 0:0:0:0: [sda] tag#0 Sense Key : Aborted Command [current] [...] sd 0:0:0:0: [sda] tag#0 Add. Sense: I/O process terminated [...] sd 0:0:0:0: [sda] tag#0 CDB: Write Same(10) 41 00 01 04 e0 78 00 00 08 00 [...] blk_update_request: I/O error, dev sda, sector 17096824 Links: [1] http://git.qemu.org/?p=qemu.git;a=commit;h=336a6915bc7089fb20fea4ba99972ad9a97c5f52 [2] https://libvirt.org/formatdomain.html#elementsDisks (see 'disk' -> 'device') Signed-off-by: Mauricio Faria de Oliveira <mauricfo@linux.vnet.ibm.com> Signed-off-by: Brahadambal Srinivasan <latha@linux.vnet.ibm.com> Reported-by: Manjunatha H R <manjuhr1@in.ibm.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@fb.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Ben Skeggs	0042afe117	drm/nouveau/disp/nv50-: specify ctrl/user separately when constructing classes [ Upstream commit `2a32b9b186` ] Signed-off-by: Ben Skeggs <bskeggs@redhat.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Ben Skeggs	5001756c1c	drm/nouveau/disp/nv50-: split chid into chid.ctrl and chid.user [ Upstream commit `4391d7f5c7` ] GP102/GP104 make life difficult by redefining the channel indices for some registers, but not others. Signed-off-by: Ben Skeggs <bskeggs@redhat.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Ben Skeggs	ddc23b5212	drm/nouveau/disp/gp102: fix cursor/overlay immediate channel indices [ Upstream commit `e50fcff15f` ] Signed-off-by: Ben Skeggs <bskeggs@redhat.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Alexey Kardashevskiy	53e18968a9	vfio/spapr: Postpone default window creation [ Upstream commit `d9c728949d` ] We are going to allow the userspace to configure container in one memory context and pass container fd to another so we are postponing memory allocations accounted against the locked memory limit. One of previous patches took care of it_userspace. At the moment we create the default DMA window when the first group is attached to a container; this is done for the userspace which is not DDW-aware but familiar with the SPAPR TCE IOMMU v2 in the part of memory pre-registration - such client expects the default DMA window to exist. This postpones the default DMA window allocation till one of the folliwing happens: 1. first map/unmap request arrives; 2. new window is requested; This adds noop for the case when the userspace requested removal of the default window which has not been created yet. Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Acked-by: Alex Williamson <alex.williamson@redhat.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:38 +01:00
Alexey Kardashevskiy	2e60baca23	vfio/spapr: Add a helper to create default DMA window [ Upstream commit `6f01cc692a` ] There is already a helper to create a DMA window which does allocate a table and programs it to the IOMMU group. However tce_iommu_take_ownership_ddw() did not use it and did these 2 calls itself to simplify error path. Since we are going to delay the default window creation till the default window is accessed/removed or new window is added, we need a helper to create a default window from all these cases. This adds tce_iommu_create_default_window(). Since it relies on a VFIO container to have at least one IOMMU group (for future use), this changes tce_iommu_attach_group() to add a group to the container first and then call the new helper. Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Acked-by: Alex Williamson <alex.williamson@redhat.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Alexey Kardashevskiy	080eb13542	powerpc/mm/iommu, vfio/spapr: Put pages on VFIO container shutdown [ Upstream commit `4b6fad7097` ] At the moment the userspace tool is expected to request pinning of the entire guest RAM when VFIO IOMMU SPAPR v2 driver is present. When the userspace process finishes, all the pinned pages need to be put; this is done as a part of the userspace memory context (MM) destruction which happens on the very last mmdrop(). This approach has a problem that a MM of the userspace process may live longer than the userspace process itself as kernel threads use userspace process MMs which was runnning on a CPU where the kernel thread was scheduled to. If this happened, the MM remains referenced until this exact kernel thread wakes up again and releases the very last reference to the MM, on an idle system this can take even hours. This moves preregistered regions tracking from MM to VFIO; insteads of using mm_iommu_table_group_mem_t::used, tce_container::prereg_list is added so each container releases regions which it has pre-registered. This changes the userspace interface to return EBUSY if a memory region is already registered in a container. However it should not have any practical effect as the only userspace tool available now does register memory region once per container anyway. As tce_iommu_register_pages/tce_iommu_unregister_pages are called under container->lock, this does not need additional locking. Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru> Reviewed-by: Nicholas Piggin <npiggin@gmail.com> Acked-by: Alex Williamson <alex.williamson@redhat.com> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Alexey Kardashevskiy	92e44bcd71	vfio/spapr: Reference mm in tce_container [ Upstream commit `bc82d122ae` ] In some situations the userspace memory context may live longer than the userspace process itself so if we need to do proper memory context cleanup, we better have tce_container take a reference to mm_struct and use it later when the process is gone (@current or @current->mm is NULL). This references mm and stores the pointer in the container; this is done in a new helper - tce_iommu_mm_set() - when one of the following happens: - a container is enabled (IOMMU v1); - a first attempt to pre-register memory is made (IOMMU v2); - a DMA window is created (IOMMU v2). The @mm stays referenced till the container is destroyed. This replaces current->mm with container->mm everywhere except debug prints. This adds a check that current->mm is the same as the one stored in the container to prevent userspace from making changes to a memory context of other processes. DMA map/unmap ioctls() do not check for @mm as they already check for @enabled which is set after tce_iommu_mm_set() is called. This does not reference a task as multiple threads within the same mm are allowed to ioctl() to vfio and supposedly they will have same limits and capabilities and if they do not, we'll just fail with no harm made. Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru> Acked-by: Alex Williamson <alex.williamson@redhat.com> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Alexey Kardashevskiy	5b34666bd2	powerpc/iommu: Stop using @current in mm_iommu_xxx [ Upstream commit `d7baee6901` ] This changes mm_iommu_xxx helpers to take mm_struct as a parameter instead of getting it from @current which in some situations may not have a valid reference to mm. This changes helpers to receive @mm and moves all references to @current to the caller, including checks for !current and !current->mm; checks in mm_iommu_preregistered() are removed as there is no caller yet. This moves the mm_iommu_adjust_locked_vm() call to the caller as it receives mm_iommu_table_group_mem_t but it needs mm. This should cause no behavioral change. Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Acked-by: Alex Williamson <alex.williamson@redhat.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Alexey Kardashevskiy	2ba7ef2182	powerpc/iommu: Pass mm_struct to init/cleanup helpers [ Upstream commit `88f54a3581` ] We are going to get rid of @current references in mmu_context_boos3s64.c and cache mm_struct in the VFIO container. Since mm_context_t does not have reference counting, we will be using mm_struct which does have the reference counter. This changes mm_iommu_init/mm_iommu_cleanup to receive mm_struct rather than mm_context_t (which is embedded into mm). This should not cause any behavioral change. Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Alexey Kardashevskiy	5d8b3e7559	vfio/spapr: Postpone allocation of userspace version of TCE table [ Upstream commit `39701e56f5` ] The iommu_table struct manages a hardware TCE table and a vmalloc'd table with corresponding userspace addresses. Both are allocated when the default DMA window is created and this happens when the very first group is attached to a container. As we are going to allow the userspace to configure container in one memory context and pas container fd to another, we have to postpones such allocations till a container fd is passed to the destination user process so we would account locked memory limit against the actual container user constrainsts. This postpones the it_userspace array allocation till it is used first time for mapping. The unmapping patch already checks if the array is allocated. Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Acked-by: Alex Williamson <alex.williamson@redhat.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Vitaly Kuznetsov	3c0cbb47a8	Drivers: hv: ring_buffer: count on wrap around mappings in get_next_pkt_raw() (v2) [ Upstream commit `fa32ff6576` ] With wrap around mappings in place we can always provide drivers with direct links to packets on the ring buffer, even when they wrap around. Do the required updates to get_next_pkt_raw()/put_pkt_raw() The first version of this commit was reverted (`65a532f3d5`) to deal with cross-tree merge issues which are (hopefully) resolved now. Signed-off-by: Vitaly Kuznetsov <vkuznets@redhat.com> Signed-off-by: K. Y. Srinivasan <kys@microsoft.com> Tested-by: Dexuan Cui <decui@microsoft.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Thomas Falcon	3e5a7f5b95	ibmveth: calculate gso_segs for large packets [ Upstream commit `94acf164dc` ] Include calculations to compute the number of segments that comprise an aggregated large packet. Signed-off-by: Thomas Falcon <tlfalcon@linux.vnet.ibm.com> Reviewed-by: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com> Reviewed-by: Jonathan Maxwell <jmaxwell37@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Gavin Shan	fb7c521a14	PCI: Do any VF BAR updates before enabling the BARs [ Upstream commit `f40ec3c748` ] Previously we enabled VFs and enable their memory space before calling pcibios_sriov_enable(). But pcibios_sriov_enable() may update the VF BARs: for example, on PPC PowerNV we may change them to manage the association of VFs to PEs. Because 64-bit BARs cannot be updated atomically, it's unsafe to update them while they're enabled. The half-updated state may conflict with other devices in the system. Call pcibios_sriov_enable() before enabling the VFs so any BAR updates happen while the VF BARs are disabled. [bhelgaas: changelog] Tested-by: Carol Soto <clsoto@us.ibm.com> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Bjorn Helgaas	3d58444dea	PCI: Ignore BAR updates on virtual functions [ Upstream commit `63880b230a` ] VF BARs are read-only zero, so updating VF BARs will not have any effect. See the SR-IOV spec r1.1, sec 3.4.1.11. We already ignore these updates because of `70675e0b6a` ("PCI: Don't try to restore VF BARs"); this merely restructures it slightly to make it easier to split updates for standard and SR-IOV BARs. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Bjorn Helgaas	74cce811a4	PCI: Update BARs using property bits appropriate for type [ Upstream commit `45d004f4af` ] The BAR property bits (0-3 for memory BARs, 0-1 for I/O BARs) are supposed to be read-only, but we do save them in res->flags and include them when updating the BAR. Mask the I/O property bits with ~PCI_BASE_ADDRESS_IO_MASK (0x3) instead of PCI_REGION_FLAG_MASK (0xf) to make it obvious that we can't corrupt bits 2-3 of I/O addresses. Use PCI_ROM_ADDRESS_MASK for ROM BARs. This means we'll only check the top 21 bits (instead of the 28 bits we used to check) of a ROM BAR to see if the update was successful. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:37 +01:00
Bjorn Helgaas	a38012dc69	PCI: Don't update VF BARs while VF memory space is enabled [ Upstream commit `546ba9f8f2` ] If we update a VF BAR while it's enabled, there are two potential problems: 1) Any driver that's using the VF has a cached BAR value that is stale after the update, and 2) We can't update 64-bit BARs atomically, so the intermediate state (new lower dword with old upper dword) may conflict with another device, and an access by a driver unrelated to the VF may cause a bus error. Warn about attempts to update VF BARs while they are enabled. This is a programming error, so use dev_WARN() to get a backtrace. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Bjorn Helgaas	bb479246bb	PCI: Decouple IORESOURCE_ROM_ENABLE and PCI_ROM_ADDRESS_ENABLE [ Upstream commit `7a6d312b50` ] Remove the assumption that IORESOURCE_ROM_ENABLE == PCI_ROM_ADDRESS_ENABLE. PCI_ROM_ADDRESS_ENABLE is the ROM enable bit defined by the PCI spec, so if we're reading or writing a BAR register value, that's what we should use. IORESOURCE_ROM_ENABLE is a corresponding bit in struct resource flags. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Bjorn Helgaas	ed09d211d2	PCI: Add comments about ROM BAR updating [ Upstream commit `0b457dde3c` ] pci_update_resource() updates a hardware BAR so its address matches the kernel's struct resource UNLESS it's a disabled ROM BAR. We only update those when we enable the ROM. It's not obvious from the code why ROM BARs should be handled specially. Apparently there are Matrox devices with defective ROM BARs that read as zero when disabled. That means that if pci_enable_rom() reads the disabled BAR, sets PCI_ROM_ADDRESS_ENABLE (without re-inserting the address), and writes it back, it would enable the ROM at address zero. Add comments and references to explain why we can't make the code look more rational. The code changes are from `755528c860` ("Ignore disabled ROM resources at setup") and `8085ce084c` ("[PATCH] Fix PCI ROM mapping"). Link: https://lkml.org/lkml/2005/8/30/138 Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Bjorn Helgaas	7b65c3a843	PCI: Remove pci_resource_bar() and pci_iov_resource_bar() [ Upstream commit `286c2378aa` ] pci_std_update_resource() only deals with standard BARs, so we don't have to worry about the complications of VF BARs in an SR-IOV capability. Compute the BAR address inline and remove pci_resource_bar(). That makes pci_iov_resource_bar() unused, so remove that as well. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Bjorn Helgaas	6a5f3e664a	PCI: Separate VF BAR updates from standard BAR updates [ Upstream commit `6ffa2489c5` ] Previously pci_update_resource() used the same code path for updating standard BARs and VF BARs in SR-IOV capabilities. Split the VF BAR update into a new pci_iov_update_resource() internal interface, which makes it simpler to compute the BAR address (we can get rid of pci_resource_bar() and pci_iov_resource_bar()). This patch: - Renames pci_update_resource() to pci_std_update_resource(), - Adds pci_iov_update_resource(), - Makes pci_update_resource() a wrapper that calls the appropriate one, No functional change intended. Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Vitaly Kuznetsov	29d928785f	x86/hyperv: Handle unknown NMIs on one CPU when unknown_nmi_panic [ Upstream commit `59107e2f48` ] There is a feature in Hyper-V ('Debug-VM --InjectNonMaskableInterrupt') which injects NMI to the guest. We may want to crash the guest and do kdump on this NMI by enabling unknown_nmi_panic. To make kdump succeed we need to allow the kdump kernel to re-establish VMBus connection so it will see VMBus devices (storage, network,..). To properly unload VMBus making it possible to start over during kdump we need to do the following: - Send an 'unload' message to the hypervisor. This can be done on any CPU so we do this the crashing CPU. - Receive the 'unload finished' reply message. WS2012R2 delivers this message to the CPU which was used to establish VMBus connection during module load and this CPU may differ from the CPU sending 'unload'. Receiving a VMBus message means the following: - There is a per-CPU slot in memory for one message. This slot can in theory be accessed by any CPU. - We get an interrupt on the CPU when a message was placed into the slot. - When we read the message we need to clear the slot and signal the fact to the hypervisor. In case there are more messages to this CPU pending the hypervisor will deliver the next message. The signaling is done by writing to an MSR so this can only be done on the appropriate CPU. To avoid doing cross-CPU work on crash we have vmbus_wait_for_unload() function which checks message slots for all CPUs in a loop waiting for the 'unload finished' messages. However, there is an issue which arises when these conditions are met: - We're crashing on a CPU which is different from the one which was used to initially contact the hypervisor. - The CPU which was used for the initial contact is blocked with interrupts disabled and there is a message pending in the message slot. In this case we won't be able to read the 'unload finished' message on the crashing CPU. This is reproducible when we receive unknown NMIs on all CPUs simultaneously: the first CPU entering panic() will proceed to crash and all other CPUs will stop themselves with interrupts disabled. The suggested solution is to handle unknown NMIs for Hyper-V guests on the first CPU which gets them only. This will allow us to rely on VMBus interrupt handler being able to receive the 'unload finish' message in case it is delivered to a different CPU. The issue is not reproducible on WS2016 as Debug-VM delivers NMI to the boot CPU only, WS2012R2 and earlier Hyper-V versions are affected. Signed-off-by: Vitaly Kuznetsov <vkuznets@redhat.com> Acked-by: K. Y. Srinivasan <kys@microsoft.com> Cc: devel@linuxdriverproject.org Cc: Haiyang Zhang <haiyangz@microsoft.com> Link: http://lkml.kernel.org/r/20161202100720.28121-1-vkuznets@redhat.com Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Michael Cyr	456be98b4e	scsi: ibmvscsis: Synchronize cmds at remove time [ Upstream commit `8bf11557d4` ] This patch adds code to disconnect from the client, which will make sure any outstanding commands have been completed, before continuing on with the remove operation. Signed-off-by: Michael Cyr <mikecyr@us.ibm.com> Signed-off-by: Bryant G. Ly <bryantly@linux.vnet.ibm.com> Tested-by: Steven Royer <seroyer@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Michael Cyr	94700877c2	scsi: ibmvscsis: Synchronize cmds at tpg_enable_store time [ Upstream commit `c9b3379f60` ] This patch changes the way the IBM vSCSI server driver manages its Command/Response Queue (CRQ). We used to register the CRQ with phyp at probe time. Now we wait until tpg_enable_store. Similarly, when tpg_enable_store is called to "disable" (i.e. the stored value is 0), we unregister the queue with phyp. One consquence to this is that we have no need for the PART_UP_WAIT_ENAB state, since we can't get an Init Message from the client in our CRQ if we're waiting to be enabled, since we haven't registered the queue yet. Signed-off-by: Michael Cyr <mikecyr@us.ibm.com> Signed-off-by: Bryant G. Ly <bryantly@linux.vnet.ibm.com> Tested-by: Steven Royer <seroyer@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Michael Cyr	189491f81c	scsi: ibmvscsis: Rearrange functions for future patches [ Upstream commit `79fac9c9b7` ] This patch reorders functions in a manner necessary for a follow-on patch. It also makes some minor styling changes (mostly removing extra spaces) and fixes some typos. There are no code changes in this patch, with one exception: due to the reordering of the functions, I needed to explicitly declare a function at the top of the file. However, this will be removed in the next patch, since the code requiring the predeclaration will be removed. Signed-off-by: Michael Cyr <mikecyr@us.ibm.com> Signed-off-by: Bryant G. Ly <bryantly@linux.vnet.ibm.com> Tested-by: Steven Royer <seroyer@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Michael Cyr	4d36f4859f	scsi: ibmvscsis: Clean up properly if target_submit_cmd/tmr fails [ Upstream commit `7435b32e2d` ] Signed-off-by: Michael Cyr <mikecyr@us.ibm.com> Signed-off-by: Bryant G. Ly <bryantly@linux.vnet.ibm.com> Tested-by: Steven Royer <seroyer@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Michael Cyr	29022860df	scsi: ibmvscsis: Return correct partition name/# to client [ Upstream commit `9c93cf03d4` ] Signed-off-by: Michael Cyr <mikecyr@us.ibm.com> Signed-off-by: Bryant G. Ly <bryantly@linux.vnet.ibm.com> Tested-by: Steven Royer <seroyer@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:36 +01:00
Michael Cyr	0ad1d7660a	scsi: ibmvscsis: Issues from Dan Carpenter/Smatch [ Upstream commit `11950d70b5` ] Signed-off-by: Michael Cyr <mikecyr@us.ibm.com> Signed-off-by: Bryant G. Ly <bryantly@linux.vnet.ibm.com> Tested-by: Steven Royer <seroyer@linux.vnet.ibm.com> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Todd Fujinaka	61229e62c1	igb: add i211 to i210 PHY workaround [ Upstream commit `5bc8c230e2` ] i210 and i211 share the same PHY but have different PCI IDs. Don't forget i211 for any i210 workarounds. Signed-off-by: Todd Fujinaka <todd.fujinaka@intel.com> Tested-by: Aaron Brown <aaron.f.brown@intel.com> Signed-off-by: Jeff Kirsher <jeffrey.t.kirsher@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Chris J Arges	15ffc931ee	igb: Workaround for igb i210 firmware issue [ Upstream commit `4e684f59d7` ] Sometimes firmware may not properly initialize I347AT4_PAGE_SELECT causing the probe of an igb i210 NIC to fail. This patch adds an addition zeroing of this register during igb_get_phy_id to workaround this issue. Thanks for Jochen Henneberg for the idea and original patch. Signed-off-by: Chris J Arges <christopherarges@gmail.com> Tested-by: Aaron Brown <aaron.f.brown@intel.com> Signed-off-by: Jeff Kirsher <jeffrey.t.kirsher@intel.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Dan Streetman	4b40611a9b	xen: do not re-use pirq number cached in pci device msi msg data [ Upstream commit `c74fd80f2f` ] Revert the main part of commit: `af42b8d12f` ("xen: fix MSI setup and teardown for PV on HVM guests") That commit introduced reading the pci device's msi message data to see if a pirq was previously configured for the device's msi/msix, and re-use that pirq. At the time, that was the correct behavior. However, a later change to Qemu caused it to call into the Xen hypervisor to unmap all pirqs for a pci device, when the pci device disables its MSI/MSIX vectors; specifically the Qemu commit: c976437c7dba9c7444fb41df45468968aaa326ad ("qemu-xen: free all the pirqs for msi/msix when driver unload") Once Qemu added this pirq unmapping, it was no longer correct for the kernel to re-use the pirq number cached in the pci device msi message data. All Qemu releases since 2.1.0 contain the patch that unmaps the pirqs when the pci device disables its MSI/MSIX vectors. This bug is causing failures to initialize multiple NVMe controllers under Xen, because the NVMe driver sets up a single MSIX vector for each controller (concurrently), and then after using that to talk to the controller for some configuration data, it disables the single MSIX vector and re-configures all the MSIX vectors it needs. So the MSIX setup code tries to re-use the cached pirq from the first vector for each controller, but the hypervisor has already given away that pirq to another controller, and its initialization fails. This is discussed in more detail at: https://lists.xen.org/archives/html/xen-devel/2017-01/msg00447.html Fixes: `af42b8d12f` ("xen: fix MSI setup and teardown for PV on HVM guests") Signed-off-by: Dan Streetman <dan.streetman@canonical.com> Reviewed-by: Stefano Stabellini <sstabellini@kernel.org> Acked-by: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com> Signed-off-by: Boris Ostrovsky <boris.ostrovsky@oracle.com> Signed-off-by: Sasha Levin <alexander.levin@verizon.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Krister Johansen	2382c1486c	dmaengine: iota: ioat_alloc_chan_resources should not perform sleeping allocations. commit `21d25f6a42` upstream. On a kernel with DEBUG_LOCKS, ioat_free_chan_resources triggers an in_interrupt() warning. With PROVE_LOCKING, it reports detecting a SOFTIRQ-safe to SOFTIRQ-unsafe lock ordering in the same code path. This is because dma_generic_alloc_coherent() checks if the GFP flags permit blocking. It allocates from different subsystems if blocking is permitted. The free path knows how to return the memory to the correct allocator. If GFP_KERNEL is specified then the alloc and free end up going through cma_alloc(), which uses mutexes. Given that ioat_free_chan_resources() can be called in interrupt context, ioat_alloc_chan_resources() must specify GFP_NOWAIT so that the allocations do not block and instead use an allocator that uses spinlocks. Signed-off-by: Krister Johansen <kjlx@templeofstupid.com> Acked-by: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Vinod Koul <vinod.koul@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Daniel Borkmann	0e0f1d6fdb	bpf: fix mark_reg_unknown_value for spilled regs on map value marking [ Upstream commit `6760bf2ddd` ] Martin reported a verifier issue that hit the BUG_ON() for his test case in the mark_reg_unknown_value() function: [ 202.861380] kernel BUG at kernel/bpf/verifier.c:467! [...] [ 203.291109] Call Trace: [ 203.296501] [<ffffffff811364d5>] mark_map_reg+0x45/0x50 [ 203.308225] [<ffffffff81136558>] mark_map_regs+0x78/0x90 [ 203.320140] [<ffffffff8113938d>] do_check+0x226d/0x2c90 [ 203.331865] [<ffffffff8113a6ab>] bpf_check+0x48b/0x780 [ 203.343403] [<ffffffff81134c8e>] bpf_prog_load+0x27e/0x440 [ 203.355705] [<ffffffff8118a38f>] ? handle_mm_fault+0x11af/0x1230 [ 203.369158] [<ffffffff812d8188>] ? security_capable+0x48/0x60 [ 203.382035] [<ffffffff811351a4>] SyS_bpf+0x124/0x960 [ 203.393185] [<ffffffff810515f6>] ? __do_page_fault+0x276/0x490 [ 203.406258] [<ffffffff816db320>] entry_SYSCALL_64_fastpath+0x13/0x94 This issue got uncovered after the fix in `a08dd0da53` ("bpf: fix regression on verifier pruning wrt map lookups"). The reason why it wasn't noticed before was, because as mentioned in `a08dd0da53`, mark_map_regs() was doing the id matching incorrectly based on the uncached regs[regno].id. So, in the first loop, we walked all regs and as soon as we found regno == i, then this reg's id was cleared when calling mark_reg_unknown_value() thus that every subsequent register was probed against id of 0 (which, in combination with the PTR_TO_MAP_VALUE_OR_NULL type is an invalid condition that no other register state can hold), and therefore wasn't type transitioned such as in the spilled register case for the second loop. Now since that got fixed, it turned out that `57a09bf0a4` ("bpf: Detect identical PTR_TO_MAP_VALUE_OR_NULL registers") used mark_reg_unknown_value() incorrectly for the spilled regs, and thus hitting the BUG_ON() in some cases due to regno >= MAX_BPF_REG. Although spilled regs have the same type as the non-spilled regs for the verifier state, that is, struct bpf_reg_state, they are semantically different from the non-spilled regs. In other words, there can be up to 64 (MAX_BPF_STACK / BPF_REG_SIZE) spilled regs in the stack, for example, register R<x> could have been spilled by the program to stack location X, Y, Z, and in mark_map_regs() we need to scan these stack slots of type STACK_SPILL for potential registers that we have to transition from PTR_TO_MAP_VALUE_OR_NULL. Therefore, depending on the location, the spilled_regs regno can be a lot higher than just MAX_BPF_REG's value since we operate on stack instead. The reset in mark_reg_unknown_value() itself is just fine, only that the BUG_ON() was inappropriate for this. Fix it by making a __mark_reg_unknown_value() version that can be called from mark_map_reg() generically; we know for the non-spilled case that the regno is always < MAX_BPF_REG anyway. Fixes: `57a09bf0a4` ("bpf: Detect identical PTR_TO_MAP_VALUE_OR_NULL registers") Reported-by: Martin KaFai Lau <kafai@fb.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Daniel Borkmann	1889d6d9b5	bpf: fix regression on verifier pruning wrt map lookups [ Upstream commit `a08dd0da53` ] Commit `57a09bf0a4` ("bpf: Detect identical PTR_TO_MAP_VALUE_OR_NULL registers") introduced a regression where existing programs stopped loading due to reaching the verifier's maximum complexity limit, whereas prior to this commit they were loading just fine; the affected program has roughly 2k instructions. What was found is that state pruning couldn't be performed effectively anymore due to mismatches of the verifier's register state, in particular in the id tracking. It doesn't mean that `57a09bf0a4` is incorrect per se, but rather that verifier needs to perform a lot more work for the same program with regards to involved map lookups. Since commit `57a09bf0a4` is only about tracking registers with type PTR_TO_MAP_VALUE_OR_NULL, the id is only needed to follow registers until they are promoted through pattern matching with a NULL check to either PTR_TO_MAP_VALUE or UNKNOWN_VALUE type. After that point, the id becomes irrelevant for the transitioned types. For UNKNOWN_VALUE, id is already reset to 0 via mark_reg_unknown_value(), but not so for PTR_TO_MAP_VALUE where id is becoming stale. It's even transferred further into other types that don't make use of it. Among others, one example is where UNKNOWN_VALUE is set on function call return with RET_INTEGER return type. states_equal() will then fall through the memcmp() on register state; note that the second memcmp() uses offsetofend(), so the id is part of that since `d2a4dd37f6` ("bpf: fix state equivalence"). But the bisect pointed already to `57a09bf0a4`, where we really reach beyond complexity limit. What I found was that states_equal() often failed in this case due to id mismatches in spilled regs with registers in type PTR_TO_MAP_VALUE. Unlike non-spilled regs, spilled regs just perform a memcmp() on their reg state and don't have any other optimizations in place, therefore also id was relevant in this case for making a pruning decision. We can safely reset id to 0 as well when converting to PTR_TO_MAP_VALUE. For the affected program, it resulted in a ~17 fold reduction of complexity and let the program load fine again. Selftest suite also runs fine. The only other place where env->id_gen is used currently is through direct packet access, but for these cases id is long living, thus a different scenario. Also, the current logic in mark_map_regs() is not fully correct when marking NULL branch with UNKNOWN_VALUE. We need to cache the destination reg's id in any case. Otherwise, once we marked that reg as UNKNOWN_VALUE, it's id is reset and any subsequent registers that hold the original id and are of type PTR_TO_MAP_VALUE_OR_NULL won't be marked UNKNOWN_VALUE anymore, since mark_map_reg() reuses the uncached regs[regno].id that was just overridden. Note, we don't need to cache it outside of mark_map_regs(), since it's called once on this_branch and the other time on other_branch, which are both two independent verifier states. A test case for this is added here, too. Fixes: `57a09bf0a4` ("bpf: Detect identical PTR_TO_MAP_VALUE_OR_NULL registers") Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Thomas Graf <tgraf@suug.ch> Acked-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Alexei Starovoitov	b7f5aa1ca0	bpf: fix state equivalence [ Upstream commit `d2a4dd37f6` ] Commmits `57a09bf0a4` ("bpf: Detect identical PTR_TO_MAP_VALUE_OR_NULL registers") and `484611357c` ("bpf: allow access into map value arrays") by themselves are correct, but in combination they make state equivalence ignore 'id' field of the register state which can lead to accepting invalid program. Fixes: `57a09bf0a4` ("bpf: Detect identical PTR_TO_MAP_VALUE_OR_NULL registers") Fixes: `484611357c` ("bpf: allow access into map value arrays") Signed-off-by: Alexei Starovoitov <ast@kernel.org> Acked-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Thomas Graf <tgraf@suug.ch> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Thomas Graf	1411707acb	bpf: Detect identical PTR_TO_MAP_VALUE_OR_NULL registers [ Upstream commit `57a09bf0a4` ] A BPF program is required to check the return register of a map_elem_lookup() call before accessing memory. The verifier keeps track of this by converting the type of the result register from PTR_TO_MAP_VALUE_OR_NULL to PTR_TO_MAP_VALUE after a conditional jump ensures safety. This check is currently exclusively performed for the result register 0. In the event the compiler reorders instructions, BPF_MOV64_REG instructions may be moved before the conditional jump which causes them to keep their type PTR_TO_MAP_VALUE_OR_NULL to which the verifier objects when the register is accessed: 0: (b7) r1 = 10 1: (7b) (u64 )(r10 -8) = r1 2: (bf) r2 = r10 3: (07) r2 += -8 4: (18) r1 = 0x59c00000 6: (85) call 1 7: (bf) r4 = r0 8: (15) if r0 == 0x0 goto pc+1 R0=map_value(ks=8,vs=8) R4=map_value_or_null(ks=8,vs=8) R10=fp 9: (7a) (u64 )(r4 +0) = 0 R4 invalid mem access 'map_value_or_null' This commit extends the verifier to keep track of all identical PTR_TO_MAP_VALUE_OR_NULL registers after a map_elem_lookup() by assigning them an ID and then marking them all when the conditional jump is observed. Signed-off-by: Thomas Graf <tgraf@suug.ch> Reviewed-by: Josef Bacik <jbacik@fb.com> Acked-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Hannes Frederic Sowa	9e38375a4b	dccp: fix memory leak during tear-down of unsuccessful connection request [ Upstream commit `72ef9c4125` ] This patch fixes a memory leak, which happens if the connection request is not fulfilled between parsing the DCCP options and handling the SYN (because e.g. the backlog is full), because we forgot to free the list of ack vectors. Reported-by: Jianwen Ji <jiji@redhat.com> Signed-off-by: Hannes Frederic Sowa <hannes@stressinduktion.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:35 +01:00
Hannes Frederic Sowa	beaa66cce5	tun: fix premature POLLOUT notification on tun devices [ Upstream commit `b20e2d5478` ] aszlig observed failing ssh tunnels (-w) during initialization since commit `cc9da6cc4f` ("ipv6: addrconf: use stable address generator for ARPHRD_NONE"). We already had reports that the mentioned commit breaks Juniper VPN connections. I can't clearly say that the Juniper VPN client has the same problem, but it is worth a try to hint to this patch. Because of the early generation of link local addresses, the kernel now can start asking for routers on the local subnet much earlier than usual. Those router solicitation packets arrive inside the ssh channels and should be transmitted to the tun fd before the configuration scripts might have upped the interface and made it ready for transmission. ssh polls on the interface and receives back a POLL_OUT. It tries to send the earily router solicitation packet to the tun interface. Unfortunately it hasn't been up'ed yet by config scripts, thus failing with -EIO. ssh doesn't retry again and considers the tun interface broken forever. Link: https://bugzilla.kernel.org/show_bug.cgi?id=121131 Fixes: `cc9da6cc4f` ("ipv6: addrconf: use stable address generator for ARPHRD_NONE") Cc: Bjørn Mork <bjorn@mork.no> Reported-by: Valdis Kletnieks <Valdis.Kletnieks@vt.edu> Cc: Valdis Kletnieks <Valdis.Kletnieks@vt.edu> Reported-by: Jonas Lippuner <jonas@lippuner.ca> Cc: Jonas Lippuner <jonas@lippuner.ca> Reported-by: aszlig <aszlig@redmoonstudios.org> Cc: aszlig <aszlig@redmoonstudios.org> Signed-off-by: Hannes Frederic Sowa <hannes@stressinduktion.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
Jon Maxwell	98933eb36d	dccp/tcp: fix routing redirect race [ Upstream commit `45caeaa5ac` ] As Eric Dumazet pointed out this also needs to be fixed in IPv6. v2: Contains the IPv6 tcp/Ipv6 dccp patches as well. We have seen a few incidents lately where a dst_enty has been freed with a dangling TCP socket reference (sk->sk_dst_cache) pointing to that dst_entry. If the conditions/timings are right a crash then ensues when the freed dst_entry is referenced later on. A Common crashing back trace is: #8 [] page_fault at ffffffff8163e648 [exception RIP: __tcp_ack_snd_check+74] . . #9 [] tcp_rcv_established at ffffffff81580b64 #10 [] tcp_v4_do_rcv at ffffffff8158b54a #11 [] tcp_v4_rcv at ffffffff8158cd02 #12 [] ip_local_deliver_finish at ffffffff815668f4 #13 [] ip_local_deliver at ffffffff81566bd9 #14 [] ip_rcv_finish at ffffffff8156656d #15 [] ip_rcv at ffffffff81566f06 #16 [] __netif_receive_skb_core at ffffffff8152b3a2 #17 [] __netif_receive_skb at ffffffff8152b608 #18 [] netif_receive_skb at ffffffff8152b690 #19 [] vmxnet3_rq_rx_complete at ffffffffa015eeaf [vmxnet3] #20 [] vmxnet3_poll_rx_only at ffffffffa015f32a [vmxnet3] #21 [] net_rx_action at ffffffff8152bac2 #22 [] __do_softirq at ffffffff81084b4f #23 [] call_softirq at ffffffff8164845c #24 [] do_softirq at ffffffff81016fc5 #25 [] irq_exit at ffffffff81084ee5 #26 [] do_IRQ at ffffffff81648ff8 Of course it may happen with other NIC drivers as well. It's found the freed dst_entry here: 224 static bool tcp_in_quickack_mode(struct sock sk)↩ 225 {↩ 226 ▹ const struct inet_connection_sock icsk = inet_csk(sk);↩ 227 ▹ const struct dst_entry *dst = __sk_dst_get(sk);↩ 228 ↩ 229 ▹ return (dst && dst_metric(dst, RTAX_QUICKACK)) \|\|↩ 230 ▹ ▹ (icsk->icsk_ack.quick && !icsk->icsk_ack.pingpong);↩ 231 }↩ But there are other backtraces attributed to the same freed dst_entry in netfilter code as well. All the vmcores showed 2 significant clues: - Remote hosts behind the default gateway had always been redirected to a different gateway. A rtable/dst_entry will be added for that host. Making more dst_entrys with lower reference counts. Making this more probable. - All vmcores showed a postitive LockDroppedIcmps value, e.g: LockDroppedIcmps 267 A closer look at the tcp_v4_err() handler revealed that do_redirect() will run regardless of whether user space has the socket locked. This can result in a race condition where the same dst_entry cached in sk->sk_dst_entry can be decremented twice for the same socket via: do_redirect()->__sk_dst_check()-> dst_release(). Which leads to the dst_entry being prematurely freed with another socket pointing to it via sk->sk_dst_cache and a subsequent crash. To fix this skip do_redirect() if usespace has the socket locked. Instead let the redirect take place later when user space does not have the socket locked. The dccp/IPv6 code is very similar in this respect, so fixing it there too. As Eric Garver pointed out the following commit now invalidates routes. Which can set the dst->obsolete flag so that ipv4_dst_check() returns null and triggers the dst_release(). Fixes: `ceb3320610` ("ipv4: Kill routes during PMTU/redirect updates.") Cc: Eric Garver <egarver@redhat.com> Cc: Hannes Sowa <hsowa@redhat.com> Signed-off-by: Jon Maxwell <jmaxwell37@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
Florian Westphal	9bce26f224	bridge: drop netfilter fake rtable unconditionally [ Upstream commit `a13b2082ec` ] Andreas reports kernel oops during rmmod of the br_netfilter module. Hannes debugged the oops down to a NULL rt6info->rt6i_indev. Problem is that br_netfilter has the nasty concept of adding a fake rtable to skb->dst; this happens in a br_netfilter prerouting hook. A second hook (in bridge LOCAL_IN) is supposed to remove these again before the skb is handed up the stack. However, on module unload hooks get unregistered which means an skb could traverse the prerouting hook that attaches the fake_rtable, while the 'fake rtable remove' hook gets removed from the hooklist immediately after. Fixes: `34666d467c` ("netfilter: bridge: move br_netfilter out of the core") Reported-by: Andreas Karis <akaris@redhat.com> Debugged-by: Hannes Frederic Sowa <hannes@stressinduktion.org> Signed-off-by: Florian Westphal <fw@strlen.de> Acked-by: Pablo Neira Ayuso <pablo@netfilter.org> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
Florian Westphal	683100ed45	ipv6: avoid write to a possibly cloned skb [ Upstream commit `79e49503ef` ] ip6_fragment, in case skb has a fraglist, checks if the skb is cloned. If it is, it will move to the 'slow path' and allocates new skbs for each fragment. However, right before entering the slowpath loop, it updates the nexthdr value of the last ipv6 extension header to NEXTHDR_FRAGMENT, to account for the fragment header that will be inserted in the new ipv6-fragment skbs. In case original skb is cloned this munges nexthdr value of another skb. Avoid this by doing the nexthdr update for each of the new fragment skbs separately. This was observed with tcpdump on a bridge device where netfilter ipv6 reassembly is active: tcpdump shows malformed fragment headers as the l4 header (icmpv6, tcp, etc). is decoded as a fragment header. Cc: Hannes Frederic Sowa <hannes@stressinduktion.org> Reported-by: Andreas Karis <akaris@redhat.com> Signed-off-by: Florian Westphal <fw@strlen.de> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
Sabrina Dubroca	4a8d3bb73a	ipv6: make ECMP route replacement less greedy [ Upstream commit `67e194007b` ] Commit `2759647247` ("ipv6: fix ECMP route replacement") introduced a loop that removes all siblings of an ECMP route that is being replaced. However, this loop doesn't stop when it has replaced siblings, and keeps removing other routes with a higher metric. We also end up triggering the WARN_ON after the loop, because after this nsiblings < 0. Instead, stop the loop when we have taken care of all routes with the same metric as the route being replaced. Reproducer: =========== #!/bin/sh ip netns add ns1 ip netns add ns2 ip -net ns1 link set lo up for x in 0 1 2 ; do ip link add veth$x netns ns2 type veth peer name eth$x netns ns1 ip -net ns1 link set eth$x up ip -net ns2 link set veth$x up done ip -net ns1 -6 r a 2000::/64 nexthop via fe80::0 dev eth0 \ nexthop via fe80::1 dev eth1 nexthop via fe80::2 dev eth2 ip -net ns1 -6 r a 2000::/64 via fe80::42 dev eth0 metric 256 ip -net ns1 -6 r a 2000::/64 via fe80::43 dev eth0 metric 2048 echo "before replace, 3 routes" ip -net ns1 -6 r \| grep -v '^fe80\\|^ff00' echo ip -net ns1 -6 r c 2000::/64 nexthop via fe80::4 dev eth0 \ nexthop via fe80::5 dev eth1 nexthop via fe80::6 dev eth2 echo "after replace, only 2 routes, metric 2048 is gone" ip -net ns1 -6 r \| grep -v '^fe80\\|^ff00' Fixes: `2759647247` ("ipv6: fix ECMP route replacement") Signed-off-by: Sabrina Dubroca <sd@queasysnail.net> Acked-by: Nicolas Dichtel <nicolas.dichtel@6wind.com> Reviewed-by: Xin Long <lucien.xin@gmail.com> Reviewed-by: Michal Kubecek <mkubecek@suse.cz> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
David Ahern	87c0286a07	mpls: Do not decrement alive counter for unregister events [ Upstream commit `79099aab38` ] Multipath routes can be rendered usesless when a device in one of the paths is deleted. For example: $ ip -f mpls ro ls 100 nexthop as to 200 via inet 172.16.2.2 dev virt12 nexthop as to 300 via inet 172.16.3.2 dev br0 101 nexthop as to 201 via inet6 2000:2::2 dev virt12 nexthop as to 301 via inet6 2000:3::2 dev br0 $ ip li del br0 When br0 is deleted the other hop is not considered in mpls_select_multipath because of the alive check -- rt_nhn_alive is 0. rt_nhn_alive is decremented once in mpls_ifdown when the device is taken down (NETDEV_DOWN) and again when it is deleted (NETDEV_UNREGISTER). For a 2 hop route, deleting one device drops the alive count to 0. Since devices are taken down before unregistering, the decrement on NETDEV_UNREGISTER is redundant. Fixes: `c89359a42e` ("mpls: support for dead routes") Signed-off-by: David Ahern <dsa@cumulusnetworks.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
David Ahern	b61206e253	mpls: Send route delete notifications when router module is unloaded [ Upstream commit `e37791ec1a` ] When the mpls_router module is unloaded, mpls routes are deleted but notifications are not sent to userspace leaving userspace caches out of sync. Add the call to mpls_notify_route in mpls_net_exit as routes are freed. Fixes: `0189197f44` ("mpls: Basic routing support") Signed-off-by: David Ahern <dsa@cumulusnetworks.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
Etienne Noss	47c8dc47c0	act_connmark: avoid crashing on malformed nlattrs with null parms [ Upstream commit `52491c7607` ] tcf_connmark_init does not check in its configuration if TCA_CONNMARK_PARMS is set, resulting in a null pointer dereference when trying to access it. [501099.043007] BUG: unable to handle kernel NULL pointer dereference at 0000000000000004 [501099.043039] IP: [<ffffffffc10c60fb>] tcf_connmark_init+0x8b/0x180 [act_connmark] ... [501099.044334] Call Trace: [501099.044345] [<ffffffffa47270e8>] ? tcf_action_init_1+0x198/0x1b0 [501099.044363] [<ffffffffa47271b0>] ? tcf_action_init+0xb0/0x120 [501099.044380] [<ffffffffa47250a4>] ? tcf_exts_validate+0xc4/0x110 [501099.044398] [<ffffffffc0f5fa97>] ? u32_set_parms+0xa7/0x270 [cls_u32] [501099.044417] [<ffffffffc0f60bf0>] ? u32_change+0x680/0x87b [cls_u32] [501099.044436] [<ffffffffa4725d1d>] ? tc_ctl_tfilter+0x4dd/0x8a0 [501099.044454] [<ffffffffa44a23a1>] ? security_capable+0x41/0x60 [501099.044471] [<ffffffffa470ca01>] ? rtnetlink_rcv_msg+0xe1/0x220 [501099.044490] [<ffffffffa470c920>] ? rtnl_newlink+0x870/0x870 [501099.044507] [<ffffffffa472cc61>] ? netlink_rcv_skb+0xa1/0xc0 [501099.044524] [<ffffffffa47073f4>] ? rtnetlink_rcv+0x24/0x30 [501099.044541] [<ffffffffa472c634>] ? netlink_unicast+0x184/0x230 [501099.044558] [<ffffffffa472c9d8>] ? netlink_sendmsg+0x2f8/0x3b0 [501099.044576] [<ffffffffa46d8880>] ? sock_sendmsg+0x30/0x40 [501099.044592] [<ffffffffa46d8e03>] ? SYSC_sendto+0xd3/0x150 [501099.044608] [<ffffffffa425fda1>] ? __do_page_fault+0x2d1/0x510 [501099.044626] [<ffffffffa47fbd7b>] ? system_call_fast_compare_end+0xc/0x9b Fixes: `22a5dc0e5e` ("net: sched: Introduce connmark action") Signed-off-by: Étienne Noss <etienne.noss@wifirst.fr> Signed-off-by: Victorien Molle <victorien.molle@wifirst.fr> Acked-by: Cong Wang <xiyou.wangcong@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
Dmitry V. Levin	ccb65adc6c	uapi: fix linux/packet_diag.h userspace compilation error [ Upstream commit `745cb7f8a5` ] Replace MAX_ADDR_LEN with its numeric value to fix the following linux/packet_diag.h userspace compilation error: /usr/include/linux/packet_diag.h:67:17: error: 'MAX_ADDR_LEN' undeclared here (not in a function) __u8 pdmc_addr[MAX_ADDR_LEN]; This is not the first case in the UAPI where the numeric value of MAX_ADDR_LEN is used instead of symbolic one, uapi/linux/if_link.h already does the same: $ grep MAX_ADDR_LEN include/uapi/linux/if_link.h __u8 mac[32]; /* MAX_ADDR_LEN */ There are no UAPI headers besides these two that use MAX_ADDR_LEN. Signed-off-by: Dmitry V. Levin <ldv@altlinux.org> Acked-by: Pavel Emelyanov <xemul@virtuozzo.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
Paolo Abeni	b07eed8f71	net/tunnel: set inner protocol in network gro hooks [ Upstream commit `294acf1c01` ] The gso code of several tunnels type (gre and udp tunnels) takes for granted that the skb->inner_protocol is properly initialized and drops the packet elsewhere. On the forwarding path no one is initializing such field, so gro encapsulated packets are dropped on forward. Since commit `3872035241` ("gre: Use inner_proto to obtain inner header protocol"), this can be reproduced when the encapsulated packets use gre as the tunneling protocol. The issue happens also with vxlan and geneve tunnels since commit `8bce6d7d0d` ("udp: Generalize skb_udp_segment"), if the forwarding host's ingress nic has h/w offload for such tunnel and a vxlan/geneve device is configured on top of it, regardless of the configured peer address and vni. To address the issue, this change initialize the inner_protocol field for encapsulated packets in both ipv4 and ipv6 gro complete callbacks. Fixes: `3872035241` ("gre: Use inner_proto to obtain inner header protocol") Fixes: `8bce6d7d0d` ("udp: Generalize skb_udp_segment") Signed-off-by: Paolo Abeni <pabeni@redhat.com> Acked-by: Alexander Duyck <alexander.h.duyck@intel.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:34 +01:00
David Ahern	db6e779618	vrf: Fix use-after-free in vrf_xmit [ Upstream commit `f7887d40e5` ] KASAN detected a use-after-free: [ 269.467067] BUG: KASAN: use-after-free in vrf_xmit+0x7f1/0x827 [vrf] at addr ffff8800350a21c0 [ 269.467067] Read of size 4 by task ssh/1879 [ 269.467067] CPU: 1 PID: 1879 Comm: ssh Not tainted 4.10.0+ #249 [ 269.467067] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.7.5-20140531_083030-gandalf 04/01/2014 [ 269.467067] Call Trace: [ 269.467067] dump_stack+0x81/0xb6 [ 269.467067] kasan_object_err+0x21/0x78 [ 269.467067] kasan_report+0x2f7/0x450 [ 269.467067] ? vrf_xmit+0x7f1/0x827 [vrf] [ 269.467067] ? ip_output+0xa4/0xdb [ 269.467067] __asan_load4+0x6b/0x6d [ 269.467067] vrf_xmit+0x7f1/0x827 [vrf] ... Which corresponds to the skb access after xmit handling. Fix by saving skb->len and using the saved value to update stats. Fixes: `193125dbd8` ("net: Introduce VRF device driver") Signed-off-by: David Ahern <dsa@cumulusnetworks.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Eric Dumazet	7c0eaeec84	dccp: fix use-after-free in dccp_feat_activate_values [ Upstream commit `62f8f4d906` ] Dmitry reported crashes in DCCP stack [1] Problem here is that when I got rid of listener spinlock, I missed the fact that DCCP stores a complex state in struct dccp_request_sock, while TCP does not. Since multiple cpus could access it at the same time, we need to add protection. [1] BUG: KASAN: use-after-free in dccp_feat_activate_values+0x967/0xab0 net/dccp/feat.c:1541 at addr ffff88003713be68 Read of size 8 by task syz-executor2/8457 CPU: 2 PID: 8457 Comm: syz-executor2 Not tainted 4.10.0-rc7+ #127 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011 Call Trace: <IRQ> __dump_stack lib/dump_stack.c:15 [inline] dump_stack+0x292/0x398 lib/dump_stack.c:51 kasan_object_err+0x1c/0x70 mm/kasan/report.c:162 print_address_description mm/kasan/report.c:200 [inline] kasan_report_error mm/kasan/report.c:289 [inline] kasan_report.part.1+0x20e/0x4e0 mm/kasan/report.c:311 kasan_report mm/kasan/report.c:332 [inline] __asan_report_load8_noabort+0x29/0x30 mm/kasan/report.c:332 dccp_feat_activate_values+0x967/0xab0 net/dccp/feat.c:1541 dccp_create_openreq_child+0x464/0x610 net/dccp/minisocks.c:121 dccp_v6_request_recv_sock+0x1f6/0x1960 net/dccp/ipv6.c:457 dccp_check_req+0x335/0x5a0 net/dccp/minisocks.c:186 dccp_v6_rcv+0x69e/0x1d00 net/dccp/ipv6.c:711 ip6_input_finish+0x46d/0x17a0 net/ipv6/ip6_input.c:279 NF_HOOK include/linux/netfilter.h:257 [inline] ip6_input+0xdb/0x590 net/ipv6/ip6_input.c:322 dst_input include/net/dst.h:507 [inline] ip6_rcv_finish+0x289/0x890 net/ipv6/ip6_input.c:69 NF_HOOK include/linux/netfilter.h:257 [inline] ipv6_rcv+0x12ec/0x23d0 net/ipv6/ip6_input.c:203 __netif_receive_skb_core+0x1ae5/0x3400 net/core/dev.c:4190 __netif_receive_skb+0x2a/0x170 net/core/dev.c:4228 process_backlog+0xe5/0x6c0 net/core/dev.c:4839 napi_poll net/core/dev.c:5202 [inline] net_rx_action+0xe70/0x1900 net/core/dev.c:5267 __do_softirq+0x2fb/0xb7d kernel/softirq.c:284 do_softirq_own_stack+0x1c/0x30 arch/x86/entry/entry_64.S:902 </IRQ> do_softirq.part.17+0x1e8/0x230 kernel/softirq.c:328 do_softirq kernel/softirq.c:176 [inline] __local_bh_enable_ip+0x1f2/0x200 kernel/softirq.c:181 local_bh_enable include/linux/bottom_half.h:31 [inline] rcu_read_unlock_bh include/linux/rcupdate.h:971 [inline] ip6_finish_output2+0xbb0/0x23d0 net/ipv6/ip6_output.c:123 ip6_finish_output+0x302/0x960 net/ipv6/ip6_output.c:148 NF_HOOK_COND include/linux/netfilter.h:246 [inline] ip6_output+0x1cb/0x8d0 net/ipv6/ip6_output.c:162 ip6_xmit+0xcdf/0x20d0 include/net/dst.h:501 inet6_csk_xmit+0x320/0x5f0 net/ipv6/inet6_connection_sock.c:179 dccp_transmit_skb+0xb09/0x1120 net/dccp/output.c:141 dccp_xmit_packet+0x215/0x760 net/dccp/output.c:280 dccp_write_xmit+0x168/0x1d0 net/dccp/output.c:362 dccp_sendmsg+0x79c/0xb10 net/dccp/proto.c:796 inet_sendmsg+0x164/0x5b0 net/ipv4/af_inet.c:744 sock_sendmsg_nosec net/socket.c:635 [inline] sock_sendmsg+0xca/0x110 net/socket.c:645 SYSC_sendto+0x660/0x810 net/socket.c:1687 SyS_sendto+0x40/0x50 net/socket.c:1655 entry_SYSCALL_64_fastpath+0x1f/0xc2 RIP: 0033:0x4458b9 RSP: 002b:00007f8ceb77bb58 EFLAGS: 00000282 ORIG_RAX: 000000000000002c RAX: ffffffffffffffda RBX: 0000000000000017 RCX: 00000000004458b9 RDX: 0000000000000023 RSI: 0000000020e60000 RDI: 0000000000000017 RBP: 00000000006e1b90 R08: 00000000200f9fe1 R09: 0000000000000020 R10: 0000000000008010 R11: 0000000000000282 R12: 00000000007080a8 R13: 0000000000000000 R14: 00007f8ceb77c9c0 R15: 00007f8ceb77c700 Object at ffff88003713be50, in cache kmalloc-64 size: 64 Allocated: PID = 8446 save_stack_trace+0x16/0x20 arch/x86/kernel/stacktrace.c:57 save_stack+0x43/0xd0 mm/kasan/kasan.c:502 set_track mm/kasan/kasan.c:514 [inline] kasan_kmalloc+0xad/0xe0 mm/kasan/kasan.c:605 kmem_cache_alloc_trace+0x82/0x270 mm/slub.c:2738 kmalloc include/linux/slab.h:490 [inline] dccp_feat_entry_new+0x214/0x410 net/dccp/feat.c:467 dccp_feat_push_change+0x38/0x220 net/dccp/feat.c:487 __feat_register_sp+0x223/0x2f0 net/dccp/feat.c:741 dccp_feat_propagate_ccid+0x22b/0x2b0 net/dccp/feat.c:949 dccp_feat_server_ccid_dependencies+0x1b3/0x250 net/dccp/feat.c:1012 dccp_make_response+0x1f1/0xc90 net/dccp/output.c:423 dccp_v6_send_response+0x4ec/0xc20 net/dccp/ipv6.c:217 dccp_v6_conn_request+0xaba/0x11b0 net/dccp/ipv6.c:377 dccp_rcv_state_process+0x51e/0x1650 net/dccp/input.c:606 dccp_v6_do_rcv+0x213/0x350 net/dccp/ipv6.c:632 sk_backlog_rcv include/net/sock.h:893 [inline] __sk_receive_skb+0x36f/0xcc0 net/core/sock.c:479 dccp_v6_rcv+0xba5/0x1d00 net/dccp/ipv6.c:742 ip6_input_finish+0x46d/0x17a0 net/ipv6/ip6_input.c:279 NF_HOOK include/linux/netfilter.h:257 [inline] ip6_input+0xdb/0x590 net/ipv6/ip6_input.c:322 dst_input include/net/dst.h:507 [inline] ip6_rcv_finish+0x289/0x890 net/ipv6/ip6_input.c:69 NF_HOOK include/linux/netfilter.h:257 [inline] ipv6_rcv+0x12ec/0x23d0 net/ipv6/ip6_input.c:203 __netif_receive_skb_core+0x1ae5/0x3400 net/core/dev.c:4190 __netif_receive_skb+0x2a/0x170 net/core/dev.c:4228 process_backlog+0xe5/0x6c0 net/core/dev.c:4839 napi_poll net/core/dev.c:5202 [inline] net_rx_action+0xe70/0x1900 net/core/dev.c:5267 __do_softirq+0x2fb/0xb7d kernel/softirq.c:284 Freed: PID = 15 save_stack_trace+0x16/0x20 arch/x86/kernel/stacktrace.c:57 save_stack+0x43/0xd0 mm/kasan/kasan.c:502 set_track mm/kasan/kasan.c:514 [inline] kasan_slab_free+0x73/0xc0 mm/kasan/kasan.c:578 slab_free_hook mm/slub.c:1355 [inline] slab_free_freelist_hook mm/slub.c:1377 [inline] slab_free mm/slub.c:2954 [inline] kfree+0xe8/0x2b0 mm/slub.c:3874 dccp_feat_entry_destructor.part.4+0x48/0x60 net/dccp/feat.c:418 dccp_feat_entry_destructor net/dccp/feat.c:416 [inline] dccp_feat_list_pop net/dccp/feat.c:541 [inline] dccp_feat_activate_values+0x57f/0xab0 net/dccp/feat.c:1543 dccp_create_openreq_child+0x464/0x610 net/dccp/minisocks.c:121 dccp_v6_request_recv_sock+0x1f6/0x1960 net/dccp/ipv6.c:457 dccp_check_req+0x335/0x5a0 net/dccp/minisocks.c:186 dccp_v6_rcv+0x69e/0x1d00 net/dccp/ipv6.c:711 ip6_input_finish+0x46d/0x17a0 net/ipv6/ip6_input.c:279 NF_HOOK include/linux/netfilter.h:257 [inline] ip6_input+0xdb/0x590 net/ipv6/ip6_input.c:322 dst_input include/net/dst.h:507 [inline] ip6_rcv_finish+0x289/0x890 net/ipv6/ip6_input.c:69 NF_HOOK include/linux/netfilter.h:257 [inline] ipv6_rcv+0x12ec/0x23d0 net/ipv6/ip6_input.c:203 __netif_receive_skb_core+0x1ae5/0x3400 net/core/dev.c:4190 __netif_receive_skb+0x2a/0x170 net/core/dev.c:4228 process_backlog+0xe5/0x6c0 net/core/dev.c:4839 napi_poll net/core/dev.c:5202 [inline] net_rx_action+0xe70/0x1900 net/core/dev.c:5267 __do_softirq+0x2fb/0xb7d kernel/softirq.c:284 Memory state around the buggy address: ffff88003713bd00: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc ffff88003713bd80: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc >ffff88003713be00: fc fc fc fc fc fc fc fc fc fc fb fb fb fb fb fb ^ Fixes: `079096f103` ("tcp/dccp: install syn_recv requests into ehash table") Signed-off-by: Eric Dumazet <edumazet@google.com> Reported-by: Dmitry Vyukov <dvyukov@google.com> Tested-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Alexey Khoroshilov	5f79aab41d	net/sched: act_skbmod: remove unneeded rcu_read_unlock in tcf_skbmod_dump [ Upstream commit `6c4dc75c25` ] Found by Linux Driver Verification project (linuxtesting.org). Signed-off-by: Alexey Khoroshilov <khoroshilov@ispras.ru> Acked-by: Jamal Hadi Salim <jhs@mojatatu.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Eric Dumazet	f157cc1d72	net: fix socket refcounting in skb_complete_tx_timestamp() [ Upstream commit `9ac25fc063` ] TX skbs do not necessarily hold a reference on skb->sk->sk_refcnt By the time TX completion happens, sk_refcnt might be already 0. sock_hold()/sock_put() would then corrupt critical state, like sk_wmem_alloc and lead to leaks or use after free. Fixes: `62bccb8cdb` ("net-timestamp: Make the clone operation stand-alone from phy timestamping") Signed-off-by: Eric Dumazet <edumazet@google.com> Cc: Alexander Duyck <alexander.h.duyck@intel.com> Cc: Johannes Berg <johannes@sipsolutions.net> Cc: Soheil Hassas Yeganeh <soheil@google.com> Cc: Willem de Bruijn <willemb@google.com> Acked-by: Soheil Hassas Yeganeh <soheil@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Eric Dumazet	98fa3d2a8e	net: fix socket refcounting in skb_complete_wifi_ack() [ Upstream commit `dd4f10722a` ] TX skbs do not necessarily hold a reference on skb->sk->sk_refcnt By the time TX completion happens, sk_refcnt might be already 0. sock_hold()/sock_put() would then corrupt critical state, like sk_wmem_alloc. Fixes: `bf7fa551e0` ("mac80211: Resolve sk_refcnt/sk_wmem_alloc issue in wifi ack path") Signed-off-by: Eric Dumazet <edumazet@google.com> Cc: Alexander Duyck <alexander.h.duyck@intel.com> Cc: Johannes Berg <johannes@sipsolutions.net> Cc: Soheil Hassas Yeganeh <soheil@google.com> Cc: Willem de Bruijn <willemb@google.com> Acked-by: Soheil Hassas Yeganeh <soheil@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Eric Dumazet	07753bc6a2	tcp: fix various issues for sockets morphing to listen state [ Upstream commit `02b2faaf0a` ] Dmitry Vyukov reported a divide by 0 triggered by syzkaller, exploiting tcp_disconnect() path that was never really considered and/or used before syzkaller ;) I was not able to reproduce the bug, but it seems issues here are the three possible actions that assumed they would never trigger on a listener. 1) tcp_write_timer_handler 2) tcp_delack_timer_handler 3) MTU reduction Only IPv6 MTU reduction was properly testing TCP_CLOSE and TCP_LISTEN states from tcp_v6_mtu_reduced() Signed-off-by: Eric Dumazet <edumazet@google.com> Reported-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
WANG Cong	4547f03d1a	strparser: destroy workqueue on module exit [ Upstream commit `f78ef7cd9a` ] Fixes: `43a0c6751a` ("strparser: Stream parser for messages") Cc: Tom Herbert <tom@herbertland.com> Signed-off-by: Cong Wang <xiyou.wangcong@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Arnaldo Carvalho de Melo	51ae1fbcf1	dccp: Unlock sock before calling sk_free() [ Upstream commit `d5afb6f9b6` ] The code where sk_clone() came from created a new socket and locked it, but then, on the error path didn't unlock it. This problem stayed there for a long while, till `b0691c8ee7` ("net: Unlock sock before calling sk_free()") fixed it, but unfortunately the callers of sk_clone() (now sk_clone_locked()) were not audited and the one in dccp_create_openreq_child() remained. Now in the age of the syskaller fuzzer, this was finally uncovered, as reported by Dmitry: ---- 8< ---- I've got the following report while running syzkaller fuzzer on `86292b33d4` ("Merge branch 'akpm' (patches from Andrew)") [ BUG: held lock freed! ] 4.10.0+ #234 Not tainted ------------------------- syz-executor6/6898 is freeing memory ffff88006286cac0-ffff88006286d3b7, with a lock still held there! (slock-AF_INET6){+.-...}, at: [<ffffffff8362c2c9>] spin_lock include/linux/spinlock.h:299 [inline] (slock-AF_INET6){+.-...}, at: [<ffffffff8362c2c9>] sk_clone_lock+0x3d9/0x12c0 net/core/sock.c:1504 5 locks held by syz-executor6/6898: #0: (sk_lock-AF_INET6){+.+.+.}, at: [<ffffffff839a34b4>] lock_sock include/net/sock.h:1460 [inline] #0: (sk_lock-AF_INET6){+.+.+.}, at: [<ffffffff839a34b4>] inet_stream_connect+0x44/0xa0 net/ipv4/af_inet.c:681 #1: (rcu_read_lock){......}, at: [<ffffffff83bc1c2a>] inet6_csk_xmit+0x12a/0x5d0 net/ipv6/inet6_connection_sock.c:126 #2: (rcu_read_lock){......}, at: [<ffffffff8369b424>] __skb_unlink include/linux/skbuff.h:1767 [inline] #2: (rcu_read_lock){......}, at: [<ffffffff8369b424>] __skb_dequeue include/linux/skbuff.h:1783 [inline] #2: (rcu_read_lock){......}, at: [<ffffffff8369b424>] process_backlog+0x264/0x730 net/core/dev.c:4835 #3: (rcu_read_lock){......}, at: [<ffffffff83aeb5c0>] ip6_input_finish+0x0/0x1700 net/ipv6/ip6_input.c:59 #4: (slock-AF_INET6){+.-...}, at: [<ffffffff8362c2c9>] spin_lock include/linux/spinlock.h:299 [inline] #4: (slock-AF_INET6){+.-...}, at: [<ffffffff8362c2c9>] sk_clone_lock+0x3d9/0x12c0 net/core/sock.c:1504 Fix it just like was done by `b0691c8ee7` ("net: Unlock sock before calling sk_free()"). Reported-by: Dmitry Vyukov <dvyukov@google.com> Cc: Cong Wang <xiyou.wangcong@gmail.com> Cc: Eric Dumazet <edumazet@google.com> Cc: Gerrit Renker <gerrit@erg.abdn.ac.uk> Cc: Thomas Gleixner <tglx@linutronix.de> Link: http://lkml.kernel.org/r/20170301153510.GE15145@kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Eric Dumazet	0bcc319d55	ipv6: orphan skbs in reassembly unit [ Upstream commit `48cac18ecf` ] Andrey reported a use-after-free in IPv6 stack. Issue here is that we free the socket while it still has skb in TX path and in some queues. It happens here because IPv6 reassembly unit messes skb->truesize, breaking skb_set_owner_w() badly. We fixed a similar issue for IPV4 in commit `8282f27449` ("inet: frag: Always orphan skbs inside ip_defrag()") Acked-by: Joe Stringer <joe@ovn.org> ================================================================== BUG: KASAN: use-after-free in sock_wfree+0x118/0x120 Read of size 8 at addr ffff880062da0060 by task a.out/4140 page:ffffea00018b6800 count:1 mapcount:0 mapping: (null) index:0x0 compound_mapcount: 0 flags: 0x100000000008100(slab\|head) raw: 0100000000008100 0000000000000000 0000000000000000 0000000180130013 raw: dead000000000100 dead000000000200 ffff88006741f140 0000000000000000 page dumped because: kasan: bad access detected CPU: 0 PID: 4140 Comm: a.out Not tainted 4.10.0-rc3+ #59 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011 Call Trace: __dump_stack lib/dump_stack.c:15 dump_stack+0x292/0x398 lib/dump_stack.c:51 describe_address mm/kasan/report.c:262 kasan_report_error+0x121/0x560 mm/kasan/report.c:370 kasan_report mm/kasan/report.c:392 __asan_report_load8_noabort+0x3e/0x40 mm/kasan/report.c:413 sock_flag ./arch/x86/include/asm/bitops.h:324 sock_wfree+0x118/0x120 net/core/sock.c:1631 skb_release_head_state+0xfc/0x250 net/core/skbuff.c:655 skb_release_all+0x15/0x60 net/core/skbuff.c:668 __kfree_skb+0x15/0x20 net/core/skbuff.c:684 kfree_skb+0x16e/0x4e0 net/core/skbuff.c:705 inet_frag_destroy+0x121/0x290 net/ipv4/inet_fragment.c:304 inet_frag_put ./include/net/inet_frag.h:133 nf_ct_frag6_gather+0x1125/0x38b0 net/ipv6/netfilter/nf_conntrack_reasm.c:617 ipv6_defrag+0x21b/0x350 net/ipv6/netfilter/nf_defrag_ipv6_hooks.c:68 nf_hook_entry_hookfn ./include/linux/netfilter.h:102 nf_hook_slow+0xc3/0x290 net/netfilter/core.c:310 nf_hook ./include/linux/netfilter.h:212 __ip6_local_out+0x52c/0xaf0 net/ipv6/output_core.c:160 ip6_local_out+0x2d/0x170 net/ipv6/output_core.c:170 ip6_send_skb+0xa1/0x340 net/ipv6/ip6_output.c:1722 ip6_push_pending_frames+0xb3/0xe0 net/ipv6/ip6_output.c:1742 rawv6_push_pending_frames net/ipv6/raw.c:613 rawv6_sendmsg+0x2cff/0x4130 net/ipv6/raw.c:927 inet_sendmsg+0x164/0x5b0 net/ipv4/af_inet.c:744 sock_sendmsg_nosec net/socket.c:635 sock_sendmsg+0xca/0x110 net/socket.c:645 sock_write_iter+0x326/0x620 net/socket.c:848 new_sync_write fs/read_write.c:499 __vfs_write+0x483/0x760 fs/read_write.c:512 vfs_write+0x187/0x530 fs/read_write.c:560 SYSC_write fs/read_write.c:607 SyS_write+0xfb/0x230 fs/read_write.c:599 entry_SYSCALL_64_fastpath+0x1f/0xc2 arch/x86/entry/entry_64.S:203 RIP: 0033:0x7ff26e6f5b79 RSP: 002b:00007ff268e0ed98 EFLAGS: 00000206 ORIG_RAX: 0000000000000001 RAX: ffffffffffffffda RBX: 00007ff268e0f9c0 RCX: 00007ff26e6f5b79 RDX: 0000000000000010 RSI: 0000000020f50fe1 RDI: 0000000000000003 RBP: 00007ff26ebc1220 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000206 R12: 0000000000000000 R13: 00007ff268e0f9c0 R14: 00007ff26efec040 R15: 0000000000000003 The buggy address belongs to the object at ffff880062da0000 which belongs to the cache RAWv6 of size 1504 The buggy address ffff880062da0060 is located 96 bytes inside of 1504-byte region [ffff880062da0000, ffff880062da05e0) Freed by task 4113: save_stack_trace+0x16/0x20 arch/x86/kernel/stacktrace.c:57 save_stack+0x43/0xd0 mm/kasan/kasan.c:502 set_track mm/kasan/kasan.c:514 kasan_slab_free+0x73/0xc0 mm/kasan/kasan.c:578 slab_free_hook mm/slub.c:1352 slab_free_freelist_hook mm/slub.c:1374 slab_free mm/slub.c:2951 kmem_cache_free+0xb2/0x2c0 mm/slub.c:2973 sk_prot_free net/core/sock.c:1377 __sk_destruct+0x49c/0x6e0 net/core/sock.c:1452 sk_destruct+0x47/0x80 net/core/sock.c:1460 __sk_free+0x57/0x230 net/core/sock.c:1468 sk_free+0x23/0x30 net/core/sock.c:1479 sock_put ./include/net/sock.h:1638 sk_common_release+0x31e/0x4e0 net/core/sock.c:2782 rawv6_close+0x54/0x80 net/ipv6/raw.c:1214 inet_release+0xed/0x1c0 net/ipv4/af_inet.c:425 inet6_release+0x50/0x70 net/ipv6/af_inet6.c:431 sock_release+0x8d/0x1e0 net/socket.c:599 sock_close+0x16/0x20 net/socket.c:1063 __fput+0x332/0x7f0 fs/file_table.c:208 ____fput+0x15/0x20 fs/file_table.c:244 task_work_run+0x19b/0x270 kernel/task_work.c:116 exit_task_work ./include/linux/task_work.h:21 do_exit+0x186b/0x2800 kernel/exit.c:839 do_group_exit+0x149/0x420 kernel/exit.c:943 SYSC_exit_group kernel/exit.c:954 SyS_exit_group+0x1d/0x20 kernel/exit.c:952 entry_SYSCALL_64_fastpath+0x1f/0xc2 arch/x86/entry/entry_64.S:203 Allocated by task 4115: save_stack_trace+0x16/0x20 arch/x86/kernel/stacktrace.c:57 save_stack+0x43/0xd0 mm/kasan/kasan.c:502 set_track mm/kasan/kasan.c:514 kasan_kmalloc+0xad/0xe0 mm/kasan/kasan.c:605 kasan_slab_alloc+0x12/0x20 mm/kasan/kasan.c:544 slab_post_alloc_hook mm/slab.h:432 slab_alloc_node mm/slub.c:2708 slab_alloc mm/slub.c:2716 kmem_cache_alloc+0x1af/0x250 mm/slub.c:2721 sk_prot_alloc+0x65/0x2a0 net/core/sock.c:1334 sk_alloc+0x105/0x1010 net/core/sock.c:1396 inet6_create+0x44d/0x1150 net/ipv6/af_inet6.c:183 __sock_create+0x4f6/0x880 net/socket.c:1199 sock_create net/socket.c:1239 SYSC_socket net/socket.c:1269 SyS_socket+0xf9/0x230 net/socket.c:1249 entry_SYSCALL_64_fastpath+0x1f/0xc2 arch/x86/entry/entry_64.S:203 Memory state around the buggy address: ffff880062d9ff00: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc ffff880062d9ff80: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc >ffff880062da0000: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb ^ ffff880062da0080: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb ffff880062da0100: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb ================================================================== Reported-by: Andrey Konovalov <andreyknvl@google.com> Signed-off-by: Eric Dumazet <edumazet@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:33 +01:00
Eric Dumazet	3d87dce3df	net: net_enable_timestamp() can be called from irq contexts [ Upstream commit `13baa00ad0` ] It is now very clear that silly TCP listeners might play with enabling/disabling timestamping while new children are added to their accept queue. Meaning net_enable_timestamp() can be called from BH context while current state of the static key is not enabled. Lets play safe and allow all contexts. The work queue is scheduled only under the problematic cases, which are the static key enable/disable transition, to not slow down critical paths. This extends and improves what we did in commit `5fa8bbda38` ("net: use a work queue to defer net_disable_timestamp() work") Fixes: `b90e5794c5` ("net: dont call jump_label_dec from irq context") Signed-off-by: Eric Dumazet <edumazet@google.com> Reported-by: Dmitry Vyukov <dvyukov@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Alexander Potapenko	62fe0521fb	net: don't call strlen() on the user buffer in packet_bind_spkt() [ Upstream commit `540e2894f7` ] KMSAN (KernelMemorySanitizer, a new error detection tool) reports use of uninitialized memory in packet_bind_spkt(): Acked-by: Eric Dumazet <edumazet@google.com> ================================================================== BUG: KMSAN: use of unitialized memory CPU: 0 PID: 1074 Comm: packet Not tainted 4.8.0-rc6+ #1891 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011 0000000000000000 ffff88006b6dfc08 ffffffff82559ae8 ffff88006b6dfb48 ffffffff818a7c91 ffffffff85b9c870 0000000000000092 ffffffff85b9c550 0000000000000000 0000000000000092 00000000ec400911 0000000000000002 Call Trace: [< inline >] __dump_stack lib/dump_stack.c:15 [<ffffffff82559ae8>] dump_stack+0x238/0x290 lib/dump_stack.c:51 [<ffffffff818a6626>] kmsan_report+0x276/0x2e0 mm/kmsan/kmsan.c:1003 [<ffffffff818a783b>] __msan_warning+0x5b/0xb0 mm/kmsan/kmsan_instr.c:424 [< inline >] strlen lib/string.c:484 [<ffffffff8259b58d>] strlcpy+0x9d/0x200 lib/string.c:144 [<ffffffff84b2eca4>] packet_bind_spkt+0x144/0x230 net/packet/af_packet.c:3132 [<ffffffff84242e4d>] SYSC_bind+0x40d/0x5f0 net/socket.c:1370 [<ffffffff84242a22>] SyS_bind+0x82/0xa0 net/socket.c:1356 [<ffffffff8515991b>] entry_SYSCALL_64_fastpath+0x13/0x8f arch/x86/entry/entry_64.o:? chained origin: 00000000eba00911 [<ffffffff810bb787>] save_stack_trace+0x27/0x50 arch/x86/kernel/stacktrace.c:67 [< inline >] kmsan_save_stack_with_flags mm/kmsan/kmsan.c:322 [< inline >] kmsan_save_stack mm/kmsan/kmsan.c:334 [<ffffffff818a59f8>] kmsan_internal_chain_origin+0x118/0x1e0 mm/kmsan/kmsan.c:527 [<ffffffff818a7773>] __msan_set_alloca_origin4+0xc3/0x130 mm/kmsan/kmsan_instr.c:380 [<ffffffff84242b69>] SYSC_bind+0x129/0x5f0 net/socket.c:1356 [<ffffffff84242a22>] SyS_bind+0x82/0xa0 net/socket.c:1356 [<ffffffff8515991b>] entry_SYSCALL_64_fastpath+0x13/0x8f arch/x86/entry/entry_64.o:? origin description: ----address@SYSC_bind (origin=00000000eb400911) ================================================================== (the line numbers are relative to 4.8-rc6, but the bug persists upstream) , when I run the following program as root: ===================================== #include <string.h> #include <sys/socket.h> #include <netpacket/packet.h> #include <net/ethernet.h> int main() { struct sockaddr addr; memset(&addr, 0xff, sizeof(addr)); addr.sa_family = AF_PACKET; int fd = socket(PF_PACKET, SOCK_PACKET, htons(ETH_P_ALL)); bind(fd, &addr, sizeof(addr)); return 0; } ===================================== This happens because addr.sa_data copied from the userspace is not zero-terminated, and copying it with strlcpy() in packet_bind_spkt() results in calling strlen() on the kernel copy of that non-terminated buffer. Signed-off-by: Alexander Potapenko <glider@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Mike Manning	fa7c48fb3a	net: bridge: allow IPv6 when multicast flood is disabled [ Upstream commit `8953de2f02` ] Even with multicast flooding turned off, IPv6 ND should still work so that IPv6 connectivity is provided. Allow this by continuing to flood multicast traffic originated by us. Fixes: `b6cb5ac833` ("net: bridge: add per-port multicast flood flag") Cc: Nikolay Aleksandrov <nikolay@cumulusnetworks.com> Signed-off-by: Mike Manning <mmanning@brocade.com> Acked-by: Nikolay Aleksandrov <nikolay@cumulusnetworks.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Eric Dumazet	bbaeb9b73f	tcp/dccp: block BH for SYN processing [ Upstream commit `449809a66c` ] SYN processing really was meant to be handled from BH. When I got rid of BH blocking while processing socket backlog in commit `5413d1babe` ("net: do not block BH while processing socket backlog"), I forgot that a malicious user could transition to TCP_LISTEN from a state that allowed (SYN) packets to be parked in the socket backlog while socket is owned by the thread doing the listen() call. Sure enough syzkaller found this and reported the bug ;) ================================= [ INFO: inconsistent lock state ] 4.10.0+ #60 Not tainted --------------------------------- inconsistent {IN-SOFTIRQ-W} -> {SOFTIRQ-ON-W} usage. syz-executor0/5090 [HC0[0]:SC0[0]:HE1:SE1] takes: (&(&hashinfo->ehash_locks[i])->rlock){+.?...}, at: [<ffffffff83a6a370>] spin_lock include/linux/spinlock.h:299 [inline] (&(&hashinfo->ehash_locks[i])->rlock){+.?...}, at: [<ffffffff83a6a370>] inet_ehash_insert+0x240/0xad0 net/ipv4/inet_hashtables.c:407 {IN-SOFTIRQ-W} state was registered at: mark_irqflags kernel/locking/lockdep.c:2923 [inline] __lock_acquire+0xbcf/0x3270 kernel/locking/lockdep.c:3295 lock_acquire+0x241/0x580 kernel/locking/lockdep.c:3753 __raw_spin_lock include/linux/spinlock_api_smp.h:142 [inline] _raw_spin_lock+0x33/0x50 kernel/locking/spinlock.c:151 spin_lock include/linux/spinlock.h:299 [inline] inet_ehash_insert+0x240/0xad0 net/ipv4/inet_hashtables.c:407 reqsk_queue_hash_req net/ipv4/inet_connection_sock.c:753 [inline] inet_csk_reqsk_queue_hash_add+0x1b7/0x2a0 net/ipv4/inet_connection_sock.c:764 tcp_conn_request+0x25cc/0x3310 net/ipv4/tcp_input.c:6399 tcp_v4_conn_request+0x157/0x220 net/ipv4/tcp_ipv4.c:1262 tcp_rcv_state_process+0x802/0x4130 net/ipv4/tcp_input.c:5889 tcp_v4_do_rcv+0x56b/0x940 net/ipv4/tcp_ipv4.c:1433 tcp_v4_rcv+0x2e12/0x3210 net/ipv4/tcp_ipv4.c:1711 ip_local_deliver_finish+0x4ce/0xc40 net/ipv4/ip_input.c:216 NF_HOOK include/linux/netfilter.h:257 [inline] ip_local_deliver+0x1ce/0x710 net/ipv4/ip_input.c:257 dst_input include/net/dst.h:492 [inline] ip_rcv_finish+0xb1d/0x2110 net/ipv4/ip_input.c:396 NF_HOOK include/linux/netfilter.h:257 [inline] ip_rcv+0xd90/0x19c0 net/ipv4/ip_input.c:487 __netif_receive_skb_core+0x1ad1/0x3400 net/core/dev.c:4179 __netif_receive_skb+0x2a/0x170 net/core/dev.c:4217 netif_receive_skb_internal+0x1d6/0x430 net/core/dev.c:4245 napi_skb_finish net/core/dev.c:4602 [inline] napi_gro_receive+0x4e6/0x680 net/core/dev.c:4636 e1000_receive_skb drivers/net/ethernet/intel/e1000/e1000_main.c:4033 [inline] e1000_clean_rx_irq+0x5e0/0x1490 drivers/net/ethernet/intel/e1000/e1000_main.c:4489 e1000_clean+0xb9a/0x2910 drivers/net/ethernet/intel/e1000/e1000_main.c:3834 napi_poll net/core/dev.c:5171 [inline] net_rx_action+0xe70/0x1900 net/core/dev.c:5236 __do_softirq+0x2fb/0xb7d kernel/softirq.c:284 invoke_softirq kernel/softirq.c:364 [inline] irq_exit+0x19e/0x1d0 kernel/softirq.c:405 exiting_irq arch/x86/include/asm/apic.h:658 [inline] do_IRQ+0x81/0x1a0 arch/x86/kernel/irq.c:250 ret_from_intr+0x0/0x20 native_safe_halt+0x6/0x10 arch/x86/include/asm/irqflags.h:53 arch_safe_halt arch/x86/include/asm/paravirt.h:98 [inline] default_idle+0x8f/0x410 arch/x86/kernel/process.c:271 arch_cpu_idle+0xa/0x10 arch/x86/kernel/process.c:262 default_idle_call+0x36/0x60 kernel/sched/idle.c:96 cpuidle_idle_call kernel/sched/idle.c:154 [inline] do_idle+0x348/0x440 kernel/sched/idle.c:243 cpu_startup_entry+0x18/0x20 kernel/sched/idle.c:345 start_secondary+0x344/0x440 arch/x86/kernel/smpboot.c:272 verify_cpu+0x0/0xfc irq event stamp: 1741 hardirqs last enabled at (1741): [<ffffffff84d49d77>] __raw_spin_unlock_irqrestore include/linux/spinlock_api_smp.h:160 [inline] hardirqs last enabled at (1741): [<ffffffff84d49d77>] _raw_spin_unlock_irqrestore+0xf7/0x1a0 kernel/locking/spinlock.c:191 hardirqs last disabled at (1740): [<ffffffff84d4a732>] __raw_spin_lock_irqsave include/linux/spinlock_api_smp.h:108 [inline] hardirqs last disabled at (1740): [<ffffffff84d4a732>] _raw_spin_lock_irqsave+0xa2/0x110 kernel/locking/spinlock.c:159 softirqs last enabled at (1738): [<ffffffff84d4deff>] __do_softirq+0x7cf/0xb7d kernel/softirq.c:310 softirqs last disabled at (1571): [<ffffffff84d4b92c>] do_softirq_own_stack+0x1c/0x30 arch/x86/entry/entry_64.S:902 other info that might help us debug this: Possible unsafe locking scenario: CPU0 ---- lock(&(&hashinfo->ehash_locks[i])->rlock); <Interrupt> lock(&(&hashinfo->ehash_locks[i])->rlock); * DEADLOCK * 1 lock held by syz-executor0/5090: #0: (sk_lock-AF_INET6){+.+.+.}, at: [<ffffffff83406b43>] lock_sock include/net/sock.h:1460 [inline] #0: (sk_lock-AF_INET6){+.+.+.}, at: [<ffffffff83406b43>] sock_setsockopt+0x233/0x1e40 net/core/sock.c:683 stack backtrace: CPU: 1 PID: 5090 Comm: syz-executor0 Not tainted 4.10.0+ #60 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Bochs 01/01/2011 Call Trace: __dump_stack lib/dump_stack.c:15 [inline] dump_stack+0x292/0x398 lib/dump_stack.c:51 print_usage_bug+0x3ef/0x450 kernel/locking/lockdep.c:2387 valid_state kernel/locking/lockdep.c:2400 [inline] mark_lock_irq kernel/locking/lockdep.c:2602 [inline] mark_lock+0xf30/0x1410 kernel/locking/lockdep.c:3065 mark_irqflags kernel/locking/lockdep.c:2941 [inline] __lock_acquire+0x6dc/0x3270 kernel/locking/lockdep.c:3295 lock_acquire+0x241/0x580 kernel/locking/lockdep.c:3753 __raw_spin_lock include/linux/spinlock_api_smp.h:142 [inline] _raw_spin_lock+0x33/0x50 kernel/locking/spinlock.c:151 spin_lock include/linux/spinlock.h:299 [inline] inet_ehash_insert+0x240/0xad0 net/ipv4/inet_hashtables.c:407 reqsk_queue_hash_req net/ipv4/inet_connection_sock.c:753 [inline] inet_csk_reqsk_queue_hash_add+0x1b7/0x2a0 net/ipv4/inet_connection_sock.c:764 dccp_v6_conn_request+0xada/0x11b0 net/dccp/ipv6.c:380 dccp_rcv_state_process+0x51e/0x1660 net/dccp/input.c:606 dccp_v6_do_rcv+0x213/0x350 net/dccp/ipv6.c:632 sk_backlog_rcv include/net/sock.h:896 [inline] __release_sock+0x127/0x3a0 net/core/sock.c:2052 release_sock+0xa5/0x2b0 net/core/sock.c:2539 sock_setsockopt+0x60f/0x1e40 net/core/sock.c:1016 SYSC_setsockopt net/socket.c:1782 [inline] SyS_setsockopt+0x2fb/0x3a0 net/socket.c:1765 entry_SYSCALL_64_fastpath+0x1f/0xc2 RIP: 0033:0x4458b9 RSP: 002b:00007fe8b26c2b58 EFLAGS: 00000292 ORIG_RAX: 0000000000000036 RAX: ffffffffffffffda RBX: 0000000000000006 RCX: 00000000004458b9 RDX: 000000000000001a RSI: 0000000000000001 RDI: 0000000000000006 RBP: 00000000006e2110 R08: 0000000000000010 R09: 0000000000000000 R10: 00000000208c3000 R11: 0000000000000292 R12: 0000000000708000 R13: 0000000020000000 R14: 0000000000001000 R15: 0000000000000000 Fixes: `5413d1babe` ("net: do not block BH while processing socket backlog") Signed-off-by: Eric Dumazet <edumazet@google.com> Reported-by: Andrey Konovalov <andreyknvl@google.com> Acked-by: Soheil Hassas Yeganeh <soheil@google.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Ido Schimmel	8f4db60c7f	mlxsw: spectrum_router: Avoid potential packets loss [ Upstream commit `f7df4923fa` ] When the structure of the LPM tree changes (f.e., due to the addition of a new prefix), we unbind the old tree and then bind the new one. This may result in temporary packet loss. Instead, overwrite the old binding with the new one. Fixes: `6b75c4807d` ("mlxsw: spectrum_router: Add virtual router management") Signed-off-by: Ido Schimmel <idosch@mellanox.com> Signed-off-by: Jiri Pirko <jiri@mellanox.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Jakub Kicinski	02595f4725	geneve: lock RCU on TX path [ Upstream commit `a717e3f740` ] There is no guarantees that callers of the TX path will hold the RCU lock. Grab it explicitly. Fixes: `fceb9c3e38` ("geneve: avoid using stale geneve socket.") Signed-off-by: Jakub Kicinski <jakub.kicinski@netronome.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Jakub Kicinski	0a40da4a74	vxlan: lock RCU on TX path [ Upstream commit `56de859e99` ] There is no guarantees that callers of the TX path will hold the RCU lock. Grab it explicitly. Fixes: `c6fcc4fc5f` ("vxlan: avoid using stale vxlan socket.") Signed-off-by: Jakub Kicinski <jakub.kicinski@netronome.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Florian Fainelli	feaa5bab43	net: phy: Avoid deadlock during phy_error() [ Upstream commit `eab127717a` ] phy_error() is called in the PHY state machine workqueue context, and calls phy_trigger_machine() which does a cancel_delayed_work_sync() of the workqueue we execute from, causing a deadlock situation. Augment phy_trigger_machine() machine with a sync boolean indicating whether we should use cancel__sync() or just cancel__work(). Fixes: `3c293f4e08` ("net: phy: Trigger state machine on state change and not polling.") Reported-by: Russell King <rmk+kernel@armlinux.org.uk> Signed-off-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Paul Hüber	837786cbbb	l2tp: avoid use-after-free caused by l2tp_ip_backlog_recv [ Upstream commit `51fb60eb16` ] l2tp_ip_backlog_recv may not return -1 if the packet gets dropped. The return value is passed up to ip_local_deliver_finish, which treats negative values as an IP protocol number for resubmission. Signed-off-by: Paul Hüber <phueber@kernsp.in> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Roman Mashak	063893e4ec	net sched actions: decrement module reference count after table flush. [ Upstream commit `edb9d1bff4` ] When tc actions are loaded as a module and no actions have been installed, flushing them would result in actions removed from the memory, but modules reference count not being decremented, so that the modules would not be unloaded. Following is example with GACT action: % sudo modprobe act_gact % lsmod Module Size Used by act_gact 16384 0 % % sudo tc actions ls action gact % % sudo tc actions flush action gact % lsmod Module Size Used by act_gact 16384 1 % sudo tc actions flush action gact % lsmod Module Size Used by act_gact 16384 2 % sudo rmmod act_gact rmmod: ERROR: Module act_gact is in use .... After the fix: % lsmod Module Size Used by act_gact 16384 0 % % sudo tc actions add action pass index 1 % sudo tc actions add action pass index 2 % sudo tc actions add action pass index 3 % lsmod Module Size Used by act_gact 16384 3 % % sudo tc actions flush action gact % lsmod Module Size Used by act_gact 16384 0 % % sudo tc actions flush action gact % lsmod Module Size Used by act_gact 16384 0 % sudo rmmod act_gact % lsmod Module Size Used by % Fixes: `f97017cdef` ("net-sched: Fix actions flushing") Signed-off-by: Roman Mashak <mrv@mojatatu.com> Signed-off-by: Jamal Hadi Salim <jhs@mojatatu.com> Acked-by: Cong Wang <xiyou.wangcong@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Julian Anastasov	36931eb015	ipv4: mask tos for input route [ Upstream commit `6e28099d38` ] Restore the lost masking of TOS in input route code to allow ip rules to match it properly. Problem [1] noticed by Shmulik Ladkani <shmulik.ladkani@gmail.com> [1] http://marc.info/?t=137331755300040&r=1&w=2 Fixes: `89aef8921b` ("ipv4: Delete routing cache.") Signed-off-by: Julian Anastasov <ja@ssi.bg> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:32 +01:00
Brian Russell	a64407fafe	vxlan: don't allow overwrite of config src addr [ Upstream commit `1158632b5a` ] When using IPv6 transport and a default dst, a pointer to the configured source address is passed into the route lookup. If no source address is configured, then the value is overwritten. IPv6 route lookup ignores egress ifindex match if the source address is set, so if egress ifindex match is desired, the source address must be passed as any. The overwrite breaks this for subsequent lookups. Avoid this by copying the configured address to an existing stack variable and pass a pointer to that instead. Fixes: `272d96a5ab` ("net: vxlan: lwt: Use source ip address during route lookup.") Signed-off-by: Brian Russell <brussell@brocade.com> Acked-by: Jiri Benc <jbenc@redhat.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:31 +01:00
David Forster	f7081057d1	vti6: return GRE_KEY for vti6 [ Upstream commit `7dcdf941cd` ] Align vti6 with vti by returning GRE_KEY flag. This enables iproute2 to display tunnel keys on "ip -6 tunnel show" Signed-off-by: David Forster <dforster@brocade.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:31 +01:00
Matthias Schiffer	ee2da79de2	vxlan: correctly validate VXLAN ID against VXLAN_N_VID [ Upstream commit `4e37d6911f` ] The incorrect check caused an off-by-one error: the maximum VID 0xffffff was unusable. Fixes: `d342894c5d` ("vxlan: virtual extensible lan") Signed-off-by: Matthias Schiffer <mschiffer@universe-factory.net> Acked-by: Jiri Benc <jbenc@redhat.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:31 +01:00
Tariq Toukan	0c6e38e791	net/mlx5e: Fix wrong CQE decompression [ Upstream commit `36154be40a` ] In cqe compression with striding RQ, the decompression of the CQE field wqe_counter was done with a wrong wraparound value. This caused handling cqes with a wrong pointer to wqe (rx descriptor) and creating SKBs with wrong data, pointing to wrong (and already consumed) strides/pages. The meaning of the CQE field wqe_counter in striding RQ holds the stride index instead of the WQE index. Hence, when decompressing a CQE, wqe_counter should have wrapped-around the number of strides in a single multi-packet WQE. We dropped this wrap-around mask at all in CQE decompression of striding RQ. It is not needed as in such cases the CQE compression session would break because of different value of wqe_id field, starting a new compression session. Tested: ethtool -K ethxx lro off/on ethtool --set-priv-flags ethxx rx_cqe_compress on super_netperf 16 {ipv4,ipv6} -t TCP_STREAM -m 50 -D verified no csum errors and no page refcount issues. Fixes: `7219ab34f1` ("net/mlx5e: CQE compression") Signed-off-by: Tariq Toukan <tariqt@mellanox.com> Reported-by: Tom Herbert <tom@herbertland.com> Cc: kernel-team@fb.com Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:31 +01:00
Tariq Toukan	a633266469	net/mlx5e: Do not reduce LRO WQE size when not using build_skb [ Upstream commit `4078e637c1` ] When rq_type is Striding RQ, no room of SKB_RESERVE is needed as SKB allocation is not done via build_skb. Fixes: `e4b8550807` ("net/mlx5e: Slightly reduce hardware LRO size") Signed-off-by: Tariq Toukan <tariqt@mellanox.com> Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:31 +01:00
Saeed Mahameed	9e354abf9c	net/mlx5e: Register/unregister vport representors on interface attach/detach [ Upstream commit `6f08a22c5f` ] Currently vport representors are added only on driver load and removed on driver unload. Apparently we forgot to handle them when we added the seamless reset flow feature. This caused to leave the representors netdevs alive and active with open HW resources on pci shutdown and on error reset flows. To overcome this we move their handling to interface attach/detach, so they would be cleaned up on shutdown and recreated on reset flows. Fixes: `26e59d8077` ("net/mlx5e: Implement mlx5e interface attach/detach callbacks") Signed-off-by: Saeed Mahameed <saeedm@mellanox.com> Reviewed-by: Hadar Hen Zion <hadarh@mellanox.com> Reviewed-by: Roi Dayan <roid@mellanox.com> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-22 12:43:31 +01:00
Greg Kroah-Hartman	8a16224b40	Linux 4.9.16	2017-03-18 19:15:30 +08:00
Kamal Heib	c818669952	IB/mlx5: Verify that Q counters are supported commit `45bded2c21` upstream. Make sure that the Q counters are supported by the FW before trying to allocate/deallocte them, this will avoid driver load failure when they aren't supported by the FW. Fixes: `0837e86a7a` ('IB/mlx5: Add per port counters') Signed-off-by: Kamal Heib <kamalh@mellanox.com> Reviewed-by: Mark Bloch <markb@mellanox.com> Signed-off-by: Leon Romanovsky <leon@kernel.org> Signed-off-by: Doug Ledford <dledford@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:35 +08:00
Theodore Ts'o	8fed8fc188	ext4: don't BUG when truncating encrypted inodes on the orphan list commit `0d06863f90` upstream. Fix a BUG when the kernel tries to mount a file system constructed as follows: echo foo > foo.txt mke2fs -Fq -t ext4 -O encrypt foo.img 100 debugfs -w foo.img << EOF write foo.txt a set_inode_field a i_flags 0x80800 set_super_value s_last_orphan 12 quit EOF root@kvm-xfstests:~# mount -o loop foo.img /mnt [ 160.238770] ------------[ cut here ]------------ [ 160.240106] kernel BUG at /usr/projects/linux/ext4/fs/ext4/inode.c:3874! [ 160.240106] invalid opcode: 0000 [#1] SMP [ 160.240106] Modules linked in: [ 160.240106] CPU: 0 PID: 2547 Comm: mount Tainted: G W 4.10.0-rc3-00034-gcdd33b941b67 #227 [ 160.240106] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.10.1-1 04/01/2014 [ 160.240106] task: f4518000 task.stack: f47b6000 [ 160.240106] EIP: ext4_block_zero_page_range+0x1a7/0x2b4 [ 160.240106] EFLAGS: 00010246 CPU: 0 [ 160.240106] EAX: 00000001 EBX: f7be4b50 ECX: f47b7dc0 EDX: 00000007 [ 160.240106] ESI: f43b05a8 EDI: f43babec EBP: f47b7dd0 ESP: f47b7dac [ 160.240106] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 [ 160.240106] CR0: 80050033 CR2: bfd85b08 CR3: 34a00680 CR4: 000006f0 [ 160.240106] Call Trace: [ 160.240106] ext4_truncate+0x1e9/0x3e5 [ 160.240106] ext4_fill_super+0x286f/0x2b1e [ 160.240106] ? set_blocksize+0x2e/0x7e [ 160.240106] mount_bdev+0x114/0x15f [ 160.240106] ext4_mount+0x15/0x17 [ 160.240106] ? ext4_calculate_overhead+0x39d/0x39d [ 160.240106] mount_fs+0x58/0x115 [ 160.240106] vfs_kern_mount+0x4b/0xae [ 160.240106] do_mount+0x671/0x8c3 [ 160.240106] ? _copy_from_user+0x70/0x83 [ 160.240106] ? strndup_user+0x31/0x46 [ 160.240106] SyS_mount+0x57/0x7b [ 160.240106] do_int80_syscall_32+0x4f/0x61 [ 160.240106] entry_INT80_32+0x2f/0x2f [ 160.240106] EIP: 0xb76b919e [ 160.240106] EFLAGS: 00000246 CPU: 0 [ 160.240106] EAX: ffffffda EBX: 08053838 ECX: 08052188 EDX: 080537e8 [ 160.240106] ESI: c0ed0000 EDI: 00000000 EBP: 080537e8 ESP: bfa13660 [ 160.240106] DS: 007b ES: 007b FS: 0000 GS: 0033 SS: 007b [ 160.240106] Code: 59 8b 00 a8 01 0f 84 09 01 00 00 8b 07 66 25 00 f0 66 3d 00 80 75 61 89 f8 e8 3e e2 ff ff 84 c0 74 56 83 bf 48 02 00 00 00 75 02 <0f> 0b 81 7d e8 00 10 00 00 74 02 0f 0b 8b 43 04 8b 53 08 31 c9 [ 160.240106] EIP: ext4_block_zero_page_range+0x1a7/0x2b4 SS:ESP: 0068:f47b7dac [ 160.317241] ---[ end trace d6a773a375c810a5 ]--- The problem is that when the kernel tries to truncate an inode in ext4_truncate(), it tries to clear any on-disk data beyond i_size. Without the encryption key, it can't do that, and so it triggers a BUG. E2fsck does not provide this service, and in practice most file systems have their orphan list processed by e2fsck, so to avoid crashing, this patch skips this step if we don't have access to the encryption key (which is the case when processing the orphan list; in all other cases, we will have the encryption key, or the kernel wouldn't have allowed the file to be opened). An open question is whether the fact that e2fsck isn't clearing the bytes beyond i_size causing problems --- and if we've lived with it not doing it for so long, can we drop this from the kernel replay of the orphan list in all cases (not just when we don't have the key for encrypted inodes). Addresses-Google-Bug: #35209576 Signed-off-by: Theodore Ts'o <tytso@mit.edu> Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:34 +08:00
Sean Young	39df5977fd	rc: raw decoder for keymap protocol is not loaded on register commit `413808685d` upstream. When the protocol is set via the sysfs protocols attribute, the decoder is loaded. However, when it is not when a device is first plugged in or registered. Fixes: `acc1c3c` ("[media] media: rc: load decoder modules on-demand") Signed-off-by: Sean Young <sean@mess.org> Signed-off-by: Mauro Carvalho Chehab <mchehab@s-opensource.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:34 +08:00
Mikulas Patocka	21582cd0b6	dm: flush queued bios when process blocks to avoid deadlock commit `d67a5f4b59` upstream. Commit `df2cb6daa4` ("block: Avoid deadlocks with bio allocation by stacking drivers") created a workqueue for every bio set and code in bio_alloc_bioset() that tries to resolve some low-memory deadlocks by redirecting bios queued on current->bio_list to the workqueue if the system is low on memory. However other deadlocks (see below ) may happen, without any low memory condition, because generic_make_request is queuing bios to current->bio_list (rather than submitting them). the related dm-snapshot deadlock is detailed here: https://www.redhat.com/archives/dm-devel/2016-July/msg00065.html Fix this deadlock by redirecting any bios on current->bio_list to the bio_set's rescue workqueue on every schedule() call. Consequently, when the process blocks on a mutex, the bios queued on current->bio_list are dispatched to independent workqueus and they can complete without waiting for the mutex to be available. The structure blk_plug contains an entry cb_list and this list can contain arbitrary callback functions that are called when the process blocks. To implement this fix DM (ab)uses the onstack plug's cb_list interface to get its flush_current_bio_list() called at schedule() time. This fixes the snapshot deadlock - if the map method blocks, flush_current_bio_list() will be called and it redirects bios waiting on current->bio_list to appropriate workqueues. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1267650 Depends-on: `df2cb6daa4` ("block: Avoid deadlocks with bio allocation by stacking drivers") Signed-off-by: Mikulas Patocka <mpatocka@redhat.com> Signed-off-by: Mike Snitzer <snitzer@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:34 +08:00
Jintack Lim	d29e6215e5	KVM: arm/arm64: Let vcpu thread modify its own active state commit `370a0ec181` upstream. Currently, if a vcpu thread tries to change the active state of an interrupt which is already on the same vcpu's AP list, it will loop forever. Since the VGIC mmio handler is called after a vcpu has already synced back the LR state to the struct vgic_irq, we can just let it proceed safely. Reviewed-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Jintack Lim <jintack@cs.columbia.edu> Signed-off-by: Christoffer Dall <cdall@linaro.org> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:34 +08:00
Janosch Frank	9d89c20f3b	KVM: s390: Fix guest migration for huge guests resulting in panic commit `2e4d88009f` upstream. While we can technically not run huge page guests right now, we can setup a guest with huge pages. Trying to migrate it will trigger a VM_BUG_ON and, if the kernel is not configured to panic on a BUG, it will happily try to work on non-existing page table entries. With this patch, we always return "dirty" if we encounter a large page when migrating. This at least fixes the immediate problem until we have proper handling for both kind of pages. Fixes: `15f36eb` ("KVM: s390: Add proper dirty bitmap support to S390 kvm.") Signed-off-by: Janosch Frank <frankja@linux.vnet.ibm.com> Acked-by: Christian Borntraeger <borntraeger@de.ibm.com> Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:34 +08:00
Krzysztof Kozlowski	ce8ab5f168	serial: samsung: Continue to work if DMA request fails commit `f98c7bce57` upstream. If DMA is not available (even when configured in DeviceTree), the driver will fail the startup procedure thus making serial console not available. For example this causes boot failure on QEMU ARMv7 (Exynos4210, SMDKC210): [ 1.302575] OF: amba_device_add() failed (-19) for /amba/pdma@12680000 ... [ 11.435732] samsung-uart 13800000.serial: DMA request failed [ 72.963893] samsung-uart 13800000.serial: DMA request failed [ 73.143361] Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000000 DMA is not necessary for serial to work, so continue with UART startup after emitting a warning. Fixes: `62c37eedb7` ("serial: samsung: add dma reqest/release functions") Signed-off-by: Krzysztof Kozlowski <krzk@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:33 +08:00
Johan Hovold	d0ef6ecee8	USB: serial: io_ti: fix information leak in completion handler commit `654b404f2a` upstream. Add missing sanity check to the bulk-in completion handler to avoid an integer underflow that can be triggered by a malicious device. This avoids leaking 128 kB of memory content from after the URB transfer buffer to user space. Fixes: `8c209e6782` ("USB: make actual_length in struct urb field u32") Fixes: `1da177e4c3` ("Linux-2.6.12-rc2") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:33 +08:00
Johan Hovold	449b0bb237	USB: serial: io_ti: fix NULL-deref in interrupt callback commit `0b1d250afb` upstream. Fix a NULL-pointer dereference in the interrupt callback should a malicious device send data containing a bad port number by adding the missing sanity check. Fixes: `1da177e4c3` ("Linux-2.6.12-rc2") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:33 +08:00
Johan Hovold	f06b17020a	USB: iowarrior: fix NULL-deref in write commit `de46e56653` upstream. Make sure to verify that we have the required interrupt-out endpoint for IOWarrior56 devices to avoid dereferencing a NULL-pointer in write should a malicious device lack such an endpoint. Fixes: `946b960d13` ("USB: add driver for iowarrior devices.") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:33 +08:00
Johan Hovold	653418adaf	USB: iowarrior: fix NULL-deref at probe commit `b7321e81fc` upstream. Make sure to check for the required interrupt-in endpoint to avoid dereferencing a NULL-pointer should a malicious device lack such an endpoint. Note that a fairly recent change purported to fix this issue, but added an insufficient test on the number of endpoints only, a test which can now be removed. Fixes: `4ec0ef3a82` ("USB: iowarrior: fix oops with malicious USB descriptors") Fixes: `946b960d13` ("USB: add driver for iowarrior devices.") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:33 +08:00
Johan Hovold	6d6c5895f4	USB: serial: omninet: fix reference leaks at open commit `30572418b4` upstream. This driver needlessly took another reference to the tty on open, a reference which was then never released on close. This lead to not just a leak of the tty, but also a driver reference leak that prevented the driver from being unloaded after a port had once been opened. Fixes: `4a90f09b20` ("tty: usb-serial krefs") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:32 +08:00
Johan Hovold	0ab90ddba7	USB: serial: safe_serial: fix information leak in completion handler commit `8c76d7cd52` upstream. Add missing sanity check to the bulk-in completion handler to avoid an integer underflow that could be triggered by a malicious device. This avoids leaking up to 56 bytes from after the URB transfer buffer to user space. Fixes: `1da177e4c3` ("Linux-2.6.12-rc2") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:32 +08:00
Guenter Roeck	24db1c5a1c	usb: host: xhci-plat: Fix timeout on removal of hot pluggable xhci controllers commit `dcc7620cad` upstream. Upstream commit `98d74f9cea` ("xhci: fix 10 second timeout on removal of PCI hotpluggable xhci controllers") fixes a problem with hot pluggable PCI xhci controllers which can result in excessive timeouts, to the point where the system reports a deadlock. The same problem is seen with hot pluggable xhci controllers using the xhci-plat driver, such as the driver used for Type-C ports on rk3399. Similar to hot-pluggable PCI controllers, the driver for this chip removes the xhci controller from the system when the Type-C cable is disconnected. The solution for PCI devices works just as well for non-PCI devices and avoids the problem. Signed-off-by: Guenter Roeck <linux@roeck-us.net> Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:32 +08:00
Peter Chen	d863767907	usb: host: xhci-dbg: HCIVERSION should be a binary number commit `f95e60a7db` upstream. According to xHCI spec, HCIVERSION containing a BCD encoding of the xHCI specification revision number, 0100h corresponds to xHCI version 1.0. Change "100" as "0x100". Cc: Lu Baolu <baolu.lu@linux.intel.com> Fixes: `04abb6de28` ("xhci: Read and parse new xhci 1.1 capability register") Signed-off-by: Peter Chen <peter.chen@nxp.com> Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:32 +08:00
Roger Quadros	03123df08e	Revert "usb: gadget: uvc: Add missing call for additional setup data" commit `eb38d913c2` upstream. This reverts commit `4fbac5206a`. This commit breaks g_webcam when used with uvc-gadget [1]. The user space application (e.g. uvc-gadget) is responsible for sending response to UVC class specific requests on control endpoint in uvc_send_response() in uvc_v4l2.c. The bad commit was causing a duplicate response to be sent with incorrect response data thus causing UVC probe to fail at the host and broken control transfer endpoint at the gadget. [1] - git://git.ideasonboard.org/uvc-gadget.git Acked-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com> Signed-off-by: Roger Quadros <rogerq@ti.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:32 +08:00
Felipe Balbi	5ce2e4ce4a	usb: gadget: function: f_fs: pass companion descriptor along commit `2bfa0719ac` upstream. If we're dealing with SuperSpeed endpoints, we need to make sure to pass along the companion descriptor and initialize fields needed by the Gadget API. Eventually, f_fs.c should be converted to use config_ep_by_speed() like all other functions, though. Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:31 +08:00
Jelle Martijn Kok	40192c9666	usb: ohci-at91: Do not drop unhandled USB suspend control requests commit `85550f9148` upstream. In patch 2e2aa1bc7eff90ecm, USB suspend and wakeup control requests are passed to SFR_OHCIICR register. If a processor does not have such a register, this hub control request will be dropped. If no such a SFR register is available, all USB suspend control requests will now be processed using ohci_hub_control() (like before patch 2e2aa1bc7eff90ecm.) Tested on an Atmel AT91SAM9G20 with an on-board TI TUSB2046B hub chip If the last USB device is unplugged from the USB hub, the hub goes into sleep and will not wakeup when an USB devices is inserted. Fixes: `2e2aa1bc7e` ("usb: ohci-at91: Forcibly suspend ports while USB suspend") Signed-off-by: Jelle Martijn Kok <jmkok@youcom.nl> Tested-by: Wenyou Yang <wenyou.yang@atmel.com> Cc: Wenyou Yang <wenyou.yang@atmel.com> Cc: Alan Stern <stern@rowland.harvard.edu> Acked-by: Nicolas Ferre <nicolas.ferre@microchip.com> Reviewed-by: Alexandre Belloni <alexandre.belloni@free-electrons.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:31 +08:00
Roger Quadros	accadd8164	usb: dwc3-omap: Fix missing break in dwc3_omap_set_mailbox() commit `0913750f9f` upstream. We need to break from all cases if we want to treat each one of them separately. Reported-by: Gustavo A. R. Silva <garsilva@embeddedor.com> Fixes: `d2728fb3e0` ("usb: dwc3: omap: Pass VBUS and ID events transparently") Signed-off-by: Roger Quadros <rogerq@ti.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:31 +08:00
Felipe Balbi	808ee146d7	usb: dwc3: gadget: make Set Endpoint Configuration macros safe commit `7369090a9f` upstream. Some gadget drivers are bad, bad boys. We notice that ADB was passing bad Burst Size which caused top bits of param0 to be overwritten which confused DWC3 when running this command. In order to avoid future issues, we're going to make sure values passed by macros are always safe for the controller. Note that ADB still needs a fix to not pass bad values. Reported-by: Mohamed Abbas <mohamed.abbas@intel.com> Sugested-by: Adam Andruszak <adam.andruszak@intel.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:31 +08:00
Peter Chen	de90394b0c	usb: gadget: dummy_hcd: clear usb_gadget region before registration commit `5bbc852676` upstream. When the user does device unbind and rebind test, the kernel will show below dump due to usb_gadget memory region is dirty after unbind. Clear usb_gadget region for every new probe. root@imx6qdlsolo:/sys/bus/platform/drivers/dummy_udc# echo dummy_udc.0 > bind [ 102.523312] kobject (eddd78b0): tried to init an initialized object, something is seriously wrong. [ 102.532447] CPU: 0 PID: 734 Comm: sh Not tainted 4.10.0-rc7-00872-g1b2b8e9 #1298 [ 102.539866] Hardware name: Freescale i.MX6 SoloX (Device Tree) [ 102.545717] Backtrace: [ 102.548225] [<c010d090>] (dump_backtrace) from [<c010d338>] (show_stack+0x18/0x1c) [ 102.555822] r7:ede34000 r6:60010013 r5:00000000 r4:c0f29418 [ 102.561512] [<c010d320>] (show_stack) from [<c040c2a4>] (dump_stack+0xb4/0xe8) [ 102.568764] [<c040c1f0>] (dump_stack) from [<c040e6d4>] (kobject_init+0x80/0x9c) [ 102.576187] r10:0000001f r9:eddd7000 r8:eeaf8c10 r7:eddd78a8 r6:c177891c r5:c0f3b060 [ 102.584036] r4:eddd78b0 r3:00000000 [ 102.587641] [<c040e654>] (kobject_init) from [<c05359a4>] (device_initialize+0x28/0xf8) [ 102.595665] r5:eebc4800 r4:eddd78a8 [ 102.599268] [<c053597c>] (device_initialize) from [<c05382ac>] (device_register+0x14/0x20) [ 102.607556] r7:eddd78a8 r6:00000000 r5:eebc4800 r4:eddd78a8 [ 102.613256] [<c0538298>] (device_register) from [<c0668ef4>] (usb_add_gadget_udc_release+0x8c/0x1ec) [ 102.622410] r5:eebc4800 r4:eddd7860 [ 102.626015] [<c0668e68>] (usb_add_gadget_udc_release) from [<c0669068>] (usb_add_gadget_udc+0x14/0x18) [ 102.635351] r10:0000001f r9:eddd7000 r8:eddd788c r7:bf003770 r6:eddd77f8 r5:eddd7818 [ 102.643198] r4:eddd785c r3:eddd7b24 [ 102.646834] [<c0669054>] (usb_add_gadget_udc) from [<bf003428>] (dummy_udc_probe+0x170/0x1c4 [dummy_hcd]) [ 102.656458] [<bf0032b8>] (dummy_udc_probe [dummy_hcd]) from [<c053d114>] (platform_drv_probe+0x54/0xb8) [ 102.665881] r10:00000008 r9:c1778960 r8:bf004128 r7:fffffdfb r6:bf004128 r5:eeaf8c10 [ 102.673727] r4:eeaf8c10 [ 102.676293] [<c053d0c0>] (platform_drv_probe) from [<c053b160>] (driver_probe_device+0x264/0x474) [ 102.685186] r7:00000000 r6:00000000 r5:c1778960 r4:eeaf8c10 [ 102.690876] [<c053aefc>] (driver_probe_device) from [<c05397c4>] (bind_store+0xb8/0x14c) [ 102.698994] r10:eeb3bb4c r9:ede34000 r8:0000000c r7:eeaf8c44 r6:bf004128 r5:c0f3b668 [ 102.706840] r4:eeaf8c10 [ 102.709402] [<c053970c>] (bind_store) from [<c0538ca8>] (drv_attr_store+0x28/0x34) [ 102.716998] r9:ede34000 r8:00000000 r7:ee3863c0 r6:ee3863c0 r5:c0538c80 r4:c053970c [ 102.724776] [<c0538c80>] (drv_attr_store) from [<c029c930>] (sysfs_kf_write+0x50/0x54) [ 102.732711] r5:c0538c80 r4:0000000c [ 102.736313] [<c029c8e0>] (sysfs_kf_write) from [<c029be84>] (kernfs_fop_write+0x100/0x214) [ 102.744599] r7:ee3863c0 r6:eeb3bb40 r5:00000000 r4:00000000 [ 102.750287] [<c029bd84>] (kernfs_fop_write) from [<c0222dd8>] (__vfs_write+0x34/0x120) [ 102.758231] r10:00000000 r9:ede34000 r8:c0108bc4 r7:0000000c r6:ede35f80 r5:c029bd84 [ 102.766077] r4:ee223780 [ 102.768638] [<c0222da4>] (__vfs_write) from [<c0224678>] (vfs_write+0xa8/0x170) [ 102.775974] r9:ede34000 r8:c0108bc4 r7:ede35f80 r6:01861cb0 r5:ee223780 r4:0000000c [ 102.783743] [<c02245d0>] (vfs_write) from [<c0225498>] (SyS_write+0x4c/0xa8) [ 102.790818] r9:ede34000 r8:c0108bc4 r7:0000000c r6:01861cb0 r5:ee223780 r4:ee223780 [ 102.798595] [<c022544c>] (SyS_write) from [<c0108a20>] (ret_fast_syscall+0x0/0x1c) [ 102.806188] r7:00000004 r6:b6e83d58 r5:01861cb0 r4:0000000c Fixes: `90fccb529d` ("usb: gadget: Gadget directory cleanup - group UDC drivers") Acked-by: Alan Stern <stern@rowland.harvard.edu> Signed-off-by: Peter Chen <peter.chen@nxp.com> Tested-by: Xiaolong Ye <xiaolong.ye@intel.com> Reported-by: Fengguang Wu <fengguang.wu@intel.com> Signed-off-by: Felipe Balbi <felipe.balbi@linux.intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:31 +08:00
Ethan Zhao	ed99f5a09c	PCI: Prevent VPD access for QLogic ISP2722 commit `0d5370d1d8` upstream. QLogic ISP2722-based 16/32Gb Fibre Channel to PCIe Adapter has the VPD access issue too, while read the common pci-sysfs access interface shown as /sys/devices/pci0000:00/0000:00:03.2/0000:0b:00.0/vpd with simple 'cat' could cause system hang and panic: Kernel panic - not syncing: An NMI occurred. Depending on your system the reason for the NMI is logged in any one of the following resources: 1. Integrated Management Log (IML) 2. OA Syslog 3. OA Forward Progress Log 4. iLO Event Log CPU: 0 PID: 15070 Comm: udevadm Not tainted 4.1.12 Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 12/27/2015 0000000000000086 000000007f0cdf51 ffff880c4fa05d58 ffffffff817193de ffffffffa00b42d8 0000000000000075 ffff880c4fa05dd8 ffffffff81714072 0000000000000008 ffff880c4fa05de8 ffff880c4fa05d88 000000007f0cdf51 Call Trace: <NMI> [<ffffffff817193de>] dump_stack+0x63/0x81 [<ffffffff81714072>] panic+0xd0/0x20e [<ffffffffa00b390d>] hpwdt_pretimeout+0xdd/0xe0 [hpwdt] [<ffffffff81021fc9>] ? sched_clock+0x9/0x10 [<ffffffff8101c101>] nmi_handle+0x91/0x170 [<ffffffff8101c10c>] ? nmi_handle+0x9c/0x170 [<ffffffff8101c5fe>] io_check_error+0x1e/0xa0 [<ffffffff8101c719>] default_do_nmi+0x99/0x140 [<ffffffff8101c8b4>] do_nmi+0xf4/0x170 [<ffffffff817232c5>] end_repeat_nmi+0x1a/0x1e [<ffffffff815d724b>] ? pci_conf1_read+0xeb/0x120 [<ffffffff815d724b>] ? pci_conf1_read+0xeb/0x120 [<ffffffff815d724b>] ? pci_conf1_read+0xeb/0x120 <<EOE>> [<ffffffff815db4b3>] raw_pci_read+0x23/0x40 [<ffffffff815db4fc>] pci_read+0x2c/0x30 [<ffffffff8136f612>] pci_user_read_config_word+0x72/0x110 [<ffffffff8136f746>] pci_vpd_pci22_wait+0x96/0x130 [<ffffffff8136ff9b>] pci_vpd_pci22_read+0xdb/0x1a0 [<ffffffff8136ea30>] pci_read_vpd+0x20/0x30 [<ffffffff8137d590>] read_vpd_attr+0x30/0x40 [<ffffffff8128e037>] sysfs_kf_bin_read+0x47/0x70 [<ffffffff8128d24e>] kernfs_fop_read+0xae/0x180 [<ffffffff8120dd97>] __vfs_read+0x37/0x100 [<ffffffff812ba7e4>] ? security_file_permission+0x84/0xa0 [<ffffffff8120e366>] ? rw_verify_area+0x56/0xe0 [<ffffffff8120e476>] vfs_read+0x86/0x140 [<ffffffff8120f3f5>] SyS_read+0x55/0xd0 [<ffffffff81720f2e>] system_call_fastpath+0x12/0x71 Shutting down cpus with NMI Kernel Offset: disabled drm_kms_helper: panic occurred, switching back to text console So blacklist the access to its VPD. Signed-off-by: Ethan Zhao <ethan.zhao@oracle.com> Signed-off-by: Bjorn Helgaas <bhelgaas@google.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:30 +08:00
Balbir Singh	1ced52ead2	powerpc/xics: Work around limitations of OPAL XICS priority handling commit `a69e2fb703` upstream. The CPPR (Current Processor Priority Register) of a XICS interrupt presentation controller contains a value N, such that only interrupts with a priority "more favoured" than N will be received by the CPU, where "more favoured" means "less than". So if the CPPR has the value 5 then only interrupts with a priority of 0-4 inclusive will be received. In theory the CPPR can support a value of 0 to 255 inclusive. In practice Linux only uses values of 0, 4, 5 and 0xff. Setting the CPPR to 0 rejects all interrupts, setting it to 0xff allows all interrupts. The values 4 and 5 are used to differentiate IPIs from external interrupts. Setting the CPPR to 5 allows IPIs to be received but not external interrupts. The CPPR emulation in the OPAL XICS implementation only directly supports priorities 0 and 0xff. All other priorities are considered equivalent, and mapped to a single priority value internally. This means when using icp-opal we can not allow IPIs but not externals. This breaks Linux's use of priority values when a CPU is hot unplugged. After migrating IRQs away from the CPU that is being offlined, we set the priority to 5, meaning we still want the offline CPU to receive IPIs. But the effect of the OPAL XICS emulation's use of a single priority value is that all interrupts are rejected by the CPU. With the CPU offline, and not receiving IPIs, we may not be able to wake it up to bring it back online. The first part of the fix is in icp_opal_set_cpu_priority(). CPPR values of 0 to 4 inclusive will correctly cause all interrupts to be rejected, so we pass those CPPR values through to OPAL. However if we are called with a CPPR of 5 or greater, the caller is expecting to be able to allow IPIs but not external interrupts. We know this doesn't work, so instead of rejecting all interrupts we choose the opposite which is to allow all interrupts. This is still not correct behaviour, but we know for the only existing caller (xics_migrate_irqs_away()), that it is the better option. The other part of the fix is in xics_migrate_irqs_away(). Instead of setting priority (CPPR) to 0, and then back to 5 before migrating IRQs, we migrate the IRQs before setting the priority back to 5. This should have no effect on an ICP backend with a working set_priority(), and on icp-opal it means we will keep all interrupts blocked until after we've finished doing the IRQ migration. Additionally we wait for 5ms after doing the migration to make sure there are no IRQs in flight. Fixes: `d74361881f` ("powerpc/xics: Add ICP OPAL backend") Suggested-by: Michael Ellerman <mpe@ellerman.id.au> Reported-by: Vaidyanathan Srinivasan <svaidy@linux.vnet.ibm.com> Tested-by: Vaidyanathan Srinivasan <svaidy@linux.vnet.ibm.com> Signed-off-by: Balbir Singh <bsingharora@gmail.com> [mpe: Rewrote comments and change log, change delay to 5ms] Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:30 +08:00
Ravi Bangoria	bc8d2eefe6	powerpc: Emulation support for load/store instructions on LE commit `e148bd17f4` upstream. emulate_step() uses a number of underlying kernel functions that were initially not enabled for LE. This has been rectified since. So, fix emulate_step() for LE for the corresponding instructions. Reported-by: Anton Blanchard <anton@samba.org> Signed-off-by: Ravi Bangoria <ravi.bangoria@linux.vnet.ibm.com> Signed-off-by: Michael Ellerman <mpe@ellerman.id.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:30 +08:00
Qi Hou	ac4666a7fd	i2c: add missing of_node_put in i2c_mux_del_adapters commit `2e1e4949f9` upstream. Refcount of of_node is increased with of_node_get() in i2c_mux_add_adapter(). It must be decreased with of_node_put() in i2c_mux_del_adapters(). Signed-off-by: Qi Hou <qi.hou@windriver.com> Reviewed-by: Zhang Xiao <xiao.zhang@windriver.com> Signed-off-by: Peter Rosin <peda@axentia.se> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:30 +08:00
Jonathan McDowell	06996254a6	dw2102: don't do DMA on stack commit `606142af57` upstream. On Kernel 4.9, WARNINGs about doing DMA on stack are hit at the dw2102 driver: one in su3000_power_ctrl() and the other in tt_s2_4600_frontend_attach(). Both were due to the use of buffers on the stack as parameters to dvb_usb_generic_rw() and the resulting attempt to do DMA with them. The device was non-functional as a result. So, switch this driver over to use a buffer within the device state structure, as has been done with other DVB-USB drivers. Tested with TechnoTrend TT-connect S2-4600. [mchehab@osg.samsung.com: fixed a warning at su3000_i2c_transfer() that state var were dereferenced before check 'd'] Signed-off-by: Jonathan McDowell <noodles@earth.li> Signed-off-by: Mauro Carvalho Chehab <mchehab@s-opensource.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:30 +08:00
Ard Biesheuvel	56d91e106b	efi/arm: Fix boot crash with CONFIG_CPUMASK_OFFSTACK=y commit `d1eb98143c` upstream. On ARM and arm64, we use a dedicated mm_struct to map the UEFI Runtime Services regions, which allows us to map those regions on demand, and in a way that is guaranteed to be compatible with incoming kernels across kexec. As it turns out, we don't fully initialize the mm_struct in the same way as process mm_structs are initialized on fork(), which results in the following crash on ARM if CONFIG_CPUMASK_OFFSTACK=y is enabled: ... EFI Variables Facility v0.08 2004-May-17 Unable to handle kernel NULL pointer dereference at virtual address 00000000 [...] Process swapper/0 (pid: 1) ... __memzero() check_and_switch_context() virt_efi_get_next_variable() efivar_init() efivars_sysfs_init() do_one_initcall() ... This is due to a missing call to mm_init_cpumask(), so add it. Signed-off-by: Ard Biesheuvel <ard.biesheuvel@linaro.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Matt Fleming <matt@codeblueprint.co.uk> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: linux-efi@vger.kernel.org Link: http://lkml.kernel.org/r/1488395154-29786-1-git-send-email-ard.biesheuvel@linaro.org Signed-off-by: Ingo Molnar <mingo@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:29 +08:00
Eric W. Biederman	ee6f7ee1e4	ucount: Remove the atomicity from ucount->count commit `040757f738` upstream. Always increment/decrement ucount->count under the ucounts_lock. The increments are there already and moving the decrements there means the locking logic of the code is simpler. This simplification in the locking logic fixes a race between put_ucounts and get_ucounts that could result in a use-after-free because the count could go zero then be found by get_ucounts and then be freed by put_ucounts. A bug presumably this one was found by a combination of syzkaller and KASAN. JongWhan Kim reported the syzkaller failure and Dmitry Vyukov spotted the race in the code. Fixes: `f6b2db1a3e` ("userns: Make the count of user namespaces per user") Reported-by: JongHwan Kim <zzoru007@gmail.com> Reported-by: Dmitry Vyukov <dvyukov@google.com> Reviewed-by: Andrei Vagin <avagin@gmail.com> Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:29 +08:00
Rik van Riel	8bb208d02a	tracing: Add #undef to fix compile error commit `bf7165cfa2` upstream. There are several trace include files that define TRACE_INCLUDE_FILE. Include several of them in the same .c file (as I currently have in some code I am working on), and the compile will blow up with a "warning: "TRACE_INCLUDE_FILE" redefined #define TRACE_INCLUDE_FILE syscalls" Every other include file in include/trace/events/ avoids that issue by having a #undef TRACE_INCLUDE_FILE before the #define; syscalls.h should have one, too. Link: http://lkml.kernel.org/r/20160928225554.13bd7ac6@annuminas.surriel.com Fixes: `b8007ef742` ("tracing: Separate raw syscall from syscall tracer") Signed-off-by: Rik van Riel <riel@redhat.com> Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:29 +08:00
Arnd Bergmann	b72ae5ca7a	cpmac: remove hopeless #warning commit `d43e6fb4ac` upstream. The #warning was present 10 years ago when the driver first got merged. As the platform is rather obsolete by now, it seems very unlikely that the warning will cause anyone to fix the code properly. kernelci.org reports the warning for every build in the meantime, so I think it's better to just turn it into a code comment to reduce noise. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:29 +08:00
Arnd Bergmann	04275d2a8a	bcm63xx_enet: avoid uninitialized variable warning commit `df384d435a` upstream. gcc-7 and probably earlier versions get confused by this function and print a harmless warning: drivers/net/ethernet/broadcom/bcm63xx_enet.c: In function 'bcm_enet_open': drivers/net/ethernet/broadcom/bcm63xx_enet.c:1130:3: error: 'phydev' may be used uninitialized in this function [-Werror=maybe-uninitialized] This adds an initialization for the 'phydev' variable when it is unused and changes the check to test for that NULL pointer to make it clear that we always pass a valid pointer here. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:29 +08:00
Arnd Bergmann	b7e968da04	MIPS: ralink: Remove unused rt*_wdt_reset functions commit `886f9c69fc` upstream. All pointers to these functions were removed, so now they produce warnings: arch/mips/ralink/rt305x.c:92:13: error: 'rt305x_wdt_reset' defined but not used [-Werror=unused-function] This removes the functions. If we need them again, the patch can be reverted later. Fixes: `f576fb6a07` ("MIPS: ralink: cleanup the soc specific pinmux data") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: John Crispin <john@phrozen.org> Cc: Colin Ian King <colin.king@canonical.com> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15044/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:28 +08:00
Arnd Bergmann	dd2419e1ce	MIPS: ralink: Remove unused timer functions commit `d92240d12a` upstream. The functions were originally used for the module unload path, but are not referenced any more and just cause warnings: arch/mips/ralink/timer.c:104:13: error: 'rt_timer_disable' defined but not used [-Werror=unused-function] arch/mips/ralink/timer.c:74:13: error: 'rt_timer_free' defined but not used [-Werror=unused-function] Cc: Paul Gortmaker <paul.gortmaker@windriver.com> Fixes: `62ee73d284` ("MIPS: ralink: Make timer explicitly non-modular") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: Paul Gortmaker <paul.gortmaker@windriver.com> Cc: John Crispin <john@phrozen.org> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15041/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:28 +08:00
John Crispin	f34064186f	MIPS: ralink: Cosmetic change to prom_init(). commit `9c48568b36` upstream. Over the years the code has been changed various times leading to argc/argv being defined in a different function to where we actually use the variables. Clean this up by moving them to prom_init_cmdline(). Signed-off-by: John Crispin <john@phrozen.org> Cc: linux-mips@linux-mips.org Patchwork: https://patchwork.linux-mips.org/patch/14902/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:28 +08:00
Arnd Bergmann	7b767f6b51	mtd: pmcmsp: use kstrndup instead of kmalloc+strncpy commit `906b268477` upstream. kernelci.org reports a warning for this driver, as it copies a local variable into a 'const char *' string: drivers/mtd/maps/pmcmsp-flash.c:149:30: warning: passing argument 1 of 'strncpy' discards 'const' qualifier from pointer target type [-Wdiscarded-qualifiers] Using kstrndup() simplifies the code and avoids the warning. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Acked-by: Marek Vasut <marek.vasut@gmail.com> Signed-off-by: Brian Norris <computersforpeace@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:28 +08:00
Arnd Bergmann	976e40d7f6	MIPS: Update lemote2f_defconfig for CPU_FREQ_STAT change commit `b3f6046186` upstream. Since linux-4.8, CPU_FREQ_STAT is a bool symbol, causing a warning in kernelci.org: arch/mips/configs/lemote2f_defconfig:42:warning: symbol value 'm' invalid for CPU_FREQ_STAT This updates the defconfig to have the feature built-in. Fixes: `1aefc75b24` ("cpufreq: stats: Make the stats code non-modular") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15000/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:27 +08:00
Arnd Bergmann	dd2ef28eb7	MIPS: ip22: Fix ip28 build for modern gcc commit `23ca9b5223` upstream. kernelci reports a failure of the ip28_defconfig build after upgrading its gcc version: arch/mips/sgi-ip22/Platform:29: *** gcc doesn't support needed option -mr10k-cache-barrier=store. Stop. The problem apparently is that the -mr10k-cache-barrier=store option is now rejected for CPUs other than r10k. Explicitly including the CPU in the check fixes this and is safe because both options were introduced in gcc-4.4. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15049/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:27 +08:00
Arnd Bergmann	aff853abd2	MIPS: Update ip27_defconfig for SCSI_DH change commit `ea58fca184` upstream. Since linux-4.3, SCSI_DH is a bool symbol, causing a warning in kernelci.org: arch/mips/configs/ip27_defconfig:136:warning: symbol value 'm' invalid for SCSI_DH This updates the defconfig to have the feature built-in. Fixes: `086b91d052` ("scsi_dh: integrate into the core SCSI code") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15001/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:27 +08:00
Arnd Bergmann	23096c5678	MIPS: ip27: Disable qlge driver in defconfig commit `b617649468` upstream. One of the last remaining failures in kernelci.org is for a gcc bug: drivers/net/ethernet/qlogic/qlge/qlge_main.c:4819:1: error: insn does not satisfy its constraints: drivers/net/ethernet/qlogic/qlge/qlge_main.c:4819:1: internal compiler error: in extract_constrain_insn, at recog.c:2190 This is apparently broken in gcc-6 but fixed in gcc-7, and I cannot reproduce the problem here. However, it is clear that ip27_defconfig does not actually need this driver as the platform has only PCI-X but not PCIe, and the qlge adapter in turn is PCIe-only. The driver was originally enabled in 2010 along with lots of other drivers. Fixes: `59d302b342` ("MIPS: IP27: Make defconfig useful again.") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: Ralf Baechle <ralf@linux-mips.org> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15197/ Signed-off-by: James Hogan <james.hogan@imgtec.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:27 +08:00
Arnd Bergmann	ad8387a602	MIPS: VDSO: avoid duplicate CAC_BASE definition commit `1742ac2650` upstream. vdso.h includes <spaces.h> implicitly after defining CONFIG_32BITS. This defeats the override in mach-ip27/spaces.h, leading to a build error that shows up in kernelci.org: In file included from arch/mips/include/asm/mach-ip27/spaces.h:29:0, from arch/mips/include/asm/page.h:12, from arch/mips/vdso/vdso.h:26, from arch/mips/vdso/gettimeofday.c:11: arch/mips/include/asm/mach-generic/spaces.h:28:0: error: "CAC_BASE" redefined [-Werror] #define CAC_BASE _AC(0x80000000, UL) An earlier patch tried to make the second definition conditional, but that patch had the #ifdef in the wrong place, and would lead to another warning: arch/mips/include/asm/io.h: In function 'phys_to_virt': arch/mips/include/asm/io.h:138:9: error: cast to pointer from integer of different size [-Werror=int-to-pointer-cast] For all I can tell, there is no other reason than vdso32 to ever include this file with CONFIG_32BITS set, and the vdso itself should never refer to the base addresses as it is running in user space, so adding an #ifdef here is safe. Link: https://patchwork.kernel.org/patch/9418187/ Fixes: `3ffc17d876` ("MIPS: Adjust MIPS64 CAC_BASE to reflect Config.K0") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: Paul Burton <paul.burton@imgtec.com> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/15039/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:27 +08:00
Arnd Bergmann	5841e3d37d	MIPS: Update defconfigs for NF_CT_PROTO_DCCP/UDPLITE change commit `9ddc16ad8e` upstream. In linux-4.10-rc, NF_CT_PROTO_UDPLITE and NF_CT_PROTO_DCCP are bool symbols instead of tristate, and kernelci.org reports a bunch of warnings for this, like: arch/mips/configs/malta_kvm_guest_defconfig:63:warning: symbol value 'm' invalid for NF_CT_PROTO_UDPLITE arch/mips/configs/malta_defconfig:62:warning: symbol value 'm' invalid for NF_CT_PROTO_DCCP arch/mips/configs/malta_defconfig:63:warning: symbol value 'm' invalid for NF_CT_PROTO_UDPLITE arch/mips/configs/ip22_defconfig:70:warning: symbol value 'm' invalid for NF_CT_PROTO_DCCP arch/mips/configs/ip22_defconfig:71:warning: symbol value 'm' invalid for NF_CT_PROTO_UDPLITE This changes all the MIPS defconfigs with these symbols to have them built-in. Fixes: `9b91c96c5d` ("netfilter: conntrack: built-in support for UDPlite") Fixes: `c51d39010a` ("netfilter: conntrack: built-in support for DCCP") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Cc: linux-mips@linux-mips.org Cc: linux-kernel@vger.kernel.org Patchwork: https://patchwork.linux-mips.org/patch/14999/ Signed-off-by: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:26 +08:00
Arnd Bergmann	2c1820ea8d	crypto: improve gcc optimization flags for serpent and wp512 commit `7d6e910502` upstream. An ancient gcc bug (first reported in 2003) has apparently resurfaced on MIPS, where kernelci.org reports an overly large stack frame in the whirlpool hash algorithm: crypto/wp512.c:987:1: warning: the frame size of 1112 bytes is larger than 1024 bytes [-Wframe-larger-than=] With some testing in different configurations, I'm seeing large variations in stack frames size up to 1500 bytes for what should have around 300 bytes at most. I also checked the reference implementation, which is essentially the same code but also comes with some test and benchmarking infrastructure. It seems that recent compiler versions on at least arm, arm64 and powerpc have a partial fix for this problem, but enabling "-fsched-pressure", but even with that fix they suffer from the issue to a certain degree. Some testing on arm64 shows that the time needed to hash a given amount of data is roughly proportional to the stack frame size here, which makes sense given that the wp512 implementation is doing lots of loads for table lookups, and the problem with the overly large stack is a result of doing a lot more loads and stores for spilled registers (as seen from inspecting the object code). Disabling -fschedule-insns consistently fixes the problem for wp512, in my collection of cross-compilers, the results are consistently better or identical when comparing the stack sizes in this function, though some architectures (notable x86) have schedule-insns disabled by default. The four columns are: default: -O2 press: -O2 -fsched-pressure nopress: -O2 -fschedule-insns -fno-sched-pressure nosched: -O2 -no-schedule-insns (disables sched-pressure) default press nopress nosched alpha-linux-gcc-4.9.3 1136 848 1136 176 am33_2.0-linux-gcc-4.9.3 2100 2076 2100 2104 arm-linux-gnueabi-gcc-4.9.3 848 848 1048 352 cris-linux-gcc-4.9.3 272 272 272 272 frv-linux-gcc-4.9.3 1128 1000 1128 280 hppa64-linux-gcc-4.9.3 1128 336 1128 184 hppa-linux-gcc-4.9.3 644 308 644 276 i386-linux-gcc-4.9.3 352 352 352 352 m32r-linux-gcc-4.9.3 720 656 720 268 microblaze-linux-gcc-4.9.3 1108 604 1108 256 mips64-linux-gcc-4.9.3 1328 592 1328 208 mips-linux-gcc-4.9.3 1096 624 1096 240 powerpc64-linux-gcc-4.9.3 1088 432 1088 160 powerpc-linux-gcc-4.9.3 1080 584 1080 224 s390-linux-gcc-4.9.3 456 456 624 360 sh3-linux-gcc-4.9.3 292 292 292 292 sparc64-linux-gcc-4.9.3 992 240 992 208 sparc-linux-gcc-4.9.3 680 592 680 312 x86_64-linux-gcc-4.9.3 224 240 272 224 xtensa-linux-gcc-4.9.3 1152 704 1152 304 aarch64-linux-gcc-7.0.0 224 224 1104 208 arm-linux-gnueabi-gcc-7.0.1 824 824 1048 352 mips-linux-gcc-7.0.0 1120 648 1120 272 x86_64-linux-gcc-7.0.1 240 240 304 240 arm-linux-gnueabi-gcc-4.4.7 840 392 arm-linux-gnueabi-gcc-4.5.4 784 728 784 320 arm-linux-gnueabi-gcc-4.6.4 736 728 736 304 arm-linux-gnueabi-gcc-4.7.4 944 784 944 352 arm-linux-gnueabi-gcc-4.8.5 464 464 760 352 arm-linux-gnueabi-gcc-4.9.3 848 848 1048 352 arm-linux-gnueabi-gcc-5.3.1 824 824 1064 336 arm-linux-gnueabi-gcc-6.1.1 808 808 1056 344 arm-linux-gnueabi-gcc-7.0.1 824 824 1048 352 Trying the same test for serpent-generic, the picture is a bit different, and while -fno-schedule-insns is generally better here than the default, -fsched-pressure wins overall, so I picked that instead. default press nopress nosched alpha-linux-gcc-4.9.3 1392 864 1392 960 am33_2.0-linux-gcc-4.9.3 536 524 536 528 arm-linux-gnueabi-gcc-4.9.3 552 552 776 536 cris-linux-gcc-4.9.3 528 528 528 528 frv-linux-gcc-4.9.3 536 400 536 504 hppa64-linux-gcc-4.9.3 524 208 524 480 hppa-linux-gcc-4.9.3 768 472 768 508 i386-linux-gcc-4.9.3 564 564 564 564 m32r-linux-gcc-4.9.3 712 576 712 532 microblaze-linux-gcc-4.9.3 724 392 724 512 mips64-linux-gcc-4.9.3 720 384 720 496 mips-linux-gcc-4.9.3 728 384 728 496 powerpc64-linux-gcc-4.9.3 704 304 704 480 powerpc-linux-gcc-4.9.3 704 296 704 480 s390-linux-gcc-4.9.3 560 560 592 536 sh3-linux-gcc-4.9.3 540 540 540 540 sparc64-linux-gcc-4.9.3 544 352 544 496 sparc-linux-gcc-4.9.3 544 344 544 496 x86_64-linux-gcc-4.9.3 528 536 576 528 xtensa-linux-gcc-4.9.3 752 544 752 544 aarch64-linux-gcc-7.0.0 432 432 656 480 arm-linux-gnueabi-gcc-7.0.1 616 616 808 536 mips-linux-gcc-7.0.0 720 464 720 488 x86_64-linux-gcc-7.0.1 536 528 600 536 arm-linux-gnueabi-gcc-4.4.7 592 440 arm-linux-gnueabi-gcc-4.5.4 776 448 776 544 arm-linux-gnueabi-gcc-4.6.4 776 448 776 544 arm-linux-gnueabi-gcc-4.7.4 768 448 768 544 arm-linux-gnueabi-gcc-4.8.5 488 488 776 544 arm-linux-gnueabi-gcc-4.9.3 552 552 776 536 arm-linux-gnueabi-gcc-5.3.1 552 552 776 536 arm-linux-gnueabi-gcc-6.1.1 560 560 776 536 arm-linux-gnueabi-gcc-7.0.1 616 616 808 536 I did not do any runtime tests with serpent, so it is possible that stack frame size does not directly correlate with runtime performance here and it actually makes things worse, but it's more likely to help here, and the reduced stack frame size is probably enough reason to apply the patch, especially given that the crypto code is often used in deep call chains. Link: https://kernelci.org/build/id/58797d7559b5149efdf6c3a9/logs/ Link: http://www.larc.usp.br/~pbarreto/WhirlpoolPage.html Link: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=11488 Link: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79149 Cc: Ralf Baechle <ralf@linux-mips.org> Signed-off-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:26 +08:00
Johan Hovold	ce4d67cb3e	USB: serial: digi_acceleport: fix OOB-event processing commit `2e46565cf6` upstream. A recent change claimed to fix an off-by-one error in the OOB-port completion handler, but instead introduced such an error. This could specifically led to modem-status changes going unnoticed, effectively breaking TIOCMGET. Note that the offending commit fixes a loop-condition underflow and is marked for stable, but should not be backported without this fix. Reported-by: Ben Hutchings <ben@decadent.org.uk> Fixes: `2d38088921` ("USB: serial: digi_acceleport: fix OOB data sanity check") Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:26 +08:00
Johan Hovold	e1533c4615	USB: serial: digi_acceleport: fix OOB data sanity check commit `2d38088921` upstream. Make sure to check for short transfers to avoid underflow in a loop condition when parsing the receive buffer. Also fix an off-by-one error in the incomplete sanity check which could lead to invalid data being parsed. Fixes: `8c209e6782` ("USB: make actual_length in struct urb field u32") Fixes: `1da177e4c3` ("Linux-2.6.12-rc2") Reviewed-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Johan Hovold <johan@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2017-03-18 19:14:26 +08:00

503 changed files with 6429 additions and 3955 deletions

44

Documentation/arm64/silicon-errata.txt

View File

 @ -42,24 +42,26 @@ file acts as a registry of software workarounds in the Linux Kernel and
 will be updated when new workarounds are committed and backported to
 stable kernels.
 | Implementor    | Component       | Erratum ID      | Kconfig                 |
 +----------------+-----------------+-----------------+-------------------------+
 | ARM            | Cortex-A53      | #826319         | ARM64_ERRATUM_826319    |
 | ARM            | Cortex-A53      | #827319         | ARM64_ERRATUM_827319    |
 | ARM            | Cortex-A53      | #824069         | ARM64_ERRATUM_824069    |
 | ARM            | Cortex-A53      | #819472         | ARM64_ERRATUM_819472    |
 | ARM            | Cortex-A53      | #845719         | ARM64_ERRATUM_845719    |
 | ARM            | Cortex-A53      | #843419         | ARM64_ERRATUM_843419    |
 | ARM            | Cortex-A57      | #832075         | ARM64_ERRATUM_832075    |
 | ARM            | Cortex-A57      | #852523         | N/A                     |
 | ARM            | Cortex-A57      | #834220         | ARM64_ERRATUM_834220    |
 | ARM            | Cortex-A72      | #853709         | N/A                     |
 | ARM            | MMU-500         | #841119,#826419 | N/A                     |
 |                |                 |                 |                         |
 | Cavium         | ThunderX ITS    | #22375, #24313  | CAVIUM_ERRATUM_22375    |
 | Cavium         | ThunderX ITS    | #23144          | CAVIUM_ERRATUM_23144    |
 | Cavium         | ThunderX GICv3  | #23154          | CAVIUM_ERRATUM_23154    |
 | Cavium         | ThunderX Core   | #27456          | CAVIUM_ERRATUM_27456    |
 | Cavium         | ThunderX SMMUv2 | #27704          | N/A		       |
 |                |                 |                 |                         |
 | Freescale/NXP  | LS2080A/LS1043A | A-008585        | FSL_ERRATUM_A008585     |
 | Implementor    | Component       | Erratum ID      | Kconfig                     |
 +----------------+-----------------+-----------------+-----------------------------+
 | ARM            | Cortex-A53      | #826319         | ARM64_ERRATUM_826319        |
 | ARM            | Cortex-A53      | #827319         | ARM64_ERRATUM_827319        |
 | ARM            | Cortex-A53      | #824069         | ARM64_ERRATUM_824069        |
 | ARM            | Cortex-A53      | #819472         | ARM64_ERRATUM_819472        |
 | ARM            | Cortex-A53      | #845719         | ARM64_ERRATUM_845719        |
 | ARM            | Cortex-A53      | #843419         | ARM64_ERRATUM_843419        |
 | ARM            | Cortex-A57      | #832075         | ARM64_ERRATUM_832075        |
 | ARM            | Cortex-A57      | #852523         | N/A                         |
 | ARM            | Cortex-A57      | #834220         | ARM64_ERRATUM_834220        |
 | ARM            | Cortex-A72      | #853709         | N/A                         |
 | ARM            | MMU-500         | #841119,#826419 | N/A                         |
 |                |                 |                 |                             |
 | Cavium         | ThunderX ITS    | #22375, #24313  | CAVIUM_ERRATUM_22375        |
 | Cavium         | ThunderX ITS    | #23144          | CAVIUM_ERRATUM_23144        |
 | Cavium         | ThunderX GICv3  | #23154          | CAVIUM_ERRATUM_23154        |
 | Cavium         | ThunderX Core   | #27456          | CAVIUM_ERRATUM_27456        |
 | Cavium         | ThunderX SMMUv2 | #27704          | N/A                         |
 |                |                 |                 |                             |
 | Freescale/NXP  | LS2080A/LS1043A | A-008585        | FSL_ERRATUM_A008585         |
 |                |                 |                 |                             |
 | Qualcomm Tech. | QDF2400 ITS     | E0065           | QCOM_QDF2400_ERRATUM_0065   |

5

Documentation/devicetree/bindings/arm/arch_timer.txt

View File

 @ -38,6 +38,11 @@ to deliver its interrupts via SPIs.
   architecturally-defined reset values. Only supported for 32-bit
   systems which follow the ARMv7 architected reset values.
 - arm,no-tick-in-suspend : The main counter does not tick when the system is in
   low-power system suspend on some SoCs. This behavior does not match the
   Architecture Reference Manual's specification that the system counter "must
   be implemented in an always-on power domain."
 Example:

8

Documentation/devicetree/bindings/display/sunxi/sun4i-drm.txt

View File

 @ -28,6 +28,8 @@ The TCON acts as a timing controller for RGB, LVDS and TV interfaces.
 Required properties:
  - compatible: value must be either:
    * allwinner,sun5i-a13-tcon
    * allwinner,sun6i-a31-tcon
    * allwinner,sun6i-a31s-tcon
    * allwinner,sun8i-a33-tcon
  - reg: base address and size of memory-mapped region
  - interrupts: interrupt associated to this IP
 @ -50,7 +52,7 @@ Required properties:
   second the block connected to the TCON channel 1 (usually the TV
   encoder)
 On the A13, there is one more clock required:
 On SoCs other than the A33, there is one more clock required:
    - 'tcon-ch1': The clock driving the TCON channel 1
 DRC
 @ -87,6 +89,7 @@ system.
 Required properties:
   - compatible: value must be one of:
     * allwinner,sun5i-a13-display-backend
     * allwinner,sun6i-a31-display-backend
     * allwinner,sun8i-a33-display-backend
   - reg: base address and size of the memory-mapped region.
   - clocks: phandles to the clocks feeding the frontend and backend
 @ -117,6 +120,7 @@ deinterlacing and color space conversion.
 Required properties:
   - compatible: value must be one of:
     * allwinner,sun5i-a13-display-frontend
     * allwinner,sun6i-a31-display-frontend
     * allwinner,sun8i-a33-display-frontend
   - reg: base address and size of the memory-mapped region.
   - interrupts: interrupt associated to this IP
 @ -142,6 +146,8 @@ extra node.
 Required properties:
   - compatible: value must be one of:
     * allwinner,sun5i-a13-display-engine
     * allwinner,sun6i-a31-display-engine
     * allwinner,sun6i-a31s-display-engine
     * allwinner,sun8i-a33-display-engine
   - allwinner,pipelines: list of phandle to the display engine

1

Documentation/devicetree/bindings/usb/usb-xhci.txt

View File

 @ -26,6 +26,7 @@ Required properties:
 Optional properties:
   - clocks: reference to a clock
   - usb3-lpm-capable: determines if platform is USB3 LPM capable
   - quirk-broken-port-ped: set if the controller has broken port disable mechanism
 Example:
 	usb@f0931000 {

9

Documentation/devicetree/bindings/watchdog/samsung-wdt.txt

View File

 @ -6,10 +6,11 @@ occurred.
 Required properties:
 - compatible : should be one among the following
 	(a) "samsung,s3c2410-wdt" for Exynos4 and previous SoCs
 	(b) "samsung,exynos5250-wdt" for Exynos5250
 	(c) "samsung,exynos5420-wdt" for Exynos5420
 	(c) "samsung,exynos7-wdt" for Exynos7
 	- "samsung,s3c2410-wdt" for S3C2410
 	- "samsung,s3c6410-wdt" for S3C6410, S5PV210 and Exynos4
 	- "samsung,exynos5250-wdt" for Exynos5250
 	- "samsung,exynos5420-wdt" for Exynos5420
 	- "samsung,exynos7-wdt" for Exynos7
 - reg : base physical address of the controller and length of memory mapped
 	region.

10

Documentation/kernel-parameters.txt

View File

 @ -305,6 +305,16 @@ bytes respectively. Such letter suffixes can also be entirely omitted.
 			use by PCI
 			Format: <irq>,<irq>...
 	acpi_mask_gpe=  [HW,ACPI]
 			Due to the existence of _Lxx/_Exx, some GPEs triggered
 			by unsupported hardware/firmware features can result in
                         GPE floodings that cannot be automatically disabled by
                         the GPE dispatcher.
 			This facility can be used to prevent such uncontrolled
 			GPE floodings.
 			Format: <int>
 			Support masking of GPEs numbered from 0x00 to 0x7f.
 	acpi_no_auto_serialize	[HW,ACPI]
 			Disable auto-serialization of AML methods
 			AML control methods that contain the opcodes to create

2

Documentation/stable_kernel_rules.txt

View File

 @ -124,7 +124,7 @@ specified in the following format in the sign-off area:
 .. code-block:: none
      Cc: <stable@vger.kernel.org> # 3.3.x-
      Cc: <stable@vger.kernel.org> # 3.3.x
 The tag has the meaning of:

									
										4

Makefile
									
												View File
												
				@ -1,6 +1,6 @@

				VERSION = 4

				PATCHLEVEL = 9

				SUBLEVEL = 15

				SUBLEVEL = 22

				EXTRAVERSION =

				NAME = Roaring Lionus

				@ -370,7 +370,7 @@ LDFLAGS_MODULE  =

				CFLAGS_KERNEL	=

				AFLAGS_KERNEL	=

				LDFLAGS_vmlinux =

				CFLAGS_GCOV	= -fprofile-arcs -ftest-coverage -fno-tree-loop-im -Wno-maybe-uninitialized

				CFLAGS_GCOV	:= -fprofile-arcs -ftest-coverage -fno-tree-loop-im $(call cc-disable-warning,maybe-uninitialized,)

				CFLAGS_KCOV	:= $(call cc-option,-fsanitize-coverage=trace-pc,)

4

arch/arm/boot/dts/bcm5301x.dtsi

View File

 @ -66,14 +66,14 @@
 		timer@20200 {
 			compatible = "arm,cortex-a9-global-timer";
 			reg = <0x20200 0x100>;
 			interrupts = <GIC_PPI 11 IRQ_TYPE_LEVEL_HIGH>;
 			interrupts = <GIC_PPI 11 IRQ_TYPE_EDGE_RISING>;
 			clocks = <&periph_clk>;
 		};
 		local-timer@20600 {
 			compatible = "arm,cortex-a9-twd-timer";
 			reg = <0x20600 0x100>;
 			interrupts = <GIC_PPI 13 IRQ_TYPE_LEVEL_HIGH>;
 			interrupts = <GIC_PPI 13 IRQ_TYPE_EDGE_RISING>;
 			clocks = <&periph_clk>;
 		};

2

arch/arm/boot/dts/sama5d2.dtsi

View File

 @ -266,7 +266,7 @@
 		};
 		usb1: ohci@00400000 {
 			compatible = "atmel,sama5d2-ohci", "usb-ohci";
 			compatible = "atmel,at91rm9200-ohci", "usb-ohci";
 			reg = <0x00400000 0x100000>;
 			interrupts = <41 IRQ_TYPE_LEVEL_HIGH 2>;
 			clocks = <&uhphs_clk>, <&uhphs_clk>, <&uhpck>;

1

arch/arm/boot/dts/stih407-family.dtsi

View File

 @ -680,6 +680,7 @@
 				phy-names	= "usb2-phy", "usb3-phy";
 				phys		= <&usb2_picophy0>,
 						  <&phy_port2 PHY_TYPE_USB3>;
 				snps,dis_u3_susphy_quirk;
 			};
 		};

									
										4

arch/arm/kernel/armksyms.c
									
												View File
												
				@ -178,6 +178,6 @@ EXPORT_SYMBOL(__pv_offset);

				#endif

				#ifdef CONFIG_HAVE_ARM_SMCCC

				EXPORT_SYMBOL(arm_smccc_smc);

				EXPORT_SYMBOL(arm_smccc_hvc);

				EXPORT_SYMBOL(__arm_smccc_smc);

				EXPORT_SYMBOL(__arm_smccc_hvc);

				#endif

									
										14

arch/arm/kernel/smccc-call.S
									
												View File
												
				@ -46,17 +46,19 @@ UNWIND(	.fnend)

				/*

				 * void smccc_smc(unsigned long a0, unsigned long a1, unsigned long a2,

				 *		  unsigned long a3, unsigned long a4, unsigned long a5,

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res)

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res,

				 *		  struct arm_smccc_quirk *quirk)

				 */

				ENTRY(arm_smccc_smc)

				ENTRY(__arm_smccc_smc)

					SMCCC SMCCC_SMC

				ENDPROC(arm_smccc_smc)

				ENDPROC(__arm_smccc_smc)

				/*

				 * void smccc_hvc(unsigned long a0, unsigned long a1, unsigned long a2,

				 *		  unsigned long a3, unsigned long a4, unsigned long a5,

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res)

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res,

				 *		  struct arm_smccc_quirk *quirk)

				 */

				ENTRY(arm_smccc_hvc)

				ENTRY(__arm_smccc_hvc)

					SMCCC SMCCC_HVC

				ENDPROC(arm_smccc_hvc)

				ENDPROC(__arm_smccc_hvc)

									
										23

arch/arm/kvm/mmu.c
									
												View File
												
				@ -292,11 +292,18 @@ static void unmap_stage2_range(struct kvm *kvm, phys_addr_t start, u64 size)

					phys_addr_t addr = start, end = start + size;

					phys_addr_t next;

					assert_spin_locked(&kvm->mmu_lock);

					pgd = kvm->arch.pgd + stage2_pgd_index(addr);

					do {

						next = stage2_pgd_addr_end(addr, end);

						if (!stage2_pgd_none(*pgd))

							unmap_stage2_puds(kvm, pgd, addr, next);

						/*

						 * If the range is too large, release the kvm->mmu_lock

						 * to prevent starvation and lockup detector warnings.

						 */

						if (next != end)

							cond_resched_lock(&kvm->mmu_lock);

					} while (pgd++, addr = next, addr != end);

				}

				@ -803,6 +810,7 @@ void stage2_unmap_vm(struct kvm *kvm)

					int idx;

					idx = srcu_read_lock(&kvm->srcu);

					down_read(&current->mm->mmap_sem);

					spin_lock(&kvm->mmu_lock);

					slots = kvm_memslots(kvm);

				@ -810,6 +818,7 @@ void stage2_unmap_vm(struct kvm *kvm)

						stage2_unmap_memslot(kvm, memslot);

					spin_unlock(&kvm->mmu_lock);

					up_read(&current->mm->mmap_sem);

					srcu_read_unlock(&kvm->srcu, idx);

				}

				@ -829,7 +838,10 @@ void kvm_free_stage2_pgd(struct kvm *kvm)

					if (kvm->arch.pgd == NULL)

						return;

					spin_lock(&kvm->mmu_lock);

					unmap_stage2_range(kvm, 0, KVM_PHYS_SIZE);

					spin_unlock(&kvm->mmu_lock);

					/* Free the HW pgd, one page at a time */

					free_pages_exact(kvm->arch.pgd, S2_PGD_SIZE);

					kvm->arch.pgd = NULL;

				@ -1804,6 +1816,7 @@ int kvm_arch_prepare_memory_region(struct kvm *kvm,

					    (KVM_PHYS_SIZE >> PAGE_SHIFT))

						return -EFAULT;

					down_read(&current->mm->mmap_sem);

					/*

					 * A memory region could potentially cover multiple VMAs, and any holes

					 * between them, so iterate over all of them to find out if we can map

				@ -1847,8 +1860,10 @@ int kvm_arch_prepare_memory_region(struct kvm *kvm,

							pa += vm_start - vma->vm_start;

							/* IO region dirty page logging not allowed */

							if (memslot->flags & KVM_MEM_LOG_DIRTY_PAGES)

								return -EINVAL;

							if (memslot->flags & KVM_MEM_LOG_DIRTY_PAGES) {

								ret = -EINVAL;

								goto out;

							}

							ret = kvm_phys_addr_ioremap(kvm, gpa, pa,

										    vm_end - vm_start,

				@ -1860,7 +1875,7 @@ int kvm_arch_prepare_memory_region(struct kvm *kvm,

					} while (hva < reg_end);

					if (change == KVM_MR_FLAGS_ONLY)

						return ret;

						goto out;

					spin_lock(&kvm->mmu_lock);

					if (ret)

				@ -1868,6 +1883,8 @@ int kvm_arch_prepare_memory_region(struct kvm *kvm,

					else

						stage2_flush_memslot(kvm, memslot);

					spin_unlock(&kvm->mmu_lock);

				out:

					up_read(&current->mm->mmap_sem);

					return ret;

				}

									
										18

arch/arm/mach-at91/pm.c
									
												View File
												
				@ -289,6 +289,22 @@ static void at91_ddr_standby(void)

						at91_ramc_write(1, AT91_DDRSDRC_LPR, saved_lpr1);

				}

				static void sama5d3_ddr_standby(void)

				{

					u32 lpr0;

					u32 saved_lpr0;

					saved_lpr0 = at91_ramc_read(0, AT91_DDRSDRC_LPR);

					lpr0 = saved_lpr0 & ~AT91_DDRSDRC_LPCB;

					lpr0 |= AT91_DDRSDRC_LPCB_POWER_DOWN;

					at91_ramc_write(0, AT91_DDRSDRC_LPR, lpr0);

					cpu_do_idle();

					at91_ramc_write(0, AT91_DDRSDRC_LPR, saved_lpr0);

				}

				/* We manage both DDRAM/SDRAM controllers, we need more than one value to

				 * remember.

				 */

				@ -323,7 +339,7 @@ static const struct of_device_id const ramc_ids[] __initconst = {

					{ .compatible = "atmel,at91rm9200-sdramc", .data = at91rm9200_standby },

					{ .compatible = "atmel,at91sam9260-sdramc", .data = at91sam9_sdram_standby },

					{ .compatible = "atmel,at91sam9g45-ddramc", .data = at91_ddr_standby },

					{ .compatible = "atmel,sama5d3-ddramc", .data = at91_ddr_standby },

					{ .compatible = "atmel,sama5d3-ddramc", .data = sama5d3_ddr_standby },

					{ /*sentinel*/ }

				};

									
										28

arch/arm/mach-bcm/bcm_5301x.c
									
												View File
												
				@ -9,14 +9,42 @@

				#include <asm/hardware/cache-l2x0.h>

				#include <asm/mach/arch.h>

				#include <asm/siginfo.h>

				#include <asm/signal.h>

				#define FSR_EXTERNAL		(1 << 12)

				#define FSR_READ		(0 << 10)

				#define FSR_IMPRECISE		0x0406

				static const char *const bcm5301x_dt_compat[] __initconst = {

					"brcm,bcm4708",

					NULL,

				};

				static int bcm5301x_abort_handler(unsigned long addr, unsigned int fsr,

								  struct pt_regs *regs)

				{

					/*

					 * We want to ignore aborts forwarded from the PCIe bus that are

					 * expected and shouldn't really be passed by the PCIe controller.

					 * The biggest disadvantage is the same FSR code may be reported when

					 * reading non-existing APB register and we shouldn't ignore that.

					 */

					if (fsr == (FSR_EXTERNAL | FSR_READ | FSR_IMPRECISE))

						return 0;

					return 1;

				}

				static void __init bcm5301x_init_early(void)

				{

					hook_fault_code(16 + 6, bcm5301x_abort_handler, SIGBUS, BUS_OBJERR,

							"imprecise external abort");

				}

				DT_MACHINE_START(BCM5301X, "BCM5301X")

					.l2c_aux_val	= 0,

					.l2c_aux_mask	= ~0,

					.dt_compat	= bcm5301x_dt_compat,

					.init_early	= bcm5301x_init_early,

				MACHINE_END

									
										1

arch/arm/mach-davinci/da8xx-dt.c
									
												View File
												
				@ -46,6 +46,7 @@ static struct of_dev_auxdata da850_auxdata_lookup[] __initdata = {

				static void __init da850_init_machine(void)

				{

					of_platform_default_populate(NULL, da850_auxdata_lookup, NULL);

					davinci_pm_init();

				}

				static const char *const da850_boards_compat[] __initconst = {

10

arch/arm64/Kconfig

View File

 @ -474,6 +474,16 @@ config CAVIUM_ERRATUM_27456
 	  If unsure, say Y.
 config QCOM_QDF2400_ERRATUM_0065
 	bool "QDF2400 E0065: Incorrect GITS_TYPER.ITT_Entry_size"
 	default y
 	help
 	  On Qualcomm Datacenter Technologies QDF2400 SoC, ITS hardware reports
 	  ITE size incorrectly. The GITS_TYPER.ITT_Entry_size field should have
 	  been indicated as 16Bytes (0xf), not 8Bytes (0x7).
 	  If unsure, say Y.
 endmenu

2

arch/arm64/boot/dts/hisilicon/hip06.dtsi

View File

 @ -590,7 +590,7 @@
 			reg = <0 0xa2000000 0 0x10000>;
 			sas-addr = [50 01 88 20 16 00 00 00];
 			hisilicon,sas-syscon = <&pcie_subctl>;
 			am-max-trans;
 			hip06-sas-v2-quirk-amt;
 			ctrl-reset-reg = <0xa18>;
 			ctrl-reset-sts-reg = <0x5a0c>;
 			ctrl-clock-ena-reg = <0x318>;

									
										4

arch/arm64/kernel/arm64ksyms.c
									
												View File
												
				@ -73,5 +73,5 @@ NOKPROBE_SYMBOL(_mcount);

				#endif

					/* arm-smccc */

				EXPORT_SYMBOL(arm_smccc_smc);

				EXPORT_SYMBOL(arm_smccc_hvc);

				EXPORT_SYMBOL(__arm_smccc_smc);

				EXPORT_SYMBOL(__arm_smccc_hvc);

									
										7

arch/arm64/kernel/asm-offsets.c
									
												View File
												
				@ -140,8 +140,11 @@ int main(void)

				  DEFINE(SLEEP_STACK_DATA_SYSTEM_REGS,	offsetof(struct sleep_stack_data, system_regs));

				  DEFINE(SLEEP_STACK_DATA_CALLEE_REGS,	offsetof(struct sleep_stack_data, callee_saved_regs));

				#endif

				  DEFINE(ARM_SMCCC_RES_X0_OFFS,	offsetof(struct arm_smccc_res, a0));

				  DEFINE(ARM_SMCCC_RES_X2_OFFS,	offsetof(struct arm_smccc_res, a2));

				  DEFINE(ARM_SMCCC_RES_X0_OFFS,		offsetof(struct arm_smccc_res, a0));

				  DEFINE(ARM_SMCCC_RES_X2_OFFS,		offsetof(struct arm_smccc_res, a2));

				  DEFINE(ARM_SMCCC_QUIRK_ID_OFFS,	offsetof(struct arm_smccc_quirk, id));

				  DEFINE(ARM_SMCCC_QUIRK_STATE_OFFS,	offsetof(struct arm_smccc_quirk, state));

				  BLANK();

				  DEFINE(HIBERN_PBE_ORIG,	offsetof(struct pbe, orig_address));

				  DEFINE(HIBERN_PBE_ADDR,	offsetof(struct pbe, address));

									
										10

arch/arm64/kernel/kaslr.c
									
												View File
												
				@ -131,11 +131,15 @@ u64 __init kaslr_early_init(u64 dt_phys, u64 modulo_offset)

					/*

					 * The kernel Image should not extend across a 1GB/32MB/512MB alignment

					 * boundary (for 4KB/16KB/64KB granule kernels, respectively). If this

					 * happens, increase the KASLR offset by the size of the kernel image.

					 * happens, increase the KASLR offset by the size of the kernel image

					 * rounded up by SWAPPER_BLOCK_SIZE.

					 */

					if ((((u64)_text + offset + modulo_offset) >> SWAPPER_TABLE_SHIFT) !=

					    (((u64)_end + offset + modulo_offset) >> SWAPPER_TABLE_SHIFT))

						offset = (offset + (u64)(_end - _text)) & mask;

					    (((u64)_end + offset + modulo_offset) >> SWAPPER_TABLE_SHIFT)) {

						u64 kimg_sz = _end - _text;

						offset = (offset + round_up(kimg_sz, SWAPPER_BLOCK_SIZE))

								& mask;

					}

					if (IS_ENABLED(CONFIG_KASAN))

						/*

									
										28

arch/arm64/kernel/pci.c
									
												View File
												
				@ -121,6 +121,7 @@ int pcibios_root_bridge_prepare(struct pci_host_bridge *bridge)

				static struct pci_config_window *

				pci_acpi_setup_ecam_mapping(struct acpi_pci_root *root)

				{

					struct device *dev = &root->device->dev;

					struct resource *bus_res = &root->secondary;

					u16 seg = root->segment;

					struct pci_config_window *cfg;

				@ -132,8 +133,7 @@ pci_acpi_setup_ecam_mapping(struct acpi_pci_root *root)

						root->mcfg_addr = pci_mcfg_lookup(seg, bus_res);

					if (!root->mcfg_addr) {

						dev_err(&root->device->dev, "%04x:%pR ECAM region not found\n",

							seg, bus_res);

						dev_err(dev, "%04x:%pR ECAM region not found\n", seg, bus_res);

						return NULL;

					}

				@ -141,11 +141,10 @@ pci_acpi_setup_ecam_mapping(struct acpi_pci_root *root)

					cfgres.start = root->mcfg_addr + bus_res->start * bsz;

					cfgres.end = cfgres.start + resource_size(bus_res) * bsz - 1;

					cfgres.flags = IORESOURCE_MEM;

					cfg = pci_ecam_create(&root->device->dev, &cfgres, bus_res,

							      &pci_generic_ecam_ops);

					cfg = pci_ecam_create(dev, &cfgres, bus_res, &pci_generic_ecam_ops);

					if (IS_ERR(cfg)) {

						dev_err(&root->device->dev, "%04x:%pR error %ld mapping ECAM\n",

							seg, bus_res, PTR_ERR(cfg));

						dev_err(dev, "%04x:%pR error %ld mapping ECAM\n", seg, bus_res,

							PTR_ERR(cfg));

						return NULL;

					}

				@ -159,33 +158,36 @@ static void pci_acpi_generic_release_info(struct acpi_pci_root_info *ci)

					ri = container_of(ci, struct acpi_pci_generic_root_info, common);

					pci_ecam_free(ri->cfg);

					kfree(ci->ops);

					kfree(ri);

				}

				static struct acpi_pci_root_ops acpi_pci_root_ops = {

					.release_info = pci_acpi_generic_release_info,

				};

				/* Interface called from ACPI code to setup PCI host controller */

				struct pci_bus *pci_acpi_scan_root(struct acpi_pci_root *root)

				{

					int node = acpi_get_node(root->device->handle);

					struct acpi_pci_generic_root_info *ri;

					struct pci_bus *bus, *child;

					struct acpi_pci_root_ops *root_ops;

					ri = kzalloc_node(sizeof(*ri), GFP_KERNEL, node);

					if (!ri)

						return NULL;

					root_ops = kzalloc_node(sizeof(*root_ops), GFP_KERNEL, node);

					if (!root_ops)

						return NULL;

					ri->cfg = pci_acpi_setup_ecam_mapping(root);

					if (!ri->cfg) {

						kfree(ri);

						kfree(root_ops);

						return NULL;

					}

					acpi_pci_root_ops.pci_ops = &ri->cfg->ops->pci_ops;

					bus = acpi_pci_root_create(root, &acpi_pci_root_ops, &ri->common,

								   ri->cfg);

					root_ops->release_info = pci_acpi_generic_release_info;

					root_ops->pci_ops = &ri->cfg->ops->pci_ops;

					bus = acpi_pci_root_create(root, root_ops, &ri->common, ri->cfg);

					if (!bus)

						return NULL;

									
										23

arch/arm64/kernel/smccc-call.S
									
												View File
												
				@ -12,6 +12,7 @@

				 *

				 */

				#include <linux/linkage.h>

				#include <linux/arm-smccc.h>

				#include <asm/asm-offsets.h>

					.macro SMCCC instr

				@ -20,24 +21,32 @@

					ldr	x4, [sp]

					stp	x0, x1, [x4, #ARM_SMCCC_RES_X0_OFFS]

					stp	x2, x3, [x4, #ARM_SMCCC_RES_X2_OFFS]

					ret

					ldr	x4, [sp, #8]

					cbz	x4, 1f /* no quirk structure */

					ldr	x9, [x4, #ARM_SMCCC_QUIRK_ID_OFFS]

					cmp	x9, #ARM_SMCCC_QUIRK_QCOM_A6

					b.ne	1f

					str	x6, [x4, ARM_SMCCC_QUIRK_STATE_OFFS]

				1:	ret

					.cfi_endproc

					.endm

				/*

				 * void arm_smccc_smc(unsigned long a0, unsigned long a1, unsigned long a2,

				 *		  unsigned long a3, unsigned long a4, unsigned long a5,

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res)

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res,

				 *		  struct arm_smccc_quirk *quirk)

				 */

				ENTRY(arm_smccc_smc)

				ENTRY(__arm_smccc_smc)

					SMCCC	smc

				ENDPROC(arm_smccc_smc)

				ENDPROC(__arm_smccc_smc)

				/*

				 * void arm_smccc_hvc(unsigned long a0, unsigned long a1, unsigned long a2,

				 *		  unsigned long a3, unsigned long a4, unsigned long a5,

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res)

				 *		  unsigned long a6, unsigned long a7, struct arm_smccc_res *res,

				 *		  struct arm_smccc_quirk *quirk)

				 */

				ENTRY(arm_smccc_hvc)

				ENTRY(__arm_smccc_hvc)

					SMCCC	hvc

				ENDPROC(arm_smccc_hvc)

				ENDPROC(__arm_smccc_hvc)

									
										64

arch/arm64/kvm/hyp/tlb.c
									
												View File
												
				@ -17,14 +17,62 @@

				#include <asm/kvm_hyp.h>

				static void __hyp_text __tlb_switch_to_guest_vhe(struct kvm *kvm)

				{

					u64 val;

					/*

					 * With VHE enabled, we have HCR_EL2.{E2H,TGE} = {1,1}, and

					 * most TLB operations target EL2/EL0. In order to affect the

					 * guest TLBs (EL1/EL0), we need to change one of these two

					 * bits. Changing E2H is impossible (goodbye TTBR1_EL2), so

					 * let's flip TGE before executing the TLB operation.

					 */

					write_sysreg(kvm->arch.vttbr, vttbr_el2);

					val = read_sysreg(hcr_el2);

					val &= ~HCR_TGE;

					write_sysreg(val, hcr_el2);

					isb();

				}

				static void __hyp_text __tlb_switch_to_guest_nvhe(struct kvm *kvm)

				{

					write_sysreg(kvm->arch.vttbr, vttbr_el2);

					isb();

				}

				static hyp_alternate_select(__tlb_switch_to_guest,

							    __tlb_switch_to_guest_nvhe,

							    __tlb_switch_to_guest_vhe,

							    ARM64_HAS_VIRT_HOST_EXTN);

				static void __hyp_text __tlb_switch_to_host_vhe(struct kvm *kvm)

				{

					/*

					 * We're done with the TLB operation, let's restore the host's

					 * view of HCR_EL2.

					 */

					write_sysreg(0, vttbr_el2);

					write_sysreg(HCR_HOST_VHE_FLAGS, hcr_el2);

				}

				static void __hyp_text __tlb_switch_to_host_nvhe(struct kvm *kvm)

				{

					write_sysreg(0, vttbr_el2);

				}

				static hyp_alternate_select(__tlb_switch_to_host,

							    __tlb_switch_to_host_nvhe,

							    __tlb_switch_to_host_vhe,

							    ARM64_HAS_VIRT_HOST_EXTN);

				void __hyp_text __kvm_tlb_flush_vmid_ipa(struct kvm *kvm, phys_addr_t ipa)

				{

					dsb(ishst);

					/* Switch to requested VMID */

					kvm = kern_hyp_va(kvm);

					write_sysreg(kvm->arch.vttbr, vttbr_el2);

					isb();

					__tlb_switch_to_guest()(kvm);

					/*

					 * We could do so much better if we had the VA as well.

				@ -45,7 +93,7 @@ void __hyp_text __kvm_tlb_flush_vmid_ipa(struct kvm *kvm, phys_addr_t ipa)

					dsb(ish);

					isb();

					write_sysreg(0, vttbr_el2);

					__tlb_switch_to_host()(kvm);

				}

				void __hyp_text __kvm_tlb_flush_vmid(struct kvm *kvm)

				@ -54,14 +102,13 @@ void __hyp_text __kvm_tlb_flush_vmid(struct kvm *kvm)

					/* Switch to requested VMID */

					kvm = kern_hyp_va(kvm);

					write_sysreg(kvm->arch.vttbr, vttbr_el2);

					isb();

					__tlb_switch_to_guest()(kvm);

					asm volatile("tlbi vmalls12e1is" : : );

					dsb(ish);

					isb();

					write_sysreg(0, vttbr_el2);

					__tlb_switch_to_host()(kvm);

				}

				void __hyp_text __kvm_tlb_flush_local_vmid(struct kvm_vcpu *vcpu)

				@ -69,14 +116,13 @@ void __hyp_text __kvm_tlb_flush_local_vmid(struct kvm_vcpu *vcpu)

					struct kvm *kvm = kern_hyp_va(kern_hyp_va(vcpu)->kvm);

					/* Switch to requested VMID */

					write_sysreg(kvm->arch.vttbr, vttbr_el2);

					isb();

					__tlb_switch_to_guest()(kvm);

					asm volatile("tlbi vmalle1" : : );

					dsb(nsh);

					isb();

					write_sysreg(0, vttbr_el2);

					__tlb_switch_to_host()(kvm);

				}

				void __hyp_text __kvm_flush_vm_context(void)

									
										42

arch/arm64/mm/fault.c
									
												View File
												
				@ -41,7 +41,20 @@

				#include <asm/pgtable.h>

				#include <asm/tlbflush.h>

				static const char *fault_name(unsigned int esr);

				struct fault_info {

					int	(*fn)(unsigned long addr, unsigned int esr,

						      struct pt_regs *regs);

					int	sig;

					int	code;

					const char *name;

				};

				static const struct fault_info fault_info[];

				static inline const struct fault_info *esr_to_fault_info(unsigned int esr)

				{

					return fault_info + (esr & 63);

				}

				#ifdef CONFIG_KPROBES

				static inline int notify_page_fault(struct pt_regs *regs, unsigned int esr)

				@ -196,10 +209,12 @@ static void __do_user_fault(struct task_struct *tsk, unsigned long addr,

							    struct pt_regs *regs)

				{

					struct siginfo si;

					const struct fault_info *inf;

					if (unhandled_signal(tsk, sig) && show_unhandled_signals_ratelimited()) {

						inf = esr_to_fault_info(esr);

						pr_info("%s[%d]: unhandled %s (%d) at 0x%08lx, esr 0x%03x\n",

							tsk->comm, task_pid_nr(tsk), fault_name(esr), sig,

							tsk->comm, task_pid_nr(tsk), inf->name, sig,

							addr, esr);

						show_pte(tsk->mm, addr);

						show_regs(regs);

				@ -218,14 +233,16 @@ static void do_bad_area(unsigned long addr, unsigned int esr, struct pt_regs *re

				{

					struct task_struct *tsk = current;

					struct mm_struct *mm = tsk->active_mm;

					const struct fault_info *inf;

					/*

					 * If we are in kernel mode at this point, we have no context to

					 * handle this fault with.

					 */

					if (user_mode(regs))

						__do_user_fault(tsk, addr, esr, SIGSEGV, SEGV_MAPERR, regs);

					else

					if (user_mode(regs)) {

						inf = esr_to_fault_info(esr);

						__do_user_fault(tsk, addr, esr, inf->sig, inf->code, regs);

					} else

						__do_kernel_fault(mm, addr, esr, regs);

				}

				@ -481,12 +498,7 @@ static int do_bad(unsigned long addr, unsigned int esr, struct pt_regs *regs)

					return 1;

				}

				static const struct fault_info {

					int	(*fn)(unsigned long addr, unsigned int esr, struct pt_regs *regs);

					int	sig;

					int	code;

					const char *name;

				} fault_info[] = {

				static const struct fault_info fault_info[] = {

					{ do_bad,		SIGBUS,  0,		"ttbr address size fault"	},

					{ do_bad,		SIGBUS,  0,		"level 1 address size fault"	},

					{ do_bad,		SIGBUS,  0,		"level 2 address size fault"	},

				@ -553,19 +565,13 @@ static const struct fault_info {

					{ do_bad,		SIGBUS,  0,		"unknown 63"			},

				};

				static const char *fault_name(unsigned int esr)

				{

					const struct fault_info *inf = fault_info + (esr & 63);

					return inf->name;

				}

				/*

				 * Dispatch a data abort to the relevant handler.

				 */

				asmlinkage void __exception do_mem_abort(unsigned long addr, unsigned int esr,

									 struct pt_regs *regs)

				{

					const struct fault_info *inf = fault_info + (esr & 63);

					const struct fault_info *inf = esr_to_fault_info(esr);

					struct siginfo info;

					if (!inf->fn(addr, esr, regs))

									
										41

arch/c6x/kernel/ptrace.c
									
												View File
												
				@ -69,46 +69,6 @@ static int gpr_get(struct task_struct *target,

								   0, sizeof(*regs));

				}

				static int gpr_set(struct task_struct *target,

						   const struct user_regset *regset,

						   unsigned int pos, unsigned int count,

						   const void *kbuf, const void __user *ubuf)

				{

					int ret;

					struct pt_regs *regs = task_pt_regs(target);

					/* Don't copyin TSR or CSR */

					ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

								 &regs,

								 0, PT_TSR * sizeof(long));

					if (ret)

						return ret;

					ret = user_regset_copyin_ignore(&pos, &count, &kbuf, &ubuf,

									PT_TSR * sizeof(long),

									(PT_TSR + 1) * sizeof(long));

					if (ret)

						return ret;

					ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

								 &regs,

								 (PT_TSR + 1) * sizeof(long),

								 PT_CSR * sizeof(long));

					if (ret)

						return ret;

					ret = user_regset_copyin_ignore(&pos, &count, &kbuf, &ubuf,

									PT_CSR * sizeof(long),

									(PT_CSR + 1) * sizeof(long));

					if (ret)

						return ret;

					ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

								 &regs,

								 (PT_CSR + 1) * sizeof(long), -1);

					return ret;

				}

				enum c6x_regset {

					REGSET_GPR,

				};

				@ -120,7 +80,6 @@ static const struct user_regset c6x_regsets[] = {

						.size = sizeof(u32),

						.align = sizeof(u32),

						.get = gpr_get,

						.set = gpr_set

					},

				};

									
										8

arch/h8300/kernel/ptrace.c
									
												View File
												
				@ -95,7 +95,8 @@ static int regs_get(struct task_struct *target,

					long *reg = (long *)&regs;

					/* build user regs in buffer */

					for (r = 0; r < ARRAY_SIZE(register_offset); r++)

					BUILD_BUG_ON(sizeof(regs) % sizeof(long) != 0);

					for (r = 0; r < sizeof(regs) / sizeof(long); r++)

						*reg++ = h8300_get_reg(target, r);

					return user_regset_copyout(&pos, &count, &kbuf, &ubuf,

				@ -113,7 +114,8 @@ static int regs_set(struct task_struct *target,

					long *reg;

					/* build user regs in buffer */

					for (reg = (long *)&regs, r = 0; r < ARRAY_SIZE(register_offset); r++)

					BUILD_BUG_ON(sizeof(regs) % sizeof(long) != 0);

					for (reg = (long *)&regs, r = 0; r < sizeof(regs) / sizeof(long); r++)

						*reg++ = h8300_get_reg(target, r);

					ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

				@ -122,7 +124,7 @@ static int regs_set(struct task_struct *target,

						return ret;

					/* write back to pt_regs */

					for (reg = (long *)&regs, r = 0; r < ARRAY_SIZE(register_offset); r++)

					for (reg = (long *)&regs, r = 0; r < sizeof(regs) / sizeof(long); r++)

						h8300_put_reg(target, r, *reg++);

					return 0;

				}

									
										15

arch/metag/include/asm/uaccess.h
									
												View File
												
				@ -197,20 +197,21 @@ extern long __must_check strnlen_user(const char __user *src, long count);

				#define strlen_user(str) strnlen_user(str, 32767)

				extern unsigned long __must_check __copy_user_zeroing(void *to,

										      const void __user *from,

										      unsigned long n);

				extern unsigned long raw_copy_from_user(void *to, const void __user *from,

									unsigned long n);

				static inline unsigned long

				copy_from_user(void *to, const void __user *from, unsigned long n)

				{

					unsigned long res = n;

					if (likely(access_ok(VERIFY_READ, from, n)))

						return __copy_user_zeroing(to, from, n);

					memset(to, 0, n);

					return n;

						res = raw_copy_from_user(to, from, n);

					if (unlikely(res))

						memset(to + (n - res), 0, res);

					return res;

				}

				#define __copy_from_user(to, from, n) __copy_user_zeroing(to, from, n)

				#define __copy_from_user(to, from, n) raw_copy_from_user(to, from, n)

				#define __copy_from_user_inatomic __copy_from_user

				extern unsigned long __must_check __copy_user(void __user *to,

									
										19

arch/metag/kernel/ptrace.c
									
												View File
												
				@ -24,6 +24,16 @@

				 * user_regset definitions.

				 */

				static unsigned long user_txstatus(const struct pt_regs *regs)

				{

					unsigned long data = (unsigned long)regs->ctx.Flags;

					if (regs->ctx.SaveMask & TBICTX_CBUF_BIT)

						data |= USER_GP_REGS_STATUS_CATCH_BIT;

					return data;

				}

				int metag_gp_regs_copyout(const struct pt_regs *regs,

							  unsigned int pos, unsigned int count,

							  void *kbuf, void __user *ubuf)

				@ -62,9 +72,7 @@ int metag_gp_regs_copyout(const struct pt_regs *regs,

					if (ret)

						goto out;

					/* TXSTATUS */

					data = (unsigned long)regs->ctx.Flags;

					if (regs->ctx.SaveMask & TBICTX_CBUF_BIT)

						data |= USER_GP_REGS_STATUS_CATCH_BIT;

					data = user_txstatus(regs);

					ret = user_regset_copyout(&pos, &count, &kbuf, &ubuf,

								  &data, 4*25, 4*26);

					if (ret)

				@ -119,6 +127,7 @@ int metag_gp_regs_copyin(struct pt_regs *regs,

					if (ret)

						goto out;

					/* TXSTATUS */

					data = user_txstatus(regs);

					ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

								 &data, 4*25, 4*26);

					if (ret)

				@ -244,6 +253,8 @@ int metag_rp_state_copyin(struct pt_regs *regs,

					unsigned long long *ptr;

					int ret, i;

					if (count < 4*13)

						return -EINVAL;

					/* Read the entire pipeline before making any changes */

					ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

								 &rp, 0, 4*13);

				@ -303,7 +314,7 @@ static int metag_tls_set(struct task_struct *target,

							const void *kbuf, const void __user *ubuf)

				{

					int ret;

					void __user *tls;

					void __user *tls = target->thread.tls_ptr;

					ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf, &tls, 0, -1);

					if (ret)

									
										324

arch/metag/lib/usercopy.c
									
												View File
												
				@ -29,7 +29,6 @@

						COPY						 \

						"1:\n"						 \

						"	.section .fixup,\"ax\"\n"		 \

						"	MOV D1Ar1,#0\n"				 \

						FIXUP						 \

						"	MOVT    D1Ar1,#HI(1b)\n"		 \

						"	JUMP    D1Ar1,#LO(1b)\n"		 \

				@ -260,27 +259,31 @@

						"MGETL	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"22:\n"							\

						"MSETL	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%3, %3, #32\n"					\

						"23:\n"							\

						"MGETL	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"SUB	%3, %3, #32\n"					\

						"24:\n"							\

						"MGETL	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"25:\n"							\

						"MSETL	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"26:\n"							\

						"SUB	%3, %3, #32\n"					\

						"DCACHE	[%1+#-64], D0Ar6\n"				\

						"BR	$Lloop"id"\n"					\

													\

						"MOV	RAPF, %1\n"					\

						"25:\n"							\

						"MGETL	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"26:\n"							\

						"MSETL	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%3, %3, #32\n"					\

						"27:\n"							\

						"MGETL	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"28:\n"							\

						"MSETL	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%0, %0, #8\n"					\

						"29:\n"							\

						"SUB	%3, %3, #32\n"					\

						"30:\n"							\

						"MGETL	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"31:\n"							\

						"MSETL	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"32:\n"							\

						"SUB	%0, %0, #8\n"					\

						"33:\n"							\

						"SETL	[%0++], D0.7, D1.7\n"				\

						"SUB	%3, %3, #32\n"					\

						"1:"							\

				@ -312,11 +315,15 @@

						"	.long 26b,3b\n"					\

						"	.long 27b,3b\n"					\

						"	.long 28b,3b\n"					\

						"	.long 29b,4b\n"					\

						"	.long 29b,3b\n"					\

						"	.long 30b,3b\n"					\

						"	.long 31b,3b\n"					\

						"	.long 32b,3b\n"					\

						"	.long 33b,4b\n"					\

						"	.previous\n"					\

						: "=r" (to), "=r" (from), "=r" (ret), "=d" (n)		\

						: "0" (to), "1" (from), "2" (ret), "3" (n)		\

						: "D1Ar1", "D0Ar2", "memory")

						: "D1Ar1", "D0Ar2", "cc", "memory")

				/*	rewind 'to' and 'from'  pointers when a fault occurs

				 *

				@ -342,7 +349,7 @@

				#define __asm_copy_to_user_64bit_rapf_loop(to,	from, ret, n, id)\

					__asm_copy_user_64bit_rapf_loop(to, from, ret, n, id,		\

						"LSR	D0Ar2, D0Ar2, #8\n"				\

						"AND	D0Ar2, D0Ar2, #0x7\n"				\

						"ANDS	D0Ar2, D0Ar2, #0x7\n"				\

						"ADDZ	D0Ar2, D0Ar2, #4\n"				\

						"SUB	D0Ar2, D0Ar2, #1\n"				\

						"MOV	D1Ar1, #4\n"					\

				@ -403,47 +410,55 @@

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"22:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%3, %3, #16\n"					\

						"23:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"24:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%3, %3, #16\n"					\

						"25:\n"							\

						"24:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"26:\n"							\

						"25:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"26:\n"							\

						"SUB	%3, %3, #16\n"					\

						"27:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"28:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"29:\n"							\

						"SUB	%3, %3, #16\n"					\

						"30:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"31:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"32:\n"							\

						"SUB	%3, %3, #16\n"					\

						"DCACHE	[%1+#-64], D0Ar6\n"				\

						"BR	$Lloop"id"\n"					\

													\

						"MOV	RAPF, %1\n"					\

						"29:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"30:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%3, %3, #16\n"					\

						"31:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"32:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%3, %3, #16\n"					\

						"33:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"34:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%3, %3, #16\n"					\

						"35:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"SUB	%3, %3, #16\n"					\

						"36:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"SUB	%0, %0, #4\n"					\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"37:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"38:\n"							\

						"SUB	%3, %3, #16\n"					\

						"39:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"40:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"41:\n"							\

						"SUB	%3, %3, #16\n"					\

						"42:\n"							\

						"MGETD	D0FrT, D0.5, D0.6, D0.7, [%1++]\n"		\

						"43:\n"							\

						"MSETD	[%0++], D0FrT, D0.5, D0.6, D0.7\n"		\

						"44:\n"							\

						"SUB	%0, %0, #4\n"					\

						"45:\n"							\

						"SETD	[%0++], D0.7\n"					\

						"SUB	%3, %3, #16\n"					\

						"1:"							\

				@ -483,11 +498,19 @@

						"	.long 34b,3b\n"					\

						"	.long 35b,3b\n"					\

						"	.long 36b,3b\n"					\

						"	.long 37b,4b\n"					\

						"	.long 37b,3b\n"					\

						"	.long 38b,3b\n"					\

						"	.long 39b,3b\n"					\

						"	.long 40b,3b\n"					\

						"	.long 41b,3b\n"					\

						"	.long 42b,3b\n"					\

						"	.long 43b,3b\n"					\

						"	.long 44b,3b\n"					\

						"	.long 45b,4b\n"					\

						"	.previous\n"					\

						: "=r" (to), "=r" (from), "=r" (ret), "=d" (n)		\

						: "0" (to), "1" (from), "2" (ret), "3" (n)		\

						: "D1Ar1", "D0Ar2", "memory")

						: "D1Ar1", "D0Ar2", "cc", "memory")

				/*	rewind 'to' and 'from'  pointers when a fault occurs

				 *

				@ -513,7 +536,7 @@

				#define __asm_copy_to_user_32bit_rapf_loop(to, from, ret, n, id)\

					__asm_copy_user_32bit_rapf_loop(to, from, ret, n, id,		\

						"LSR	D0Ar2, D0Ar2, #8\n"				\

						"AND	D0Ar2, D0Ar2, #0x7\n"				\

						"ANDS	D0Ar2, D0Ar2, #0x7\n"				\

						"ADDZ	D0Ar2, D0Ar2, #4\n"				\

						"SUB	D0Ar2, D0Ar2, #1\n"				\

						"MOV	D1Ar1, #4\n"					\

				@ -538,23 +561,31 @@ unsigned long __copy_user(void __user *pdst, const void *psrc,

					if ((unsigned long) src & 1) {

						__asm_copy_to_user_1(dst, src, retn);

						n--;

						if (retn)

							return retn + n;

					}

					if ((unsigned long) dst & 1) {

						/* Worst case - byte copy */

						while (n > 0) {

							__asm_copy_to_user_1(dst, src, retn);

							n--;

							if (retn)

								return retn + n;

						}

					}

					if (((unsigned long) src & 2) && n >= 2) {

						__asm_copy_to_user_2(dst, src, retn);

						n -= 2;

						if (retn)

							return retn + n;

					}

					if ((unsigned long) dst & 2) {

						/* Second worst case - word copy */

						while (n >= 2) {

							__asm_copy_to_user_2(dst, src, retn);

							n -= 2;

							if (retn)

								return retn + n;

						}

					}

				@ -569,6 +600,8 @@ unsigned long __copy_user(void __user *pdst, const void *psrc,

						while (n >= 8) {

							__asm_copy_to_user_8x64(dst, src, retn);

							n -= 8;

							if (retn)

								return retn + n;

						}

					}

					if (n >= RAPF_MIN_BUF_SIZE) {

				@ -581,6 +614,8 @@ unsigned long __copy_user(void __user *pdst, const void *psrc,

						while (n >= 8) {

							__asm_copy_to_user_8x64(dst, src, retn);

							n -= 8;

							if (retn)

								return retn + n;

						}

					}

				#endif

				@ -588,11 +623,15 @@ unsigned long __copy_user(void __user *pdst, const void *psrc,

					while (n >= 16) {

						__asm_copy_to_user_16(dst, src, retn);

						n -= 16;

						if (retn)

							return retn + n;

					}

					while (n >= 4) {

						__asm_copy_to_user_4(dst, src, retn);

						n -= 4;

						if (retn)

							return retn + n;

					}

					switch (n) {

				@ -609,6 +648,10 @@ unsigned long __copy_user(void __user *pdst, const void *psrc,

						break;

					}

					/*

					 * If we get here, retn correctly reflects the number of failing

					 * bytes.

					 */

					return retn;

				}

				EXPORT_SYMBOL(__copy_user);

				@ -617,16 +660,14 @@ EXPORT_SYMBOL(__copy_user);

					__asm_copy_user_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"	\

						"2:	SETB [%0++],D1Ar1\n",	\

						"3:	ADD  %2,%2,#1\n"	\

						"	SETB [%0++],D1Ar1\n",	\

						"3:	ADD  %2,%2,#1\n",	\

						"	.long 2b,3b\n")

				#define __asm_copy_from_user_2x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_user_cont(to, from, ret,		\

						"	GETW D1Ar1,[%1++]\n"		\

						"2:	SETW [%0++],D1Ar1\n" COPY,	\

						"3:	ADD  %2,%2,#2\n"		\

						"	SETW [%0++],D1Ar1\n" FIXUP,	\

						"3:	ADD  %2,%2,#2\n" FIXUP,		\

						"	.long 2b,3b\n" TENTRY)

				#define __asm_copy_from_user_2(to, from, ret) \

				@ -636,145 +677,26 @@ EXPORT_SYMBOL(__copy_user);

					__asm_copy_from_user_2x_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"		\

						"4:	SETB [%0++],D1Ar1\n",		\

						"5:	ADD  %2,%2,#1\n"		\

						"	SETB [%0++],D1Ar1\n",		\

						"5:	ADD  %2,%2,#1\n",		\

						"	.long 4b,5b\n")

				#define __asm_copy_from_user_4x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_user_cont(to, from, ret,		\

						"	GETD D1Ar1,[%1++]\n"		\

						"2:	SETD [%0++],D1Ar1\n" COPY,	\

						"3:	ADD  %2,%2,#4\n"		\

						"	SETD [%0++],D1Ar1\n" FIXUP,	\

						"3:	ADD  %2,%2,#4\n" FIXUP,		\

						"	.long 2b,3b\n" TENTRY)

				#define __asm_copy_from_user_4(to, from, ret) \

					__asm_copy_from_user_4x_cont(to, from, ret, "", "", "")

				#define __asm_copy_from_user_5(to, from, ret) \

					__asm_copy_from_user_4x_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"		\

						"4:	SETB [%0++],D1Ar1\n",		\

						"5:	ADD  %2,%2,#1\n"		\

						"	SETB [%0++],D1Ar1\n",		\

						"	.long 4b,5b\n")

				#define __asm_copy_from_user_6x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_from_user_4x_cont(to, from, ret,	\

						"	GETW D1Ar1,[%1++]\n"		\

						"4:	SETW [%0++],D1Ar1\n" COPY,	\

						"5:	ADD  %2,%2,#2\n"		\

						"	SETW [%0++],D1Ar1\n" FIXUP,	\

						"	.long 4b,5b\n" TENTRY)

				#define __asm_copy_from_user_6(to, from, ret) \

					__asm_copy_from_user_6x_cont(to, from, ret, "", "", "")

				#define __asm_copy_from_user_7(to, from, ret) \

					__asm_copy_from_user_6x_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"		\

						"6:	SETB [%0++],D1Ar1\n",		\

						"7:	ADD  %2,%2,#1\n"		\

						"	SETB [%0++],D1Ar1\n",		\

						"	.long 6b,7b\n")

				#define __asm_copy_from_user_8x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_from_user_4x_cont(to, from, ret,	\

						"	GETD D1Ar1,[%1++]\n"		\

						"4:	SETD [%0++],D1Ar1\n" COPY,	\

						"5:	ADD  %2,%2,#4\n"			\

						"	SETD [%0++],D1Ar1\n" FIXUP,		\

						"	.long 4b,5b\n" TENTRY)

				#define __asm_copy_from_user_8(to, from, ret) \

					__asm_copy_from_user_8x_cont(to, from, ret, "", "", "")

				#define __asm_copy_from_user_9(to, from, ret) \

					__asm_copy_from_user_8x_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"		\

						"6:	SETB [%0++],D1Ar1\n",		\

						"7:	ADD  %2,%2,#1\n"		\

						"	SETB [%0++],D1Ar1\n",		\

						"	.long 6b,7b\n")

				#define __asm_copy_from_user_10x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_from_user_8x_cont(to, from, ret,	\

						"	GETW D1Ar1,[%1++]\n"		\

						"6:	SETW [%0++],D1Ar1\n" COPY,	\

						"7:	ADD  %2,%2,#2\n"		\

						"	SETW [%0++],D1Ar1\n" FIXUP,	\

						"	.long 6b,7b\n" TENTRY)

				#define __asm_copy_from_user_10(to, from, ret) \

					__asm_copy_from_user_10x_cont(to, from, ret, "", "", "")

				#define __asm_copy_from_user_11(to, from, ret)		\

					__asm_copy_from_user_10x_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"		\

						"8:	SETB [%0++],D1Ar1\n",		\

						"9:	ADD  %2,%2,#1\n"		\

						"	SETB [%0++],D1Ar1\n",		\

						"	.long 8b,9b\n")

				#define __asm_copy_from_user_12x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_from_user_8x_cont(to, from, ret,	\

						"	GETD D1Ar1,[%1++]\n"		\

						"6:	SETD [%0++],D1Ar1\n" COPY,	\

						"7:	ADD  %2,%2,#4\n"		\

						"	SETD [%0++],D1Ar1\n" FIXUP,	\

						"	.long 6b,7b\n" TENTRY)

				#define __asm_copy_from_user_12(to, from, ret) \

					__asm_copy_from_user_12x_cont(to, from, ret, "", "", "")

				#define __asm_copy_from_user_13(to, from, ret) \

					__asm_copy_from_user_12x_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"		\

						"8:	SETB [%0++],D1Ar1\n",		\

						"9:	ADD  %2,%2,#1\n"		\

						"	SETB [%0++],D1Ar1\n",		\

						"	.long 8b,9b\n")

				#define __asm_copy_from_user_14x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_from_user_12x_cont(to, from, ret,	\

						"	GETW D1Ar1,[%1++]\n"		\

						"8:	SETW [%0++],D1Ar1\n" COPY,	\

						"9:	ADD  %2,%2,#2\n"		\

						"	SETW [%0++],D1Ar1\n" FIXUP,	\

						"	.long 8b,9b\n" TENTRY)

				#define __asm_copy_from_user_14(to, from, ret) \

					__asm_copy_from_user_14x_cont(to, from, ret, "", "", "")

				#define __asm_copy_from_user_15(to, from, ret) \

					__asm_copy_from_user_14x_cont(to, from, ret,	\

						"	GETB D1Ar1,[%1++]\n"		\

						"10:	SETB [%0++],D1Ar1\n",		\

						"11:	ADD  %2,%2,#1\n"		\

						"	SETB [%0++],D1Ar1\n",		\

						"	.long 10b,11b\n")

				#define __asm_copy_from_user_16x_cont(to, from, ret, COPY, FIXUP, TENTRY) \

					__asm_copy_from_user_12x_cont(to, from, ret,	\

						"	GETD D1Ar1,[%1++]\n"		\

						"8:	SETD [%0++],D1Ar1\n" COPY,	\

						"9:	ADD  %2,%2,#4\n"		\

						"	SETD [%0++],D1Ar1\n" FIXUP,	\

						"	.long 8b,9b\n" TENTRY)

				#define __asm_copy_from_user_16(to, from, ret) \

					__asm_copy_from_user_16x_cont(to, from, ret, "", "", "")

				#define __asm_copy_from_user_8x64(to, from, ret) \

					asm volatile (				\

						"	GETL D0Ar2,D1Ar1,[%1++]\n"	\

						"2:	SETL [%0++],D0Ar2,D1Ar1\n"	\

						"1:\n"					\

						"	.section .fixup,\"ax\"\n"	\

						"	MOV D1Ar1,#0\n"			\

						"	MOV D0Ar2,#0\n"			\

						"3:	ADD  %2,%2,#8\n"		\

						"	SETL [%0++],D0Ar2,D1Ar1\n"	\

						"	MOVT    D0Ar2,#HI(1b)\n"	\

						"	JUMP    D0Ar2,#LO(1b)\n"	\

						"	.previous\n"			\

				@ -789,36 +711,57 @@ EXPORT_SYMBOL(__copy_user);

				 *

				 *	Rationale:

				 *		A fault occurs while reading from user buffer, which is the

				 *		source. Since the fault is at a single address, we only

				 *		need to rewind by 8 bytes.

				 *		source.

				 *		Since we don't write to kernel buffer until we read first,

				 *		the kernel buffer is at the right state and needn't be

				 *		corrected.

				 *		corrected, but the source must be rewound to the beginning of

				 *		the block, which is LSM_STEP*8 bytes.

				 *		LSM_STEP is bits 10:8 in TXSTATUS which is already read

				 *		and stored in D0Ar2

				 *

				 *		NOTE: If a fault occurs at the last operation in M{G,S}ETL

				 *			LSM_STEP will be 0. ie: we do 4 writes in our case, if

				 *			a fault happens at the 4th write, LSM_STEP will be 0

				 *			instead of 4. The code copes with that.

				 */

				#define __asm_copy_from_user_64bit_rapf_loop(to, from, ret, n, id)	\

					__asm_copy_user_64bit_rapf_loop(to, from, ret, n, id,		\

						"SUB	%1, %1, #8\n")

						"LSR	D0Ar2, D0Ar2, #5\n"				\

						"ANDS	D0Ar2, D0Ar2, #0x38\n"				\

						"ADDZ	D0Ar2, D0Ar2, #32\n"				\

						"SUB	%1, %1, D0Ar2\n")

				/*	rewind 'from' pointer when a fault occurs

				 *

				 *	Rationale:

				 *		A fault occurs while reading from user buffer, which is the

				 *		source. Since the fault is at a single address, we only

				 *		need to rewind by 4 bytes.

				 *		source.

				 *		Since we don't write to kernel buffer until we read first,

				 *		the kernel buffer is at the right state and needn't be

				 *		corrected.

				 *		corrected, but the source must be rewound to the beginning of

				 *		the block, which is LSM_STEP*4 bytes.

				 *		LSM_STEP is bits 10:8 in TXSTATUS which is already read

				 *		and stored in D0Ar2

				 *

				 *		NOTE: If a fault occurs at the last operation in M{G,S}ETL

				 *			LSM_STEP will be 0. ie: we do 4 writes in our case, if

				 *			a fault happens at the 4th write, LSM_STEP will be 0

				 *			instead of 4. The code copes with that.

				 */

				#define __asm_copy_from_user_32bit_rapf_loop(to, from, ret, n, id)	\

					__asm_copy_user_32bit_rapf_loop(to, from, ret, n, id,		\

						"SUB	%1, %1, #4\n")

						"LSR	D0Ar2, D0Ar2, #6\n"				\

						"ANDS	D0Ar2, D0Ar2, #0x1c\n"				\

						"ADDZ	D0Ar2, D0Ar2, #16\n"				\

						"SUB	%1, %1, D0Ar2\n")

				/* Copy from user to kernel, zeroing the bytes that were inaccessible in

				   userland.  The return-value is the number of bytes that were

				   inaccessible.  */

				unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

								  unsigned long n)

				/*

				 * Copy from user to kernel. The return-value is the number of bytes that were

				 * inaccessible.

				 */

				unsigned long raw_copy_from_user(void *pdst, const void __user *psrc,

								 unsigned long n)

				{

					register char *dst asm ("A0.2") = pdst;

					register const char __user *src asm ("A1.2") = psrc;

				@ -830,6 +773,8 @@ unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

					if ((unsigned long) src & 1) {

						__asm_copy_from_user_1(dst, src, retn);

						n--;

						if (retn)

							return retn + n;

					}

					if ((unsigned long) dst & 1) {

						/* Worst case - byte copy */

				@ -837,12 +782,14 @@ unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

							__asm_copy_from_user_1(dst, src, retn);

							n--;

							if (retn)

								goto copy_exception_bytes;

								return retn + n;

						}

					}

					if (((unsigned long) src & 2) && n >= 2) {

						__asm_copy_from_user_2(dst, src, retn);

						n -= 2;

						if (retn)

							return retn + n;

					}

					if ((unsigned long) dst & 2) {

						/* Second worst case - word copy */

				@ -850,16 +797,10 @@ unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

							__asm_copy_from_user_2(dst, src, retn);

							n -= 2;

							if (retn)

								goto copy_exception_bytes;

								return retn + n;

						}

					}

					/* We only need one check after the unalignment-adjustments,

					   because if both adjustments were done, either both or

					   neither reference had an exception.  */

					if (retn != 0)

						goto copy_exception_bytes;

				#ifdef USE_RAPF

					/* 64 bit copy loop */

					if (!(((unsigned long) src | (unsigned long) dst) & 7)) {

				@ -872,7 +813,7 @@ unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

							__asm_copy_from_user_8x64(dst, src, retn);

							n -= 8;

							if (retn)

								goto copy_exception_bytes;

								return retn + n;

						}

					}

				@ -888,7 +829,7 @@ unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

							__asm_copy_from_user_8x64(dst, src, retn);

							n -= 8;

							if (retn)

								goto copy_exception_bytes;

								return retn + n;

						}

					}

				#endif

				@ -898,7 +839,7 @@ unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

						n -= 4;

						if (retn)

							goto copy_exception_bytes;

							return retn + n;

					}

					/* If we get here, there were no memory read faults.  */

				@ -924,21 +865,8 @@ unsigned long __copy_user_zeroing(void *pdst, const void __user *psrc,

					/* If we get here, retn correctly reflects the number of failing

					   bytes.  */

					return retn;

				 copy_exception_bytes:

					/* We already have "retn" bytes cleared, and need to clear the

					   remaining "n" bytes.  A non-optimized simple byte-for-byte in-line

					   memset is preferred here, since this isn't speed-critical code and

					   we'd rather have this a leaf-function than calling memset.  */

					{

						char *endp;

						for (endp = dst + n; dst < endp; dst++)

							*dst = 0;

					}

					return retn + n;

				}

				EXPORT_SYMBOL(__copy_user_zeroing);

				EXPORT_SYMBOL(raw_copy_from_user);

				#define __asm_clear_8x64(to, ret) \

					asm volatile (					\

2

arch/mips/Kconfig

View File

 @ -1526,7 +1526,7 @@ config CPU_MIPS64_R6
 	select CPU_SUPPORTS_HIGHMEM
 	select CPU_SUPPORTS_MSA
 	select GENERIC_CSUM
 	select MIPS_O32_FP64_SUPPORT if MIPS32_O32
 	select MIPS_O32_FP64_SUPPORT if 32BIT || MIPS32_O32
 	select HAVE_KVM
 	help
 	  Choose this option to build a kernel for release 6 or later of the

4

arch/mips/configs/ip22_defconfig

View File

 @ -67,8 +67,8 @@ CONFIG_NETFILTER_NETLINK_QUEUE=m
 CONFIG_NF_CONNTRACK=m
 CONFIG_NF_CONNTRACK_SECMARK=y
 CONFIG_NF_CONNTRACK_EVENTS=y
 CONFIG_NF_CT_PROTO_DCCP=m
 CONFIG_NF_CT_PROTO_UDPLITE=m
 CONFIG_NF_CT_PROTO_DCCP=y
 CONFIG_NF_CT_PROTO_UDPLITE=y
 CONFIG_NF_CONNTRACK_AMANDA=m
 CONFIG_NF_CONNTRACK_FTP=m
 CONFIG_NF_CONNTRACK_H323=m

3

arch/mips/configs/ip27_defconfig

View File

 @ -133,7 +133,7 @@ CONFIG_LIBFC=m
 CONFIG_SCSI_QLOGIC_1280=y
 CONFIG_SCSI_PMCRAID=m
 CONFIG_SCSI_BFA_FC=m
 CONFIG_SCSI_DH=m
 CONFIG_SCSI_DH=y
 CONFIG_SCSI_DH_RDAC=m
 CONFIG_SCSI_DH_HP_SW=m
 CONFIG_SCSI_DH_EMC=m
 @ -205,7 +205,6 @@ CONFIG_MLX4_EN=m
 # CONFIG_MLX4_DEBUG is not set
 CONFIG_TEHUTI=m
 CONFIG_BNX2X=m
 CONFIG_QLGE=m
 CONFIG_SFC=m
 CONFIG_BE2NET=m
 CONFIG_LIBERTAS_THINFIRM=m

2

arch/mips/configs/lemote2f_defconfig

View File

 @ -39,7 +39,7 @@ CONFIG_HIBERNATION=y
 CONFIG_PM_STD_PARTITION="/dev/hda3"
 CONFIG_CPU_FREQ=y
 CONFIG_CPU_FREQ_DEBUG=y
 CONFIG_CPU_FREQ_STAT=m
 CONFIG_CPU_FREQ_STAT=y
 CONFIG_CPU_FREQ_STAT_DETAILS=y
 CONFIG_CPU_FREQ_DEFAULT_GOV_ONDEMAND=y
 CONFIG_CPU_FREQ_GOV_POWERSAVE=m

4

arch/mips/configs/malta_defconfig

View File

 @ -59,8 +59,8 @@ CONFIG_NETFILTER=y
 CONFIG_NF_CONNTRACK=m
 CONFIG_NF_CONNTRACK_SECMARK=y
 CONFIG_NF_CONNTRACK_EVENTS=y
 CONFIG_NF_CT_PROTO_DCCP=m
 CONFIG_NF_CT_PROTO_UDPLITE=m
 CONFIG_NF_CT_PROTO_DCCP=y
 CONFIG_NF_CT_PROTO_UDPLITE=y
 CONFIG_NF_CONNTRACK_AMANDA=m
 CONFIG_NF_CONNTRACK_FTP=m
 CONFIG_NF_CONNTRACK_H323=m

4

arch/mips/configs/malta_kvm_defconfig

View File

 @ -60,8 +60,8 @@ CONFIG_NETFILTER=y
 CONFIG_NF_CONNTRACK=m
 CONFIG_NF_CONNTRACK_SECMARK=y
 CONFIG_NF_CONNTRACK_EVENTS=y
 CONFIG_NF_CT_PROTO_DCCP=m
 CONFIG_NF_CT_PROTO_UDPLITE=m
 CONFIG_NF_CT_PROTO_DCCP=y
 CONFIG_NF_CT_PROTO_UDPLITE=y
 CONFIG_NF_CONNTRACK_AMANDA=m
 CONFIG_NF_CONNTRACK_FTP=m
 CONFIG_NF_CONNTRACK_H323=m

4

arch/mips/configs/malta_kvm_guest_defconfig

View File

 @ -59,8 +59,8 @@ CONFIG_NETFILTER=y
 CONFIG_NF_CONNTRACK=m
 CONFIG_NF_CONNTRACK_SECMARK=y
 CONFIG_NF_CONNTRACK_EVENTS=y
 CONFIG_NF_CT_PROTO_DCCP=m
 CONFIG_NF_CT_PROTO_UDPLITE=m
 CONFIG_NF_CT_PROTO_DCCP=y
 CONFIG_NF_CT_PROTO_UDPLITE=y
 CONFIG_NF_CONNTRACK_AMANDA=m
 CONFIG_NF_CONNTRACK_FTP=m
 CONFIG_NF_CONNTRACK_H323=m

4

arch/mips/configs/maltaup_xpa_defconfig

View File

 @ -61,8 +61,8 @@ CONFIG_NETFILTER=y
 CONFIG_NF_CONNTRACK=m
 CONFIG_NF_CONNTRACK_SECMARK=y
 CONFIG_NF_CONNTRACK_EVENTS=y
 CONFIG_NF_CT_PROTO_DCCP=m
 CONFIG_NF_CT_PROTO_UDPLITE=m
 CONFIG_NF_CT_PROTO_DCCP=y
 CONFIG_NF_CT_PROTO_UDPLITE=y
 CONFIG_NF_CONNTRACK_AMANDA=m
 CONFIG_NF_CONNTRACK_FTP=m
 CONFIG_NF_CONNTRACK_H323=m

2

arch/mips/configs/nlm_xlp_defconfig

View File

 @ -110,7 +110,7 @@ CONFIG_NETFILTER=y
 CONFIG_NF_CONNTRACK=m
 CONFIG_NF_CONNTRACK_SECMARK=y
 CONFIG_NF_CONNTRACK_EVENTS=y
 CONFIG_NF_CT_PROTO_UDPLITE=m
 CONFIG_NF_CT_PROTO_UDPLITE=y
 CONFIG_NF_CONNTRACK_AMANDA=m
 CONFIG_NF_CONNTRACK_FTP=m
 CONFIG_NF_CONNTRACK_H323=m

2

arch/mips/configs/nlm_xlr_defconfig

View File

 @ -90,7 +90,7 @@ CONFIG_NETFILTER=y
 CONFIG_NF_CONNTRACK=m
 CONFIG_NF_CONNTRACK_SECMARK=y
 CONFIG_NF_CONNTRACK_EVENTS=y
 CONFIG_NF_CT_PROTO_UDPLITE=m
 CONFIG_NF_CT_PROTO_UDPLITE=y
 CONFIG_NF_CONNTRACK_AMANDA=m
 CONFIG_NF_CONNTRACK_FTP=m
 CONFIG_NF_CONNTRACK_H323=m

									
										6

arch/mips/include/asm/mach-ip27/spaces.h
									
												View File
												
				@ -12,14 +12,16 @@

				/*

				 * IP27 uses the R10000's uncached attribute feature.  Attribute 3 selects

				 * uncached memory addressing.

				 * uncached memory addressing. Hide the definitions on 32-bit compilation

				 * of the compat-vdso code.

				 */

				#ifdef CONFIG_64BIT

				#define HSPEC_BASE		0x9000000000000000

				#define IO_BASE			0x9200000000000000

				#define MSPEC_BASE		0x9400000000000000

				#define UNCAC_BASE		0x9600000000000000

				#define CAC_BASE		0xa800000000000000

				#endif

				#define TO_MSPEC(x)		(MSPEC_BASE | ((x) & TO_PHYS_MASK))

				#define TO_HSPEC(x)		(HSPEC_BASE | ((x) & TO_PHYS_MASK))

									
										8

arch/mips/include/asm/spinlock.h
									
												View File
												
				@ -127,7 +127,7 @@ static inline void arch_spin_lock(arch_spinlock_t *lock)

						"	andi	%[ticket], %[ticket], 0xffff		\n"

						"	bne	%[ticket], %[my_ticket], 4f		\n"

						"	 subu	%[ticket], %[my_ticket], %[ticket]	\n"

						"2:							\n"

						"2:	.insn						\n"

						"	.subsection 2					\n"

						"4:	andi	%[ticket], %[ticket], 0xffff		\n"

						"	sll	%[ticket], 5				\n"

				@ -202,7 +202,7 @@ static inline unsigned int arch_spin_trylock(arch_spinlock_t *lock)

						"	sc	%[ticket], %[ticket_ptr]		\n"

						"	beqz	%[ticket], 1b				\n"

						"	 li	%[ticket], 1				\n"

						"2:							\n"

						"2:	.insn						\n"

						"	.subsection 2					\n"

						"3:	b	2b					\n"

						"	 li	%[ticket], 0				\n"

				@ -382,7 +382,7 @@ static inline int arch_read_trylock(arch_rwlock_t *rw)

						"	.set	reorder					\n"

						__WEAK_LLSC_MB

						"	li	%2, 1					\n"

						"2:							\n"

						"2:	.insn						\n"

						: "=" GCC_OFF_SMALL_ASM() (rw->lock), "=&r" (tmp), "=&r" (ret)

						: GCC_OFF_SMALL_ASM() (rw->lock)

						: "memory");

				@ -422,7 +422,7 @@ static inline int arch_write_trylock(arch_rwlock_t *rw)

							"	lui	%1, 0x8000			\n"

							"	sc	%1, %0				\n"

							"	li	%2, 1				\n"

							"2:						\n"

							"2:	.insn					\n"

							: "=" GCC_OFF_SMALL_ASM() (rw->lock), "=&r" (tmp),

							  "=&r" (ret)

							: GCC_OFF_SMALL_ASM() (rw->lock)

									
										2

arch/mips/kernel/cpu-probe.c
									
												View File
												
				@ -1824,7 +1824,7 @@ static inline void cpu_probe_loongson(struct cpuinfo_mips *c, unsigned int cpu)

						}

						decode_configs(c);

						c->options |= MIPS_CPU_TLBINV | MIPS_CPU_LDPTE;

						c->options |= MIPS_CPU_FTLB | MIPS_CPU_TLBINV | MIPS_CPU_LDPTE;

						c->writecombine = _CACHE_UNCACHED_ACCELERATED;

						break;

					default:

									
										4

arch/mips/kernel/genex.S
									
												View File
												
				@ -448,7 +448,7 @@ NESTED(nmi_handler, PT_SIZE, sp)

					BUILD_HANDLER reserved reserved sti verbose	/* others */

					.align	5

					LEAF(handle_ri_rdhwr_vivt)

					LEAF(handle_ri_rdhwr_tlbp)

					.set	push

					.set	noat

					.set	noreorder

				@ -467,7 +467,7 @@ NESTED(nmi_handler, PT_SIZE, sp)

					.set	pop

					bltz	k1, handle_ri	/* slow path */

					/* fall thru */

					END(handle_ri_rdhwr_vivt)

					END(handle_ri_rdhwr_tlbp)

					LEAF(handle_ri_rdhwr)

					.set	push

									
										3

arch/mips/kernel/ptrace.c
									
												View File
												
				@ -485,7 +485,8 @@ static int fpr_set(struct task_struct *target,

									  &target->thread.fpu,

									  0, sizeof(elf_fpregset_t));

					for (i = 0; i < NUM_FPU_REGS; i++) {

					BUILD_BUG_ON(sizeof(fpr_val) != sizeof(elf_fpreg_t));

					for (i = 0; i < NUM_FPU_REGS && count >= sizeof(elf_fpreg_t); i++) {

						err = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

									 &fpr_val, i * sizeof(elf_fpreg_t),

									 (i + 1) * sizeof(elf_fpreg_t));

									
										17

arch/mips/kernel/traps.c
									
												View File
												
				@ -81,7 +81,7 @@ extern asmlinkage void handle_dbe(void);

				extern asmlinkage void handle_sys(void);

				extern asmlinkage void handle_bp(void);

				extern asmlinkage void handle_ri(void);

				extern asmlinkage void handle_ri_rdhwr_vivt(void);

				extern asmlinkage void handle_ri_rdhwr_tlbp(void);

				extern asmlinkage void handle_ri_rdhwr(void);

				extern asmlinkage void handle_cpu(void);

				extern asmlinkage void handle_ov(void);

				@ -2352,9 +2352,18 @@ void __init trap_init(void)

					set_except_vector(EXCCODE_SYS, handle_sys);

					set_except_vector(EXCCODE_BP, handle_bp);

					set_except_vector(EXCCODE_RI, rdhwr_noopt ? handle_ri :

							  (cpu_has_vtag_icache ?

							   handle_ri_rdhwr_vivt : handle_ri_rdhwr));

					if (rdhwr_noopt)

						set_except_vector(EXCCODE_RI, handle_ri);

					else {

						if (cpu_has_vtag_icache)

							set_except_vector(EXCCODE_RI, handle_ri_rdhwr_tlbp);

						else if (current_cpu_type() == CPU_LOONGSON3)

							set_except_vector(EXCCODE_RI, handle_ri_rdhwr_tlbp);

						else

							set_except_vector(EXCCODE_RI, handle_ri_rdhwr);

					}

					set_except_vector(EXCCODE_CPU, handle_cpu);

					set_except_vector(EXCCODE_OV, handle_ov);

					set_except_vector(EXCCODE_TR, handle_tr);

									
										36

arch/mips/lantiq/irq.c
									
												View File
												
				@ -269,6 +269,11 @@ static void ltq_hw5_irqdispatch(void)

				DEFINE_HWx_IRQDISPATCH(5)

				#endif

				static void ltq_hw_irq_handler(struct irq_desc *desc)

				{

					ltq_hw_irqdispatch(irq_desc_get_irq(desc) - 2);

				}

				#ifdef CONFIG_MIPS_MT_SMP

				void __init arch_init_ipiirq(int irq, struct irqaction *action)

				{

				@ -313,23 +318,19 @@ static struct irqaction irq_call = {

				asmlinkage void plat_irq_dispatch(void)

				{

					unsigned int pending = read_c0_status() & read_c0_cause() & ST0_IM;

					unsigned int i;

					int irq;

					if ((MIPS_CPU_TIMER_IRQ == 7) && (pending & CAUSEF_IP7)) {

						do_IRQ(MIPS_CPU_TIMER_IRQ);

						goto out;

					} else {

						for (i = 0; i < MAX_IM; i++) {

							if (pending & (CAUSEF_IP2 << i)) {

								ltq_hw_irqdispatch(i);

								goto out;

							}

						}

					if (!pending) {

						spurious_interrupt();

						return;

					}

					pr_alert("Spurious IRQ: CAUSE=0x%08x\n", read_c0_status());

				out:

					return;

					pending >>= CAUSEB_IP;

					while (pending) {

						irq = fls(pending) - 1;

						do_IRQ(MIPS_CPU_IRQ_BASE + irq);

						pending &= ~BIT(irq);

					}

				}

				static int icu_map(struct irq_domain *d, unsigned int irq, irq_hw_number_t hw)

				@ -354,11 +355,6 @@ static const struct irq_domain_ops irq_domain_ops = {

					.map = icu_map,

				};

				static struct irqaction cascade = {

					.handler = no_action,

					.name = "cascade",

				};

				int __init icu_of_init(struct device_node *node, struct device_node *parent)

				{

					struct device_node *eiu_node;

				@ -390,7 +386,7 @@ int __init icu_of_init(struct device_node *node, struct device_node *parent)

					mips_cpu_irq_init();

					for (i = 0; i < MAX_IM; i++)

						setup_irq(i + 2, &cascade);

						irq_set_chained_handler(i + 2, ltq_hw_irq_handler);

					if (cpu_has_vint) {

						pr_info("Setting up vectored interrupts\n");

									
										2

arch/mips/lantiq/xway/sysctrl.c
									
												View File
												
				@ -467,7 +467,7 @@ void __init ltq_soc_init(void)

						if (!np_xbar)

							panic("Failed to load xbar nodes from devicetree");

						if (of_address_to_resource(np_pmu, 0, &res_xbar))

						if (of_address_to_resource(np_xbar, 0, &res_xbar))

							panic("Failed to get xbar resources");

						if (request_mem_region(res_xbar.start, resource_size(&res_xbar),

							res_xbar.name) < 0)

									
										2

arch/mips/mm/c-r4k.c
									
												View File
												
				@ -1558,6 +1558,7 @@ static void probe_vcache(void)

					vcache_size = c->vcache.sets * c->vcache.ways * c->vcache.linesz;

					c->vcache.waybit = 0;

					c->vcache.waysize = vcache_size / c->vcache.ways;

					pr_info("Unified victim cache %ldkB %s, linesize %d bytes.\n",

						vcache_size >> 10, way_string[c->vcache.ways], c->vcache.linesz);

				@ -1660,6 +1661,7 @@ static void __init loongson3_sc_init(void)

					/* Loongson-3 has 4 cores, 1MB scache for each. scaches are shared */

					scache_size *= 4;

					c->scache.waybit = 0;

					c->scache.waysize = scache_size / c->scache.ways;

					pr_info("Unified secondary cache %ldkB %s, linesize %d bytes.\n",

					       scache_size >> 10, way_string[c->scache.ways], c->scache.linesz);

					if (scache_size)

									
										25

arch/mips/mm/tlbex.c
									
												View File
												
				@ -762,7 +762,8 @@ static void build_huge_update_entries(u32 **p, unsigned int pte,

				static void build_huge_handler_tail(u32 **p, struct uasm_reloc **r,

								    struct uasm_label **l,

								    unsigned int pte,

								    unsigned int ptr)

								    unsigned int ptr,

								    unsigned int flush)

				{

				#ifdef CONFIG_SMP

					UASM_i_SC(p, pte, 0, ptr);

				@ -771,6 +772,22 @@ static void build_huge_handler_tail(u32 **p, struct uasm_reloc **r,

				#else

					UASM_i_SW(p, pte, 0, ptr);

				#endif

					if (cpu_has_ftlb && flush) {

						BUG_ON(!cpu_has_tlbinv);

						UASM_i_MFC0(p, ptr, C0_ENTRYHI);

						uasm_i_ori(p, ptr, ptr, MIPS_ENTRYHI_EHINV);

						UASM_i_MTC0(p, ptr, C0_ENTRYHI);

						build_tlb_write_entry(p, l, r, tlb_indexed);

						uasm_i_xori(p, ptr, ptr, MIPS_ENTRYHI_EHINV);

						UASM_i_MTC0(p, ptr, C0_ENTRYHI);

						build_huge_update_entries(p, pte, ptr);

						build_huge_tlb_write_entry(p, l, r, pte, tlb_random, 0);

						return;

					}

					build_huge_update_entries(p, pte, ptr);

					build_huge_tlb_write_entry(p, l, r, pte, tlb_indexed, 0);

				}

				@ -2197,7 +2214,7 @@ static void build_r4000_tlb_load_handler(void)

						uasm_l_tlbl_goaround2(&l, p);

					}

					uasm_i_ori(&p, wr.r1, wr.r1, (_PAGE_ACCESSED | _PAGE_VALID));

					build_huge_handler_tail(&p, &r, &l, wr.r1, wr.r2);

					build_huge_handler_tail(&p, &r, &l, wr.r1, wr.r2, 1);

				#endif

					uasm_l_nopage_tlbl(&l, p);

				@ -2252,7 +2269,7 @@ static void build_r4000_tlb_store_handler(void)

					build_tlb_probe_entry(&p);

					uasm_i_ori(&p, wr.r1, wr.r1,

						   _PAGE_ACCESSED | _PAGE_MODIFIED | _PAGE_VALID | _PAGE_DIRTY);

					build_huge_handler_tail(&p, &r, &l, wr.r1, wr.r2);

					build_huge_handler_tail(&p, &r, &l, wr.r1, wr.r2, 1);

				#endif

					uasm_l_nopage_tlbs(&l, p);

				@ -2308,7 +2325,7 @@ static void build_r4000_tlb_modify_handler(void)

					build_tlb_probe_entry(&p);

					uasm_i_ori(&p, wr.r1, wr.r1,

						   _PAGE_ACCESSED | _PAGE_MODIFIED | _PAGE_VALID | _PAGE_DIRTY);

					build_huge_handler_tail(&p, &r, &l, wr.r1, wr.r2);

					build_huge_handler_tail(&p, &r, &l, wr.r1, wr.r2, 0);

				#endif

					uasm_l_nopage_tlbm(&l, p);

									
										9

arch/mips/ralink/prom.c
									
												View File
												
				@ -30,8 +30,10 @@ const char *get_system_type(void)

					return soc_info.sys_type;

				}

				static __init void prom_init_cmdline(int argc, char **argv)

				static __init void prom_init_cmdline(void)

				{

					int argc;

					char **argv;

					int i;

					pr_debug("prom: fw_arg0=%08x fw_arg1=%08x fw_arg2=%08x fw_arg3=%08x\n",

				@ -60,14 +62,11 @@ static __init void prom_init_cmdline(int argc, char **argv)

				void __init prom_init(void)

				{

					int argc;

					char **argv;

					prom_soc_init(&soc_info);

					pr_info("SoC Type: %s\n", get_system_type());

					prom_init_cmdline(argc, argv);

					prom_init_cmdline();

				}

				void __init prom_free_prom_memory(void)

									
										10

arch/mips/ralink/rt288x.c
									
												View File
												
				@ -40,16 +40,6 @@ static struct rt2880_pmx_group rt2880_pinmux_data_act[] = {

					{ 0 }

				};

				static void rt288x_wdt_reset(void)

				{

					u32 t;

					/* enable WDT reset output on pin SRAM_CS_N */

					t = rt_sysc_r32(SYSC_REG_CLKCFG);

					t |= CLKCFG_SRAM_CS_N_WDT;

					rt_sysc_w32(t, SYSC_REG_CLKCFG);

				}

				void __init ralink_clk_init(void)

				{

					unsigned long cpu_rate, wmac_rate = 40000000;

									
										11

arch/mips/ralink/rt305x.c
									
												View File
												
				@ -89,17 +89,6 @@ static struct rt2880_pmx_group rt5350_pinmux_data[] = {

					{ 0 }

				};

				static void rt305x_wdt_reset(void)

				{

					u32 t;

					/* enable WDT reset output on pin SRAM_CS_N */

					t = rt_sysc_r32(SYSC_REG_SYSTEM_CONFIG);

					t |= RT305X_SYSCFG_SRAM_CS0_MODE_WDT <<

						RT305X_SYSCFG_SRAM_CS0_MODE_SHIFT;

					rt_sysc_w32(t, SYSC_REG_SYSTEM_CONFIG);

				}

				static unsigned long rt5350_get_mem_size(void)

				{

					void __iomem *sysc = (void __iomem *) KSEG1ADDR(RT305X_SYSC_BASE);

									
										14

arch/mips/ralink/rt3883.c
									
												View File
												
				@ -36,7 +36,7 @@ static struct rt2880_pmx_func uartlite_func[] = { FUNC("uartlite", 0, 15, 2) };

				static struct rt2880_pmx_func jtag_func[] = { FUNC("jtag", 0, 17, 5) };

				static struct rt2880_pmx_func mdio_func[] = { FUNC("mdio", 0, 22, 2) };

				static struct rt2880_pmx_func lna_a_func[] = { FUNC("lna a", 0, 32, 3) };

				static struct rt2880_pmx_func lna_g_func[] = { FUNC("lna a", 0, 35, 3) };

				static struct rt2880_pmx_func lna_g_func[] = { FUNC("lna g", 0, 35, 3) };

				static struct rt2880_pmx_func pci_func[] = {

					FUNC("pci-dev", 0, 40, 32),

					FUNC("pci-host2", 1, 40, 32),

				@ -44,7 +44,7 @@ static struct rt2880_pmx_func pci_func[] = {

					FUNC("pci-fnc", 3, 40, 32)

				};

				static struct rt2880_pmx_func ge1_func[] = { FUNC("ge1", 0, 72, 12) };

				static struct rt2880_pmx_func ge2_func[] = { FUNC("ge1", 0, 84, 12) };

				static struct rt2880_pmx_func ge2_func[] = { FUNC("ge2", 0, 84, 12) };

				static struct rt2880_pmx_group rt3883_pinmux_data[] = {

					GRP("i2c", i2c_func, 1, RT3883_GPIO_MODE_I2C),

				@ -63,16 +63,6 @@ static struct rt2880_pmx_group rt3883_pinmux_data[] = {

					{ 0 }

				};

				static void rt3883_wdt_reset(void)

				{

					u32 t;

					/* enable WDT reset output on GPIO 2 */

					t = rt_sysc_r32(RT3883_SYSC_REG_SYSCFG1);

					t |= RT3883_SYSCFG1_GPIO2_AS_WDT_OUT;

					rt_sysc_w32(t, RT3883_SYSC_REG_SYSCFG1);

				}

				void __init ralink_clk_init(void)

				{

					unsigned long cpu_rate, sys_rate;

									
										14

arch/mips/ralink/timer.c
									
												View File
												
				@ -71,11 +71,6 @@ static int rt_timer_request(struct rt_timer *rt)

					return err;

				}

				static void rt_timer_free(struct rt_timer *rt)

				{

					free_irq(rt->irq, rt);

				}

				static int rt_timer_config(struct rt_timer *rt, unsigned long divisor)

				{

					if (rt->timer_freq < divisor)

				@ -101,15 +96,6 @@ static int rt_timer_enable(struct rt_timer *rt)

					return 0;

				}

				static void rt_timer_disable(struct rt_timer *rt)

				{

					u32 t;

					t = rt_timer_r32(rt, TIMER_REG_TMR0CTL);

					t &= ~TMR0CTL_ENABLE;

					rt_timer_w32(rt, TIMER_REG_TMR0CTL, t);

				}

				static int rt_timer_probe(struct platform_device *pdev)

				{

					struct resource *res = platform_get_resource(pdev, IORESOURCE_MEM, 0);

2

arch/mips/sgi-ip22/Platform

View File

 @ -25,7 +25,7 @@ endif
 # Simplified: what IP22 does at 128MB+ in ksegN, IP28 does at 512MB+ in xkphys
 #
 ifdef CONFIG_SGI_IP28
   ifeq ($(call cc-option-yn,-mr10k-cache-barrier=store), n)
   ifeq ($(call cc-option-yn,-march=r10000 -mr10k-cache-barrier=store), n)
       $(error gcc doesn't support needed option -mr10k-cache-barrier=store)
   endif
 endif

									
										7

arch/nios2/kernel/prom.c
									
												View File
												
				@ -48,6 +48,13 @@ void * __init early_init_dt_alloc_memory_arch(u64 size, u64 align)

					return alloc_bootmem_align(size, align);

				}

				int __init early_init_dt_reserve_memory_arch(phys_addr_t base, phys_addr_t size,

									     bool nomap)

				{

					reserve_bootmem(base, size, BOOTMEM_DEFAULT);

					return 0;

				}

				void __init early_init_devtree(void *params)

				{

					__be32 *dtb = (u32 *)__dtb_start;

									
										3

arch/nios2/kernel/setup.c
									
												View File
												
				@ -195,6 +195,9 @@ void __init setup_arch(char **cmdline_p)

					}

				#endif /* CONFIG_BLK_DEV_INITRD */

					early_init_fdt_reserve_self();

					early_init_fdt_scan_reserved_mem();

					unflatten_and_copy_device_tree();

					setup_cpuinfo();

									
										23

arch/parisc/include/asm/cacheflush.h
									
												View File
												
				@ -45,28 +45,9 @@ static inline void flush_kernel_dcache_page(struct page *page)

				#define flush_kernel_dcache_range(start,size) \

					flush_kernel_dcache_range_asm((start), (start)+(size));

				/* vmap range flushes and invalidates.  Architecturally, we don't need

				 * the invalidate, because the CPU should refuse to speculate once an

				 * area has been flushed, so invalidate is left empty */

				static inline void flush_kernel_vmap_range(void *vaddr, int size)

				{

					unsigned long start = (unsigned long)vaddr;

					flush_kernel_dcache_range_asm(start, start + size);

				}

				static inline void invalidate_kernel_vmap_range(void *vaddr, int size)

				{

					unsigned long start = (unsigned long)vaddr;

					void *cursor = vaddr;

					for ( ; cursor < vaddr + size; cursor += PAGE_SIZE) {

						struct page *page = vmalloc_to_page(cursor);

						if (test_and_clear_bit(PG_dcache_dirty, &page->flags))

							flush_kernel_dcache_page(page);

					}

					flush_kernel_dcache_range_asm(start, start + size);

				}

				void flush_kernel_vmap_range(void *vaddr, int size);

				void invalidate_kernel_vmap_range(void *vaddr, int size);

				#define flush_cache_vmap(start, end)		flush_cache_all()

				#define flush_cache_vunmap(start, end)		flush_cache_all()

									
										59

arch/parisc/include/asm/uaccess.h
									
												View File
												
				@ -67,6 +67,15 @@ struct exception_table_entry {

					".word (" #fault_addr " - .), (" #except_addr " - .)\n\t" \

					".previous\n"

				/*

				 * ASM_EXCEPTIONTABLE_ENTRY_EFAULT() creates a special exception table entry

				 * (with lowest bit set) for which the fault handler in fixup_exception() will

				 * load -EFAULT into %r8 for a read or write fault, and zeroes the target

				 * register in case of a read fault in get_user().

				 */

				#define ASM_EXCEPTIONTABLE_ENTRY_EFAULT( fault_addr, except_addr )\

					ASM_EXCEPTIONTABLE_ENTRY( fault_addr, except_addr + 1)

				/*

				 * The page fault handler stores, in a per-cpu area, the following information

				 * if a fixup routine is available.

				@ -94,7 +103,7 @@ struct exception_data {

				#define __get_user(x, ptr)                               \

				({                                                       \

					register long __gu_err __asm__ ("r8") = 0;       \

					register long __gu_val __asm__ ("r9") = 0;       \

					register long __gu_val;				 \

											 \

					load_sr2();					 \

					switch (sizeof(*(ptr))) {			 \

				@ -110,22 +119,23 @@ struct exception_data {

				})

				#define __get_user_asm(ldx, ptr)                        \

					__asm__("\n1:\t" ldx "\t0(%%sr2,%2),%0\n\t"	\

						ASM_EXCEPTIONTABLE_ENTRY(1b, fixup_get_user_skip_1)\

					__asm__("1: " ldx " 0(%%sr2,%2),%0\n"		\

						"9:\n"					\

						ASM_EXCEPTIONTABLE_ENTRY_EFAULT(1b, 9b)	\

						: "=r"(__gu_val), "=r"(__gu_err)        \

						: "r"(ptr), "1"(__gu_err)		\

						: "r1");

						: "r"(ptr), "1"(__gu_err));

				#if !defined(CONFIG_64BIT)

				#define __get_user_asm64(ptr) 				\

					__asm__("\n1:\tldw 0(%%sr2,%2),%0"		\

						"\n2:\tldw 4(%%sr2,%2),%R0\n\t"		\

						ASM_EXCEPTIONTABLE_ENTRY(1b, fixup_get_user_skip_2)\

						ASM_EXCEPTIONTABLE_ENTRY(2b, fixup_get_user_skip_1)\

					__asm__("   copy %%r0,%R0\n"			\

						"1: ldw 0(%%sr2,%2),%0\n"		\

						"2: ldw 4(%%sr2,%2),%R0\n"		\

						"9:\n"					\

						ASM_EXCEPTIONTABLE_ENTRY_EFAULT(1b, 9b)	\

						ASM_EXCEPTIONTABLE_ENTRY_EFAULT(2b, 9b)	\

						: "=r"(__gu_val), "=r"(__gu_err)	\

						: "r"(ptr), "1"(__gu_err)		\

						: "r1");

						: "r"(ptr), "1"(__gu_err));

				#endif /* !defined(CONFIG_64BIT) */

				@ -151,32 +161,31 @@ struct exception_data {

				 * The "__put_user/kernel_asm()" macros tell gcc they read from memory

				 * instead of writing. This is because they do not write to any memory

				 * gcc knows about, so there are no aliasing issues. These macros must

				 * also be aware that "fixup_put_user_skip_[12]" are executed in the

				 * context of the fault, and any registers used there must be listed

				 * as clobbers. In this case only "r1" is used by the current routines.

				 * r8/r9 are already listed as err/val.

				 * also be aware that fixups are executed in the context of the fault,

				 * and any registers used there must be listed as clobbers.

				 * r8 is already listed as err.

				 */

				#define __put_user_asm(stx, x, ptr)                         \

					__asm__ __volatile__ (                              \

						"\n1:\t" stx "\t%2,0(%%sr2,%1)\n\t"	    \

						ASM_EXCEPTIONTABLE_ENTRY(1b, fixup_put_user_skip_1)\

						"1: " stx " %2,0(%%sr2,%1)\n"		    \

						"9:\n"					    \

						ASM_EXCEPTIONTABLE_ENTRY_EFAULT(1b, 9b)	    \

						: "=r"(__pu_err)                            \

						: "r"(ptr), "r"(x), "0"(__pu_err)	    \

						: "r1")

						: "r"(ptr), "r"(x), "0"(__pu_err))

				#if !defined(CONFIG_64BIT)

				#define __put_user_asm64(__val, ptr) do {	    	    \

					__asm__ __volatile__ (				    \

						"\n1:\tstw %2,0(%%sr2,%1)"		    \

						"\n2:\tstw %R2,4(%%sr2,%1)\n\t"		    \

						ASM_EXCEPTIONTABLE_ENTRY(1b, fixup_put_user_skip_2)\

						ASM_EXCEPTIONTABLE_ENTRY(2b, fixup_put_user_skip_1)\

						"1: stw %2,0(%%sr2,%1)\n"		    \

						"2: stw %R2,4(%%sr2,%1)\n"		    \

						"9:\n"					    \

						ASM_EXCEPTIONTABLE_ENTRY_EFAULT(1b, 9b)	    \

						ASM_EXCEPTIONTABLE_ENTRY_EFAULT(2b, 9b)	    \

						: "=r"(__pu_err)                            \

						: "r"(ptr), "r"(__val), "0"(__pu_err) \

						: "r1");				    \

						: "r"(ptr), "r"(__val), "0"(__pu_err));	    \

				} while (0)

				#endif /* !defined(CONFIG_64BIT) */

									
										22

arch/parisc/kernel/cache.c
									
												View File
												
				@ -633,3 +633,25 @@ flush_cache_page(struct vm_area_struct *vma, unsigned long vmaddr, unsigned long

						__flush_cache_page(vma, vmaddr, PFN_PHYS(pfn));

					}

				}

				void flush_kernel_vmap_range(void *vaddr, int size)

				{

					unsigned long start = (unsigned long)vaddr;

					if ((unsigned long)size > parisc_cache_flush_threshold)

						flush_data_cache();

					else

						flush_kernel_dcache_range_asm(start, start + size);

				}

				EXPORT_SYMBOL(flush_kernel_vmap_range);

				void invalidate_kernel_vmap_range(void *vaddr, int size)

				{

					unsigned long start = (unsigned long)vaddr;

					if ((unsigned long)size > parisc_cache_flush_threshold)

						flush_data_cache();

					else

						flush_kernel_dcache_range_asm(start, start + size);

				}

				EXPORT_SYMBOL(invalidate_kernel_vmap_range);

									
										10

arch/parisc/kernel/parisc_ksyms.c
									
												View File
												
				@ -47,16 +47,6 @@ EXPORT_SYMBOL(__cmpxchg_u64);

				EXPORT_SYMBOL(lclear_user);

				EXPORT_SYMBOL(lstrnlen_user);

				/* Global fixups - defined as int to avoid creation of function pointers */

				extern int fixup_get_user_skip_1;

				extern int fixup_get_user_skip_2;

				extern int fixup_put_user_skip_1;

				extern int fixup_put_user_skip_2;

				EXPORT_SYMBOL(fixup_get_user_skip_1);

				EXPORT_SYMBOL(fixup_get_user_skip_2);

				EXPORT_SYMBOL(fixup_put_user_skip_1);

				EXPORT_SYMBOL(fixup_put_user_skip_2);

				#ifndef CONFIG_64BIT

				/* Needed so insmod can set dp value */

				extern int $global$;

									
										4

arch/parisc/kernel/process.c
									
												View File
												
				@ -139,6 +139,10 @@ void machine_power_off(void)

					printk(KERN_EMERG "System shut down completed.\n"

					       "Please power this system off now.");

					/* prevent soft lockup/stalled CPU messages for endless loop. */

					rcu_sysrq_start();

					for (;;);

				}

				void (*pm_power_off)(void) = machine_power_off;

									
										2

arch/parisc/lib/Makefile
									
												View File
												
				@ -2,7 +2,7 @@

				# Makefile for parisc-specific library files

				#

				lib-y	:= lusercopy.o bitops.o checksum.o io.o memset.o fixup.o memcpy.o \

				lib-y	:= lusercopy.o bitops.o checksum.o io.o memset.o memcpy.o \

					   ucmpdi2.o delay.o

				obj-y	:= iomap.o

									
										98

arch/parisc/lib/fixup.S
									
												View File
											
				@ -1,98 +0,0 @@

				/*

				 * Linux/PA-RISC Project (http://www.parisc-linux.org/)

				 *

				 *  Copyright (C) 2004  Randolph Chung <tausq@debian.org>

				 *

				 *    This program is free software; you can redistribute it and/or modify

				 *    it under the terms of the GNU General Public License as published by

				 *    the Free Software Foundation; either version 2, or (at your option)

				 *    any later version.

				 *

				 *    This program is distributed in the hope that it will be useful,

				 *    but WITHOUT ANY WARRANTY; without even the implied warranty of

				 *    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the

				 *    GNU General Public License for more details.

				 *

				 *    You should have received a copy of the GNU General Public License

				 *    along with this program; if not, write to the Free Software

				 *    Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA.

				 * 

				 * Fixup routines for kernel exception handling.

				 */

				#include <asm/asm-offsets.h>

				#include <asm/assembly.h>

				#include <asm/errno.h>

				#include <linux/linkage.h>

				#ifdef CONFIG_SMP

					.macro  get_fault_ip t1 t2

					loadgp

					addil LT%__per_cpu_offset,%r27

					LDREG RT%__per_cpu_offset(%r1),\t1

					/* t2 = smp_processor_id() */

					mfctl 30,\t2

					ldw TI_CPU(\t2),\t2

				#ifdef CONFIG_64BIT

					extrd,u \t2,63,32,\t2

				#endif

					/* t2 = &__per_cpu_offset[smp_processor_id()]; */

					LDREGX \t2(\t1),\t2 

					addil LT%exception_data,%r27

					LDREG RT%exception_data(%r1),\t1

					/* t1 = this_cpu_ptr(&exception_data) */

					add,l \t1,\t2,\t1

					/* %r27 = t1->fault_gp - restore gp */

					LDREG EXCDATA_GP(\t1), %r27

					/* t1 = t1->fault_ip */

					LDREG EXCDATA_IP(\t1), \t1

					.endm

				#else

					.macro  get_fault_ip t1 t2

					loadgp

					/* t1 = this_cpu_ptr(&exception_data) */

					addil LT%exception_data,%r27

					LDREG RT%exception_data(%r1),\t2

					/* %r27 = t2->fault_gp - restore gp */

					LDREG EXCDATA_GP(\t2), %r27

					/* t1 = t2->fault_ip */

					LDREG EXCDATA_IP(\t2), \t1

					.endm

				#endif

					.level LEVEL

					.text

					.section .fixup, "ax"

					/* get_user() fixups, store -EFAULT in r8, and 0 in r9 */

				ENTRY_CFI(fixup_get_user_skip_1)

					get_fault_ip %r1,%r8

					ldo 4(%r1), %r1

					ldi -EFAULT, %r8

					bv %r0(%r1)

					copy %r0, %r9

				ENDPROC_CFI(fixup_get_user_skip_1)

				ENTRY_CFI(fixup_get_user_skip_2)

					get_fault_ip %r1,%r8

					ldo 8(%r1), %r1

					ldi -EFAULT, %r8

					bv %r0(%r1)

					copy %r0, %r9

				ENDPROC_CFI(fixup_get_user_skip_2)

					/* put_user() fixups, store -EFAULT in r8 */

				ENTRY_CFI(fixup_put_user_skip_1)

					get_fault_ip %r1,%r8

					ldo 4(%r1), %r1

					bv %r0(%r1)

					ldi -EFAULT, %r8

				ENDPROC_CFI(fixup_put_user_skip_1)

				ENTRY_CFI(fixup_put_user_skip_2)

					get_fault_ip %r1,%r8

					ldo 8(%r1), %r1

					bv %r0(%r1)

					ldi -EFAULT, %r8

				ENDPROC_CFI(fixup_put_user_skip_2)

									
										318

arch/parisc/lib/lusercopy.S
									
												View File
												
				@ -5,6 +5,8 @@

				 *    Copyright (C) 2000 Richard Hirst <rhirst with parisc-linux.org>

				 *    Copyright (C) 2001 Matthieu Delahaye <delahaym at esiee.fr>

				 *    Copyright (C) 2003 Randolph Chung <tausq with parisc-linux.org>

				 *    Copyright (C) 2017 Helge Deller <deller@gmx.de>

				 *    Copyright (C) 2017 John David Anglin <dave.anglin@bell.net>

				 *

				 *

				 *    This program is free software; you can redistribute it and/or modify

				@ -132,4 +134,320 @@ ENDPROC_CFI(lstrnlen_user)

					.procend

				/*

				 * unsigned long pa_memcpy(void *dstp, const void *srcp, unsigned long len)

				 *

				 * Inputs:

				 * - sr1 already contains space of source region

				 * - sr2 already contains space of destination region

				 *

				 * Returns:

				 * - number of bytes that could not be copied.

				 *   On success, this will be zero.

				 *

				 * This code is based on a C-implementation of a copy routine written by

				 * Randolph Chung, which in turn was derived from the glibc.

				 *

				 * Several strategies are tried to try to get the best performance for various

				 * conditions. In the optimal case, we copy by loops that copy 32- or 16-bytes

				 * at a time using general registers.  Unaligned copies are handled either by

				 * aligning the destination and then using shift-and-write method, or in a few

				 * cases by falling back to a byte-at-a-time copy.

				 *

				 * Testing with various alignments and buffer sizes shows that this code is

				 * often >10x faster than a simple byte-at-a-time copy, even for strangely

				 * aligned operands. It is interesting to note that the glibc version of memcpy

				 * (written in C) is actually quite fast already. This routine is able to beat

				 * it by 30-40% for aligned copies because of the loop unrolling, but in some

				 * cases the glibc version is still slightly faster. This lends more

				 * credibility that gcc can generate very good code as long as we are careful.

				 *

				 * Possible optimizations:

				 * - add cache prefetching

				 * - try not to use the post-increment address modifiers; they may create

				 *   additional interlocks. Assumption is that those were only efficient on old

				 *   machines (pre PA8000 processors)

				 */

					dst = arg0

					src = arg1

					len = arg2

					end = arg3

					t1  = r19

					t2  = r20

					t3  = r21

					t4  = r22

					srcspc = sr1

					dstspc = sr2

					t0 = r1

					a1 = t1

					a2 = t2

					a3 = t3

					a0 = t4

					save_src = ret0

					save_dst = ret1

					save_len = r31

				ENTRY_CFI(pa_memcpy)

					.proc

					.callinfo NO_CALLS

					.entry

					/* Last destination address */

					add	dst,len,end

					/* short copy with less than 16 bytes? */

					cmpib,>>=,n 15,len,.Lbyte_loop

					/* same alignment? */

					xor	src,dst,t0

					extru	t0,31,2,t1

					cmpib,<>,n  0,t1,.Lunaligned_copy

				#ifdef CONFIG_64BIT

					/* only do 64-bit copies if we can get aligned. */

					extru	t0,31,3,t1

					cmpib,<>,n  0,t1,.Lalign_loop32

					/* loop until we are 64-bit aligned */

				.Lalign_loop64:

					extru	dst,31,3,t1

					cmpib,=,n	0,t1,.Lcopy_loop_16

				20:	ldb,ma	1(srcspc,src),t1

				21:	stb,ma	t1,1(dstspc,dst)

					b	.Lalign_loop64

					ldo	-1(len),len

					ASM_EXCEPTIONTABLE_ENTRY(20b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(21b,.Lcopy_done)

					ldi	31,t0

				.Lcopy_loop_16:

					cmpb,COND(>>=),n t0,len,.Lword_loop

				10:	ldd	0(srcspc,src),t1

				11:	ldd	8(srcspc,src),t2

					ldo	16(src),src

				12:	std,ma	t1,8(dstspc,dst)

				13:	std,ma	t2,8(dstspc,dst)

				14:	ldd	0(srcspc,src),t1

				15:	ldd	8(srcspc,src),t2

					ldo	16(src),src

				16:	std,ma	t1,8(dstspc,dst)

				17:	std,ma	t2,8(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(10b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(11b,.Lcopy16_fault)

					ASM_EXCEPTIONTABLE_ENTRY(12b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(13b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(14b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(15b,.Lcopy16_fault)

					ASM_EXCEPTIONTABLE_ENTRY(16b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(17b,.Lcopy_done)

					b	.Lcopy_loop_16

					ldo	-32(len),len

				.Lword_loop:

					cmpib,COND(>>=),n 3,len,.Lbyte_loop

				20:	ldw,ma	4(srcspc,src),t1

				21:	stw,ma	t1,4(dstspc,dst)

					b	.Lword_loop

					ldo	-4(len),len

					ASM_EXCEPTIONTABLE_ENTRY(20b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(21b,.Lcopy_done)

				#endif /* CONFIG_64BIT */

					/* loop until we are 32-bit aligned */

				.Lalign_loop32:

					extru	dst,31,2,t1

					cmpib,=,n	0,t1,.Lcopy_loop_4

				20:	ldb,ma	1(srcspc,src),t1

				21:	stb,ma	t1,1(dstspc,dst)

					b	.Lalign_loop32

					ldo	-1(len),len

					ASM_EXCEPTIONTABLE_ENTRY(20b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(21b,.Lcopy_done)

				.Lcopy_loop_4:

					cmpib,COND(>>=),n 15,len,.Lbyte_loop

				10:	ldw	0(srcspc,src),t1

				11:	ldw	4(srcspc,src),t2

				12:	stw,ma	t1,4(dstspc,dst)

				13:	stw,ma	t2,4(dstspc,dst)

				14:	ldw	8(srcspc,src),t1

				15:	ldw	12(srcspc,src),t2

					ldo	16(src),src

				16:	stw,ma	t1,4(dstspc,dst)

				17:	stw,ma	t2,4(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(10b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(11b,.Lcopy8_fault)

					ASM_EXCEPTIONTABLE_ENTRY(12b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(13b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(14b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(15b,.Lcopy8_fault)

					ASM_EXCEPTIONTABLE_ENTRY(16b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(17b,.Lcopy_done)

					b	.Lcopy_loop_4

					ldo	-16(len),len

				.Lbyte_loop:

					cmpclr,COND(<>) len,%r0,%r0

					b,n	.Lcopy_done

				20:	ldb	0(srcspc,src),t1

					ldo	1(src),src

				21:	stb,ma	t1,1(dstspc,dst)

					b	.Lbyte_loop

					ldo	-1(len),len

					ASM_EXCEPTIONTABLE_ENTRY(20b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(21b,.Lcopy_done)

				.Lcopy_done:

					bv	%r0(%r2)

					sub	end,dst,ret0

					/* src and dst are not aligned the same way. */

					/* need to go the hard way */

				.Lunaligned_copy:

					/* align until dst is 32bit-word-aligned */

					extru	dst,31,2,t1

					cmpib,COND(=),n	0,t1,.Lcopy_dstaligned

				20:	ldb	0(srcspc,src),t1

					ldo	1(src),src

				21:	stb,ma	t1,1(dstspc,dst)

					b	.Lunaligned_copy

					ldo	-1(len),len

					ASM_EXCEPTIONTABLE_ENTRY(20b,.Lcopy_done)

					ASM_EXCEPTIONTABLE_ENTRY(21b,.Lcopy_done)

				.Lcopy_dstaligned:

					/* store src, dst and len in safe place */

					copy	src,save_src

					copy	dst,save_dst

					copy	len,save_len

					/* len now needs give number of words to copy */

					SHRREG	len,2,len

					/*

					 * Copy from a not-aligned src to an aligned dst using shifts.

					 * Handles 4 words per loop.

					 */

					depw,z src,28,2,t0

					subi 32,t0,t0

					mtsar t0

					extru len,31,2,t0

					cmpib,= 2,t0,.Lcase2

					/* Make src aligned by rounding it down.  */

					depi 0,31,2,src

					cmpiclr,<> 3,t0,%r0

					b,n .Lcase3

					cmpiclr,<> 1,t0,%r0

					b,n .Lcase1

				.Lcase0:

					cmpb,= %r0,len,.Lcda_finish

					nop

				1:	ldw,ma 4(srcspc,src), a3

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

				1:	ldw,ma 4(srcspc,src), a0

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					b,n .Ldo3

				.Lcase1:

				1:	ldw,ma 4(srcspc,src), a2

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

				1:	ldw,ma 4(srcspc,src), a3

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					ldo -1(len),len

					cmpb,=,n %r0,len,.Ldo0

				.Ldo4:

				1:	ldw,ma 4(srcspc,src), a0

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					shrpw a2, a3, %sar, t0

				1:	stw,ma t0, 4(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcopy_done)

				.Ldo3:

				1:	ldw,ma 4(srcspc,src), a1

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					shrpw a3, a0, %sar, t0

				1:	stw,ma t0, 4(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcopy_done)

				.Ldo2:

				1:	ldw,ma 4(srcspc,src), a2

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					shrpw a0, a1, %sar, t0

				1:	stw,ma t0, 4(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcopy_done)

				.Ldo1:

				1:	ldw,ma 4(srcspc,src), a3

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					shrpw a1, a2, %sar, t0

				1:	stw,ma t0, 4(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcopy_done)

					ldo -4(len),len

					cmpb,<> %r0,len,.Ldo4

					nop

				.Ldo0:

					shrpw a2, a3, %sar, t0

				1:	stw,ma t0, 4(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcopy_done)

				.Lcda_rdfault:

				.Lcda_finish:

					/* calculate new src, dst and len and jump to byte-copy loop */

					sub	dst,save_dst,t0

					add	save_src,t0,src

					b	.Lbyte_loop

					sub	save_len,t0,len

				.Lcase3:

				1:	ldw,ma 4(srcspc,src), a0

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

				1:	ldw,ma 4(srcspc,src), a1

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					b .Ldo2

					ldo 1(len),len

				.Lcase2:

				1:	ldw,ma 4(srcspc,src), a1

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

				1:	ldw,ma 4(srcspc,src), a2

					ASM_EXCEPTIONTABLE_ENTRY(1b,.Lcda_rdfault)

					b .Ldo1

					ldo 2(len),len

					/* fault exception fixup handlers: */

				#ifdef CONFIG_64BIT

				.Lcopy16_fault:

				10:	b	.Lcopy_done

					std,ma	t1,8(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(10b,.Lcopy_done)

				#endif

				.Lcopy8_fault:

				10:	b	.Lcopy_done

					stw,ma	t1,4(dstspc,dst)

					ASM_EXCEPTIONTABLE_ENTRY(10b,.Lcopy_done)

					.exit

				ENDPROC_CFI(pa_memcpy)

					.procend

					.end

									
										461

arch/parisc/lib/memcpy.c
									
												View File
												
				@ -2,7 +2,7 @@

				 *    Optimized memory copy routines.

				 *

				 *    Copyright (C) 2004 Randolph Chung <tausq@debian.org>

				 *    Copyright (C) 2013 Helge Deller <deller@gmx.de>

				 *    Copyright (C) 2013-2017 Helge Deller <deller@gmx.de>

				 *

				 *    This program is free software; you can redistribute it and/or modify

				 *    it under the terms of the GNU General Public License as published by

				@ -21,474 +21,21 @@

				 *    Portions derived from the GNU C Library

				 *    Copyright (C) 1991, 1997, 2003 Free Software Foundation, Inc.

				 *

				 * Several strategies are tried to try to get the best performance for various

				 * conditions. In the optimal case, we copy 64-bytes in an unrolled loop using 

				 * fp regs. This is followed by loops that copy 32- or 16-bytes at a time using

				 * general registers.  Unaligned copies are handled either by aligning the 

				 * destination and then using shift-and-write method, or in a few cases by 

				 * falling back to a byte-at-a-time copy.

				 *

				 * I chose to implement this in C because it is easier to maintain and debug,

				 * and in my experiments it appears that the C code generated by gcc (3.3/3.4

				 * at the time of writing) is fairly optimal. Unfortunately some of the 

				 * semantics of the copy routine (exception handling) is difficult to express

				 * in C, so we have to play some tricks to get it to work.

				 *

				 * All the loads and stores are done via explicit asm() code in order to use

				 * the right space registers. 

				 * 

				 * Testing with various alignments and buffer sizes shows that this code is 

				 * often >10x faster than a simple byte-at-a-time copy, even for strangely

				 * aligned operands. It is interesting to note that the glibc version

				 * of memcpy (written in C) is actually quite fast already. This routine is 

				 * able to beat it by 30-40% for aligned copies because of the loop unrolling, 

				 * but in some cases the glibc version is still slightly faster. This lends 

				 * more credibility that gcc can generate very good code as long as we are 

				 * careful.

				 *

				 * TODO:

				 * - cache prefetching needs more experimentation to get optimal settings

				 * - try not to use the post-increment address modifiers; they create additional

				 *   interlocks

				 * - replace byte-copy loops with stybs sequences

				 */

				#ifdef __KERNEL__

				#include <linux/module.h>

				#include <linux/compiler.h>

				#include <linux/uaccess.h>

				#define s_space "%%sr1"

				#define d_space "%%sr2"

				#else

				#include "memcpy.h"

				#define s_space "%%sr0"

				#define d_space "%%sr0"

				#define pa_memcpy new2_copy

				#endif

				DECLARE_PER_CPU(struct exception_data, exception_data);

				#define preserve_branch(label)	do {					\

					volatile int dummy = 0;						\

					/* The following branch is never taken, it's just here to  */	\

					/* prevent gcc from optimizing away our exception code. */ 	\

					if (unlikely(dummy != dummy))					\

						goto label;						\

				} while (0)

				#define get_user_space() (segment_eq(get_fs(), KERNEL_DS) ? 0 : mfsp(3))

				#define get_kernel_space() (0)

				#define MERGE(w0, sh_1, w1, sh_2)  ({					\

					unsigned int _r;						\

					asm volatile (							\

					"mtsar %3\n"							\

					"shrpw %1, %2, %%sar, %0\n"					\

					: "=r"(_r)							\

					: "r"(w0), "r"(w1), "r"(sh_2)					\

					);								\

					_r;								\

				})

				#define THRESHOLD	16

				#ifdef DEBUG_MEMCPY

				#define DPRINTF(fmt, args...) do { printk(KERN_DEBUG "%s:%d:%s ", __FILE__, __LINE__, __func__ ); printk(KERN_DEBUG fmt, ##args ); } while (0)

				#else

				#define DPRINTF(fmt, args...)

				#endif

				#define def_load_ai_insn(_insn,_sz,_tt,_s,_a,_t,_e)	\

					__asm__ __volatile__ (				\

					"1:\t" #_insn ",ma " #_sz "(" _s ",%1), %0\n\t"	\

					ASM_EXCEPTIONTABLE_ENTRY(1b,_e)			\

					: _tt(_t), "+r"(_a)				\

					: 						\

					: "r8")

				#define def_store_ai_insn(_insn,_sz,_tt,_s,_a,_t,_e) 	\

					__asm__ __volatile__ (				\

					"1:\t" #_insn ",ma %1, " #_sz "(" _s ",%0)\n\t"	\

					ASM_EXCEPTIONTABLE_ENTRY(1b,_e)			\

					: "+r"(_a) 					\

					: _tt(_t)					\

					: "r8")

				#define ldbma(_s, _a, _t, _e) def_load_ai_insn(ldbs,1,"=r",_s,_a,_t,_e)

				#define stbma(_s, _t, _a, _e) def_store_ai_insn(stbs,1,"r",_s,_a,_t,_e)

				#define ldwma(_s, _a, _t, _e) def_load_ai_insn(ldw,4,"=r",_s,_a,_t,_e)

				#define stwma(_s, _t, _a, _e) def_store_ai_insn(stw,4,"r",_s,_a,_t,_e)

				#define flddma(_s, _a, _t, _e) def_load_ai_insn(fldd,8,"=f",_s,_a,_t,_e)

				#define fstdma(_s, _t, _a, _e) def_store_ai_insn(fstd,8,"f",_s,_a,_t,_e)

				#define def_load_insn(_insn,_tt,_s,_o,_a,_t,_e) 	\

					__asm__ __volatile__ (				\

					"1:\t" #_insn " " #_o "(" _s ",%1), %0\n\t"	\

					ASM_EXCEPTIONTABLE_ENTRY(1b,_e)			\

					: _tt(_t) 					\

					: "r"(_a)					\

					: "r8")

				#define def_store_insn(_insn,_tt,_s,_t,_o,_a,_e) 	\

					__asm__ __volatile__ (				\

					"1:\t" #_insn " %0, " #_o "(" _s ",%1)\n\t" 	\

					ASM_EXCEPTIONTABLE_ENTRY(1b,_e)			\

					: 						\

					: _tt(_t), "r"(_a)				\

					: "r8")

				#define ldw(_s,_o,_a,_t,_e)	def_load_insn(ldw,"=r",_s,_o,_a,_t,_e)

				#define stw(_s,_t,_o,_a,_e) 	def_store_insn(stw,"r",_s,_t,_o,_a,_e)

				#ifdef  CONFIG_PREFETCH

				static inline void prefetch_src(const void *addr)

				{

					__asm__("ldw 0(" s_space ",%0), %%r0" : : "r" (addr));

				}

				static inline void prefetch_dst(const void *addr)

				{

					__asm__("ldd 0(" d_space ",%0), %%r0" : : "r" (addr));

				}

				#else

				#define prefetch_src(addr) do { } while(0)

				#define prefetch_dst(addr) do { } while(0)

				#endif

				#define PA_MEMCPY_OK		0

				#define PA_MEMCPY_LOAD_ERROR	1

				#define PA_MEMCPY_STORE_ERROR	2

				/* Copy from a not-aligned src to an aligned dst, using shifts. Handles 4 words

				 * per loop.  This code is derived from glibc. 

				 */

				static noinline unsigned long copy_dstaligned(unsigned long dst,

									unsigned long src, unsigned long len)

				{

					/* gcc complains that a2 and a3 may be uninitialized, but actually

					 * they cannot be.  Initialize a2/a3 to shut gcc up.

					 */

					register unsigned int a0, a1, a2 = 0, a3 = 0;

					int sh_1, sh_2;

					/* prefetch_src((const void *)src); */

					/* Calculate how to shift a word read at the memory operation

					   aligned srcp to make it aligned for copy.  */

					sh_1 = 8 * (src % sizeof(unsigned int));

					sh_2 = 8 * sizeof(unsigned int) - sh_1;

					/* Make src aligned by rounding it down.  */

					src &= -sizeof(unsigned int);

					switch (len % 4)

					{

						case 2:

							/* a1 = ((unsigned int *) src)[0];

							   a2 = ((unsigned int *) src)[1]; */

							ldw(s_space, 0, src, a1, cda_ldw_exc);

							ldw(s_space, 4, src, a2, cda_ldw_exc);

							src -= 1 * sizeof(unsigned int);

							dst -= 3 * sizeof(unsigned int);

							len += 2;

							goto do1;

						case 3:

							/* a0 = ((unsigned int *) src)[0];

							   a1 = ((unsigned int *) src)[1]; */

							ldw(s_space, 0, src, a0, cda_ldw_exc);

							ldw(s_space, 4, src, a1, cda_ldw_exc);

							src -= 0 * sizeof(unsigned int);

							dst -= 2 * sizeof(unsigned int);

							len += 1;

							goto do2;

						case 0:

							if (len == 0)

								return PA_MEMCPY_OK;

							/* a3 = ((unsigned int *) src)[0];

							   a0 = ((unsigned int *) src)[1]; */

							ldw(s_space, 0, src, a3, cda_ldw_exc);

							ldw(s_space, 4, src, a0, cda_ldw_exc);

							src -=-1 * sizeof(unsigned int);

							dst -= 1 * sizeof(unsigned int);

							len += 0;

							goto do3;

						case 1:

							/* a2 = ((unsigned int *) src)[0];

							   a3 = ((unsigned int *) src)[1]; */

							ldw(s_space, 0, src, a2, cda_ldw_exc);

							ldw(s_space, 4, src, a3, cda_ldw_exc);

							src -=-2 * sizeof(unsigned int);

							dst -= 0 * sizeof(unsigned int);

							len -= 1;

							if (len == 0)

								goto do0;

							goto do4;			/* No-op.  */

					}

					do

					{

						/* prefetch_src((const void *)(src + 4 * sizeof(unsigned int))); */

				do4:

						/* a0 = ((unsigned int *) src)[0]; */

						ldw(s_space, 0, src, a0, cda_ldw_exc);

						/* ((unsigned int *) dst)[0] = MERGE (a2, sh_1, a3, sh_2); */

						stw(d_space, MERGE (a2, sh_1, a3, sh_2), 0, dst, cda_stw_exc);

				do3:

						/* a1 = ((unsigned int *) src)[1]; */

						ldw(s_space, 4, src, a1, cda_ldw_exc);

						/* ((unsigned int *) dst)[1] = MERGE (a3, sh_1, a0, sh_2); */

						stw(d_space, MERGE (a3, sh_1, a0, sh_2), 4, dst, cda_stw_exc);

				do2:

						/* a2 = ((unsigned int *) src)[2]; */

						ldw(s_space, 8, src, a2, cda_ldw_exc);

						/* ((unsigned int *) dst)[2] = MERGE (a0, sh_1, a1, sh_2); */

						stw(d_space, MERGE (a0, sh_1, a1, sh_2), 8, dst, cda_stw_exc);

				do1:

						/* a3 = ((unsigned int *) src)[3]; */

						ldw(s_space, 12, src, a3, cda_ldw_exc);

						/* ((unsigned int *) dst)[3] = MERGE (a1, sh_1, a2, sh_2); */

						stw(d_space, MERGE (a1, sh_1, a2, sh_2), 12, dst, cda_stw_exc);

						src += 4 * sizeof(unsigned int);

						dst += 4 * sizeof(unsigned int);

						len -= 4;

					}

					while (len != 0);

				do0:

					/* ((unsigned int *) dst)[0] = MERGE (a2, sh_1, a3, sh_2); */

					stw(d_space, MERGE (a2, sh_1, a3, sh_2), 0, dst, cda_stw_exc);

					preserve_branch(handle_load_error);

					preserve_branch(handle_store_error);

					return PA_MEMCPY_OK;

				handle_load_error:

					__asm__ __volatile__ ("cda_ldw_exc:\n");

					return PA_MEMCPY_LOAD_ERROR;

				handle_store_error:

					__asm__ __volatile__ ("cda_stw_exc:\n");

					return PA_MEMCPY_STORE_ERROR;

				}

				/* Returns PA_MEMCPY_OK, PA_MEMCPY_LOAD_ERROR or PA_MEMCPY_STORE_ERROR.

				 * In case of an access fault the faulty address can be read from the per_cpu

				 * exception data struct. */

				static noinline unsigned long pa_memcpy_internal(void *dstp, const void *srcp,

									unsigned long len)

				{

					register unsigned long src, dst, t1, t2, t3;

					register unsigned char *pcs, *pcd;

					register unsigned int *pws, *pwd;

					register double *pds, *pdd;

					unsigned long ret;

					src = (unsigned long)srcp;

					dst = (unsigned long)dstp;

					pcs = (unsigned char *)srcp;

					pcd = (unsigned char *)dstp;

					/* prefetch_src((const void *)srcp); */

					if (len < THRESHOLD)

						goto byte_copy;

					/* Check alignment */

					t1 = (src ^ dst);

					if (unlikely(t1 & (sizeof(double)-1)))

						goto unaligned_copy;

					/* src and dst have same alignment. */

					/* Copy bytes till we are double-aligned. */

					t2 = src & (sizeof(double) - 1);

					if (unlikely(t2 != 0)) {

						t2 = sizeof(double) - t2;

						while (t2 && len) {

							/* *pcd++ = *pcs++; */

							ldbma(s_space, pcs, t3, pmc_load_exc);

							len--;

							stbma(d_space, t3, pcd, pmc_store_exc);

							t2--;

						}

					}

					pds = (double *)pcs;

					pdd = (double *)pcd;

				#if 0

					/* Copy 8 doubles at a time */

					while (len >= 8*sizeof(double)) {

						register double r1, r2, r3, r4, r5, r6, r7, r8;

						/* prefetch_src((char *)pds + L1_CACHE_BYTES); */

						flddma(s_space, pds, r1, pmc_load_exc);

						flddma(s_space, pds, r2, pmc_load_exc);

						flddma(s_space, pds, r3, pmc_load_exc);

						flddma(s_space, pds, r4, pmc_load_exc);

						fstdma(d_space, r1, pdd, pmc_store_exc);

						fstdma(d_space, r2, pdd, pmc_store_exc);

						fstdma(d_space, r3, pdd, pmc_store_exc);

						fstdma(d_space, r4, pdd, pmc_store_exc);

				#if 0

						if (L1_CACHE_BYTES <= 32)

							prefetch_src((char *)pds + L1_CACHE_BYTES);

				#endif

						flddma(s_space, pds, r5, pmc_load_exc);

						flddma(s_space, pds, r6, pmc_load_exc);

						flddma(s_space, pds, r7, pmc_load_exc);

						flddma(s_space, pds, r8, pmc_load_exc);

						fstdma(d_space, r5, pdd, pmc_store_exc);

						fstdma(d_space, r6, pdd, pmc_store_exc);

						fstdma(d_space, r7, pdd, pmc_store_exc);

						fstdma(d_space, r8, pdd, pmc_store_exc);

						len -= 8*sizeof(double);

					}

				#endif

					pws = (unsigned int *)pds;

					pwd = (unsigned int *)pdd;

				word_copy:

					while (len >= 8*sizeof(unsigned int)) {

						register unsigned int r1,r2,r3,r4,r5,r6,r7,r8;

						/* prefetch_src((char *)pws + L1_CACHE_BYTES); */

						ldwma(s_space, pws, r1, pmc_load_exc);

						ldwma(s_space, pws, r2, pmc_load_exc);

						ldwma(s_space, pws, r3, pmc_load_exc);

						ldwma(s_space, pws, r4, pmc_load_exc);

						stwma(d_space, r1, pwd, pmc_store_exc);

						stwma(d_space, r2, pwd, pmc_store_exc);

						stwma(d_space, r3, pwd, pmc_store_exc);

						stwma(d_space, r4, pwd, pmc_store_exc);

						ldwma(s_space, pws, r5, pmc_load_exc);

						ldwma(s_space, pws, r6, pmc_load_exc);

						ldwma(s_space, pws, r7, pmc_load_exc);

						ldwma(s_space, pws, r8, pmc_load_exc);

						stwma(d_space, r5, pwd, pmc_store_exc);

						stwma(d_space, r6, pwd, pmc_store_exc);

						stwma(d_space, r7, pwd, pmc_store_exc);

						stwma(d_space, r8, pwd, pmc_store_exc);

						len -= 8*sizeof(unsigned int);

					}

					while (len >= 4*sizeof(unsigned int)) {

						register unsigned int r1,r2,r3,r4;

						ldwma(s_space, pws, r1, pmc_load_exc);

						ldwma(s_space, pws, r2, pmc_load_exc);

						ldwma(s_space, pws, r3, pmc_load_exc);

						ldwma(s_space, pws, r4, pmc_load_exc);

						stwma(d_space, r1, pwd, pmc_store_exc);

						stwma(d_space, r2, pwd, pmc_store_exc);

						stwma(d_space, r3, pwd, pmc_store_exc);

						stwma(d_space, r4, pwd, pmc_store_exc);

						len -= 4*sizeof(unsigned int);

					}

					pcs = (unsigned char *)pws;

					pcd = (unsigned char *)pwd;

				byte_copy:

					while (len) {

						/* *pcd++ = *pcs++; */

						ldbma(s_space, pcs, t3, pmc_load_exc);

						stbma(d_space, t3, pcd, pmc_store_exc);

						len--;

					}

					return PA_MEMCPY_OK;

				unaligned_copy:

					/* possibly we are aligned on a word, but not on a double... */

					if (likely((t1 & (sizeof(unsigned int)-1)) == 0)) {

						t2 = src & (sizeof(unsigned int) - 1);

						if (unlikely(t2 != 0)) {

							t2 = sizeof(unsigned int) - t2;

							while (t2) {

								/* *pcd++ = *pcs++; */

								ldbma(s_space, pcs, t3, pmc_load_exc);

								stbma(d_space, t3, pcd, pmc_store_exc);

								len--;

								t2--;

							}

						}

						pws = (unsigned int *)pcs;

						pwd = (unsigned int *)pcd;

						goto word_copy;

					}

					/* Align the destination.  */

					if (unlikely((dst & (sizeof(unsigned int) - 1)) != 0)) {

						t2 = sizeof(unsigned int) - (dst & (sizeof(unsigned int) - 1));

						while (t2) {

							/* *pcd++ = *pcs++; */

							ldbma(s_space, pcs, t3, pmc_load_exc);

							stbma(d_space, t3, pcd, pmc_store_exc);

							len--;

							t2--;

						}

						dst = (unsigned long)pcd;

						src = (unsigned long)pcs;

					}

					ret = copy_dstaligned(dst, src, len / sizeof(unsigned int));

					if (ret)

						return ret;

					pcs += (len & -sizeof(unsigned int));

					pcd += (len & -sizeof(unsigned int));

					len %= sizeof(unsigned int);

					preserve_branch(handle_load_error);

					preserve_branch(handle_store_error);

					goto byte_copy;

				handle_load_error:

					__asm__ __volatile__ ("pmc_load_exc:\n");

					return PA_MEMCPY_LOAD_ERROR;

				handle_store_error:

					__asm__ __volatile__ ("pmc_store_exc:\n");

					return PA_MEMCPY_STORE_ERROR;

				}

				/* Returns 0 for success, otherwise, returns number of bytes not transferred. */

				static unsigned long pa_memcpy(void *dstp, const void *srcp, unsigned long len)

				{

					unsigned long ret, fault_addr, reference;

					struct exception_data *d;

				extern unsigned long pa_memcpy(void *dst, const void *src,

								unsigned long len);

					ret = pa_memcpy_internal(dstp, srcp, len);

					if (likely(ret == PA_MEMCPY_OK))

						return 0;

					/* if a load or store fault occured we can get the faulty addr */

					d = this_cpu_ptr(&exception_data);

					fault_addr = d->fault_addr;

					/* error in load or store? */

					if (ret == PA_MEMCPY_LOAD_ERROR)

						reference = (unsigned long) srcp;

					else

						reference = (unsigned long) dstp;

					DPRINTF("pa_memcpy: fault type = %lu, len=%lu fault_addr=%lu ref=%lu\n",

						ret, len, fault_addr, reference);

					if (fault_addr >= reference)

						return len - (fault_addr - reference);

					else

						return len;

				}

				#ifdef __KERNEL__

				unsigned long __copy_to_user(void __user *dst, const void *src,

							     unsigned long len)

				{

				@ -537,5 +84,3 @@ long probe_kernel_read(void *dst, const void *src, size_t size)

					return __probe_kernel_read(dst, src, size);

				}

				#endif

									
										17

arch/parisc/mm/fault.c
									
												View File
												
				@ -149,6 +149,23 @@ int fixup_exception(struct pt_regs *regs)

						d->fault_space = regs->isr;

						d->fault_addr = regs->ior;

						/*

						 * Fix up get_user() and put_user().

						 * ASM_EXCEPTIONTABLE_ENTRY_EFAULT() sets the least-significant

						 * bit in the relative address of the fixup routine to indicate

						 * that %r8 should be loaded with -EFAULT to report a userspace

						 * access error.

						 */

						if (fix->fixup & 1) {

							regs->gr[8] = -EFAULT;

							/* zero target register for get_user() */

							if (parisc_acctyp(0, regs->iir) == VM_READ) {

								int treg = regs->iir & 0x1f;

								regs->gr[treg] = 0;

							}

						}

						regs->iaoq[0] = (unsigned long)&fix->fixup + fix->fixup;

						regs->iaoq[0] &= ~3;

						/*

									
										1

arch/powerpc/boot/zImage.lds.S
									
												View File
												
				@ -68,6 +68,7 @@ SECTIONS

				  }

				#ifdef CONFIG_PPC64_BOOT_WRAPPER

				  . = ALIGN(256);

				  .got :

				  {

				    __toc_start = .;

									
										5

arch/powerpc/crypto/crc32c-vpmsum_glue.c
									
												View File
												
				@ -33,10 +33,13 @@ static u32 crc32c_vpmsum(u32 crc, unsigned char const *p, size_t len)

					}

					if (len & ~VMX_ALIGN_MASK) {

						preempt_disable();

						pagefault_disable();

						enable_kernel_altivec();

						crc = __crc32c_vpmsum(crc, p, len & ~VMX_ALIGN_MASK);

						disable_kernel_altivec();

						pagefault_enable();

						preempt_enable();

					}

					tail = len & VMX_ALIGN_MASK;

				@ -52,7 +55,7 @@ static int crc32c_vpmsum_cra_init(struct crypto_tfm *tfm)

				{

					u32 *key = crypto_tfm_ctx(tfm);

					*key = 0;

					*key = ~0;

					return 0;

				}

									
										20

arch/powerpc/include/asm/mmu_context.h
									
												View File
												
				@ -19,16 +19,18 @@ extern void destroy_context(struct mm_struct *mm);

				struct mm_iommu_table_group_mem_t;

				extern int isolate_lru_page(struct page *page);	/* from internal.h */

				extern bool mm_iommu_preregistered(void);

				extern long mm_iommu_get(unsigned long ua, unsigned long entries,

				extern bool mm_iommu_preregistered(struct mm_struct *mm);

				extern long mm_iommu_get(struct mm_struct *mm,

						unsigned long ua, unsigned long entries,

						struct mm_iommu_table_group_mem_t **pmem);

				extern long mm_iommu_put(struct mm_iommu_table_group_mem_t *mem);

				extern void mm_iommu_init(mm_context_t *ctx);

				extern void mm_iommu_cleanup(mm_context_t *ctx);

				extern struct mm_iommu_table_group_mem_t *mm_iommu_lookup(unsigned long ua,

						unsigned long size);

				extern struct mm_iommu_table_group_mem_t *mm_iommu_find(unsigned long ua,

						unsigned long entries);

				extern long mm_iommu_put(struct mm_struct *mm,

						struct mm_iommu_table_group_mem_t *mem);

				extern void mm_iommu_init(struct mm_struct *mm);

				extern void mm_iommu_cleanup(struct mm_struct *mm);

				extern struct mm_iommu_table_group_mem_t *mm_iommu_lookup(struct mm_struct *mm,

						unsigned long ua, unsigned long size);

				extern struct mm_iommu_table_group_mem_t *mm_iommu_find(struct mm_struct *mm,

						unsigned long ua, unsigned long entries);

				extern long mm_iommu_ua_to_hpa(struct mm_iommu_table_group_mem_t *mem,

						unsigned long ua, unsigned long *hpa);

				extern long mm_iommu_mapped_inc(struct mm_iommu_table_group_mem_t *mem);

									
										27

arch/powerpc/kernel/align.c
									
												View File
												
				@ -807,14 +807,25 @@ int fix_alignment(struct pt_regs *regs)

					nb = aligninfo[instr].len;

					flags = aligninfo[instr].flags;

					/* ldbrx/stdbrx overlap lfs/stfs in the DSISR unfortunately */

					if (IS_XFORM(instruction) && ((instruction >> 1) & 0x3ff) == 532) {

						nb = 8;

						flags = LD+SW;

					} else if (IS_XFORM(instruction) &&

						   ((instruction >> 1) & 0x3ff) == 660) {

						nb = 8;

						flags = ST+SW;

					/*

					 * Handle some cases which give overlaps in the DSISR values.

					 */

					if (IS_XFORM(instruction)) {

						switch (get_xop(instruction)) {

						case 532:	/* ldbrx */

							nb = 8;

							flags = LD+SW;

							break;

						case 660:	/* stdbrx */

							nb = 8;

							flags = ST+SW;

							break;

						case 20:	/* lwarx */

						case 84:	/* ldarx */

						case 116:	/* lharx */

						case 276:	/* lqarx */

							return 0;	/* not emulated ever */

						}

					}

					/* Byteswap little endian loads and stores */

									
										20

arch/powerpc/kernel/idle_book3s.S
									
												View File
												
				@ -439,9 +439,23 @@ END_FTR_SECTION_IFSET(CPU_FTR_ARCH_300)

				_GLOBAL(pnv_wakeup_tb_loss)

					ld	r1,PACAR1(r13)

					/*

					 * Before entering any idle state, the NVGPRs are saved in the stack

					 * and they are restored before switching to the process context. Hence

					 * until they are restored, they are free to be used.

					 * Before entering any idle state, the NVGPRs are saved in the stack.

					 * If there was a state loss, or PACA_NAPSTATELOST was set, then the

					 * NVGPRs are restored. If we are here, it is likely that state is lost,

					 * but not guaranteed -- neither ISA207 nor ISA300 tests to reach

					 * here are the same as the test to restore NVGPRS:

					 * PACA_THREAD_IDLE_STATE test for ISA207, PSSCR test for ISA300,

					 * and SRR1 test for restoring NVGPRs.

					 *

					 * We are about to clobber NVGPRs now, so set NAPSTATELOST to

					 * guarantee they will always be restored. This might be tightened

					 * with careful reading of specs (particularly for ISA300) but this

					 * is already a slow wakeup path and it's simpler to be safe.

					 */

					li	r0,1

					stb	r0,PACA_NAPSTATELOST(r13)

					/*

					 *

					 * Save SRR1 and LR in NVGPRs as they might be clobbered in

					 * opal_call() (called in CHECK_HMI_INTERRUPT). SRR1 is required

									
										4

arch/powerpc/kernel/misc_64.S
									
												View File
												
				@ -67,7 +67,7 @@ PPC64_CACHES:

				 *   flush all bytes from start through stop-1 inclusive

				 */

				_GLOBAL(flush_icache_range)

				_GLOBAL_TOC(flush_icache_range)

				BEGIN_FTR_SECTION

					PURGE_PREFETCHED_INS

					blr

				@ -120,7 +120,7 @@ EXPORT_SYMBOL(flush_icache_range)

				 *

				 *    flush all bytes from start to stop-1 inclusive

				 */

				_GLOBAL(flush_dcache_range)

				_GLOBAL_TOC(flush_dcache_range)

				/*

				 * Flush the data cache to memory

									
										2

arch/powerpc/kernel/setup-common.c
									
												View File
												
				@ -915,7 +915,7 @@ void __init setup_arch(char **cmdline_p)

					init_mm.context.pte_frag = NULL;

				#endif

				#ifdef CONFIG_SPAPR_TCE_IOMMU

					mm_iommu_init(&init_mm.context);

					mm_iommu_init(&init_mm);

				#endif

					irqstack_early_init();

					exc_lvl_early_init();

									
										9

arch/powerpc/kernel/setup_64.c
									
												View File
												
				@ -245,6 +245,15 @@ static void cpu_ready_for_interrupts(void)

						mtspr(SPRN_LPCR, lpcr | LPCR_AIL_3);

					}

					/*

					 * Fixup HFSCR:TM based on CPU features. The bit is set by our

					 * early asm init because at that point we haven't updated our

					 * CPU features from firmware and device-tree. Here we have,

					 * so let's do it.

					 */

					if (cpu_has_feature(CPU_FTR_HVMODE) && !cpu_has_feature(CPU_FTR_TM_COMP))

						mtspr(SPRN_HFSCR, mfspr(SPRN_HFSCR) & ~HFSCR_TM);

					/* Set IR and DR in PACA MSR */

					get_paca()->kernel_msr = MSR_KERNEL;

				}

									
										20

arch/powerpc/lib/sstep.c
									
												View File
												
				@ -1807,8 +1807,6 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

						goto instr_done;

					case LARX:

						if (regs->msr & MSR_LE)

							return 0;

						if (op.ea & (size - 1))

							break;		/* can't handle misaligned */

						err = -EFAULT;

				@ -1832,8 +1830,6 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

						goto ldst_done;

					case STCX:

						if (regs->msr & MSR_LE)

							return 0;

						if (op.ea & (size - 1))

							break;		/* can't handle misaligned */

						err = -EFAULT;

				@ -1859,8 +1855,6 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

						goto ldst_done;

					case LOAD:

						if (regs->msr & MSR_LE)

							return 0;

						err = read_mem(&regs->gpr[op.reg], op.ea, size, regs);

						if (!err) {

							if (op.type & SIGNEXT)

				@ -1872,8 +1866,6 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

				#ifdef CONFIG_PPC_FPU

					case LOAD_FP:

						if (regs->msr & MSR_LE)

							return 0;

						if (size == 4)

							err = do_fp_load(op.reg, do_lfs, op.ea, size, regs);

						else

				@ -1882,15 +1874,11 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

				#endif

				#ifdef CONFIG_ALTIVEC

					case LOAD_VMX:

						if (regs->msr & MSR_LE)

							return 0;

						err = do_vec_load(op.reg, do_lvx, op.ea & ~0xfUL, regs);

						goto ldst_done;

				#endif

				#ifdef CONFIG_VSX

					case LOAD_VSX:

						if (regs->msr & MSR_LE)

							return 0;

						err = do_vsx_load(op.reg, do_lxvd2x, op.ea, regs);

						goto ldst_done;

				#endif

				@ -1913,8 +1901,6 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

						goto instr_done;

					case STORE:

						if (regs->msr & MSR_LE)

							return 0;

						if ((op.type & UPDATE) && size == sizeof(long) &&

						    op.reg == 1 && op.update_reg == 1 &&

						    !(regs->msr & MSR_PR) &&

				@ -1927,8 +1913,6 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

				#ifdef CONFIG_PPC_FPU

					case STORE_FP:

						if (regs->msr & MSR_LE)

							return 0;

						if (size == 4)

							err = do_fp_store(op.reg, do_stfs, op.ea, size, regs);

						else

				@ -1937,15 +1921,11 @@ int __kprobes emulate_step(struct pt_regs *regs, unsigned int instr)

				#endif

				#ifdef CONFIG_ALTIVEC

					case STORE_VMX:

						if (regs->msr & MSR_LE)

							return 0;

						err = do_vec_store(op.reg, do_stvx, op.ea & ~0xfUL, regs);

						goto ldst_done;

				#endif

				#ifdef CONFIG_VSX

					case STORE_VSX:

						if (regs->msr & MSR_LE)

							return 0;

						err = do_vsx_store(op.reg, do_stxvd2x, op.ea, regs);

						goto ldst_done;

				#endif

									
										7

arch/powerpc/mm/hash_native_64.c
									
												View File
												
				@ -636,6 +636,10 @@ static void native_flush_hash_range(unsigned long number, int local)

					unsigned long psize = batch->psize;

					int ssize = batch->ssize;

					int i;

					unsigned int use_local;

					use_local = local && mmu_has_feature(MMU_FTR_TLBIEL) &&

						mmu_psize_defs[psize].tlbiel && !cxl_ctx_in_use();

					local_irq_save(flags);

				@ -665,8 +669,7 @@ static void native_flush_hash_range(unsigned long number, int local)

						} pte_iterate_hashed_end();

					}

					if (mmu_has_feature(MMU_FTR_TLBIEL) &&

					    mmu_psize_defs[psize].tlbiel && local) {

					if (use_local) {

						asm volatile("ptesync":::"memory");

						for (i = 0; i < number; i++) {

							vpn = batch->vpn[i];

									
										6

arch/powerpc/mm/mmu_context_book3s64.c
									
												View File
												
				@ -115,7 +115,7 @@ int init_new_context(struct task_struct *tsk, struct mm_struct *mm)

					mm->context.pte_frag = NULL;

				#endif

				#ifdef CONFIG_SPAPR_TCE_IOMMU

					mm_iommu_init(&mm->context);

					mm_iommu_init(mm);

				#endif

					return 0;

				}

				@ -156,13 +156,11 @@ static inline void destroy_pagetable_page(struct mm_struct *mm)

				}

				#endif

				void destroy_context(struct mm_struct *mm)

				{

				#ifdef CONFIG_SPAPR_TCE_IOMMU

					mm_iommu_cleanup(&mm->context);

					WARN_ON_ONCE(!list_empty(&mm->context.iommu_group_mem_list));

				#endif

				#ifdef CONFIG_PPC_ICSWX

					drop_cop(mm->context.acop, mm);

					kfree(mm->context.cop_lockp);

									
										62

arch/powerpc/mm/mmu_context_iommu.c
									
												View File
												
				@ -56,7 +56,7 @@ static long mm_iommu_adjust_locked_vm(struct mm_struct *mm,

					}

					pr_debug("[%d] RLIMIT_MEMLOCK HASH64 %c%ld %ld/%ld\n",

							current->pid,

							current ? current->pid : 0,

							incr ? '+' : '-',

							npages << PAGE_SHIFT,

							mm->locked_vm << PAGE_SHIFT,

				@ -66,12 +66,9 @@ static long mm_iommu_adjust_locked_vm(struct mm_struct *mm,

					return ret;

				}

				bool mm_iommu_preregistered(void)

				bool mm_iommu_preregistered(struct mm_struct *mm)

				{

					if (!current || !current->mm)

						return false;

					return !list_empty(&current->mm->context.iommu_group_mem_list);

					return !list_empty(&mm->context.iommu_group_mem_list);

				}

				EXPORT_SYMBOL_GPL(mm_iommu_preregistered);

				@ -124,19 +121,16 @@ static int mm_iommu_move_page_from_cma(struct page *page)

					return 0;

				}

				long mm_iommu_get(unsigned long ua, unsigned long entries,

				long mm_iommu_get(struct mm_struct *mm, unsigned long ua, unsigned long entries,

						struct mm_iommu_table_group_mem_t **pmem)

				{

					struct mm_iommu_table_group_mem_t *mem;

					long i, j, ret = 0, locked_entries = 0;

					struct page *page = NULL;

					if (!current || !current->mm)

						return -ESRCH; /* process exited */

					mutex_lock(&mem_list_mutex);

					list_for_each_entry_rcu(mem, &current->mm->context.iommu_group_mem_list,

					list_for_each_entry_rcu(mem, &mm->context.iommu_group_mem_list,

							next) {

						if ((mem->ua == ua) && (mem->entries == entries)) {

							++mem->used;

				@ -154,7 +148,7 @@ long mm_iommu_get(unsigned long ua, unsigned long entries,

					}

					ret = mm_iommu_adjust_locked_vm(current->mm, entries, true);

					ret = mm_iommu_adjust_locked_vm(mm, entries, true);

					if (ret)

						goto unlock_exit;

				@ -190,7 +184,7 @@ long mm_iommu_get(unsigned long ua, unsigned long entries,

						 * of the CMA zone if possible. NOTE: faulting in + migration

						 * can be expensive. Batching can be considered later

						 */

						if (get_pageblock_migratetype(page) == MIGRATE_CMA) {

						if (is_migrate_cma_page(page)) {

							if (mm_iommu_move_page_from_cma(page))

								goto populate;

							if (1 != get_user_pages_fast(ua + (i << PAGE_SHIFT),

				@ -215,11 +209,11 @@ populate:

					mem->entries = entries;

					*pmem = mem;

					list_add_rcu(&mem->next, &current->mm->context.iommu_group_mem_list);

					list_add_rcu(&mem->next, &mm->context.iommu_group_mem_list);

				unlock_exit:

					if (locked_entries && ret)

						mm_iommu_adjust_locked_vm(current->mm, locked_entries, false);

						mm_iommu_adjust_locked_vm(mm, locked_entries, false);

					mutex_unlock(&mem_list_mutex);

				@ -264,17 +258,13 @@ static void mm_iommu_free(struct rcu_head *head)

				static void mm_iommu_release(struct mm_iommu_table_group_mem_t *mem)

				{

					list_del_rcu(&mem->next);

					mm_iommu_adjust_locked_vm(current->mm, mem->entries, false);

					call_rcu(&mem->rcu, mm_iommu_free);

				}

				long mm_iommu_put(struct mm_iommu_table_group_mem_t *mem)

				long mm_iommu_put(struct mm_struct *mm, struct mm_iommu_table_group_mem_t *mem)

				{

					long ret = 0;

					if (!current || !current->mm)

						return -ESRCH; /* process exited */

					mutex_lock(&mem_list_mutex);

					if (mem->used == 0) {

				@ -297,6 +287,8 @@ long mm_iommu_put(struct mm_iommu_table_group_mem_t *mem)

					/* @mapped became 0 so now mappings are disabled, release the region */

					mm_iommu_release(mem);

					mm_iommu_adjust_locked_vm(mm, mem->entries, false);

				unlock_exit:

					mutex_unlock(&mem_list_mutex);

				@ -304,14 +296,12 @@ unlock_exit:

				}

				EXPORT_SYMBOL_GPL(mm_iommu_put);

				struct mm_iommu_table_group_mem_t *mm_iommu_lookup(unsigned long ua,

						unsigned long size)

				struct mm_iommu_table_group_mem_t *mm_iommu_lookup(struct mm_struct *mm,

						unsigned long ua, unsigned long size)

				{

					struct mm_iommu_table_group_mem_t *mem, *ret = NULL;

					list_for_each_entry_rcu(mem,

							&current->mm->context.iommu_group_mem_list,

							next) {

					list_for_each_entry_rcu(mem, &mm->context.iommu_group_mem_list, next) {

						if ((mem->ua <= ua) &&

								(ua + size <= mem->ua +

								 (mem->entries << PAGE_SHIFT))) {

				@ -324,14 +314,12 @@ struct mm_iommu_table_group_mem_t *mm_iommu_lookup(unsigned long ua,

				}

				EXPORT_SYMBOL_GPL(mm_iommu_lookup);

				struct mm_iommu_table_group_mem_t *mm_iommu_find(unsigned long ua,

						unsigned long entries)

				struct mm_iommu_table_group_mem_t *mm_iommu_find(struct mm_struct *mm,

						unsigned long ua, unsigned long entries)

				{

					struct mm_iommu_table_group_mem_t *mem, *ret = NULL;

					list_for_each_entry_rcu(mem,

							&current->mm->context.iommu_group_mem_list,

							next) {

					list_for_each_entry_rcu(mem, &mm->context.iommu_group_mem_list, next) {

						if ((mem->ua == ua) && (mem->entries == entries)) {

							ret = mem;

							break;

				@ -373,17 +361,7 @@ void mm_iommu_mapped_dec(struct mm_iommu_table_group_mem_t *mem)

				}

				EXPORT_SYMBOL_GPL(mm_iommu_mapped_dec);

				void mm_iommu_init(mm_context_t *ctx)

				void mm_iommu_init(struct mm_struct *mm)

				{

					INIT_LIST_HEAD_RCU(&ctx->iommu_group_mem_list);

				}

				void mm_iommu_cleanup(mm_context_t *ctx)

				{

					struct mm_iommu_table_group_mem_t *mem, *tmp;

					list_for_each_entry_safe(mem, tmp, &ctx->iommu_group_mem_list, next) {

						list_del_rcu(&mem->next);

						mm_iommu_do_free(mem);

					}

					INIT_LIST_HEAD_RCU(&mm->context.iommu_group_mem_list);

				}

									
										10

arch/powerpc/sysdev/xics/icp-opal.c
									
												View File
												
				@ -91,6 +91,16 @@ static unsigned int icp_opal_get_irq(void)

				static void icp_opal_set_cpu_priority(unsigned char cppr)

				{

					/*

					 * Here be dragons. The caller has asked to allow only IPI's and not

					 * external interrupts. But OPAL XIVE doesn't support that. So instead

					 * of allowing no interrupts allow all. That's still not right, but

					 * currently the only caller who does this is xics_migrate_irqs_away()

					 * and it works in that case.

					 */

					if (cppr >= DEFAULT_PRIORITY)

						cppr = LOWEST_PRIORITY;

					xics_set_base_cppr(cppr);

					opal_int_set_cppr(cppr);

					iosync();

									
										17

arch/powerpc/sysdev/xics/xics-common.c
									
												View File
												
				@ -20,6 +20,7 @@

				#include <linux/of.h>

				#include <linux/slab.h>

				#include <linux/spinlock.h>

				#include <linux/delay.h>

				#include <asm/prom.h>

				#include <asm/io.h>

				@ -198,9 +199,6 @@ void xics_migrate_irqs_away(void)

					/* Remove ourselves from the global interrupt queue */

					xics_set_cpu_giq(xics_default_distrib_server, 0);

					/* Allow IPIs again... */

					icp_ops->set_priority(DEFAULT_PRIORITY);

					for_each_irq_desc(virq, desc) {

						struct irq_chip *chip;

						long server;

				@ -255,6 +253,19 @@ void xics_migrate_irqs_away(void)

				unlock:

						raw_spin_unlock_irqrestore(&desc->lock, flags);

					}

					/* Allow "sufficient" time to drop any inflight IRQ's */

					mdelay(5);

					/*

					 * Allow IPIs again. This is done at the very end, after migrating all

					 * interrupts, the expectation is that we'll only get woken up by an IPI

					 * interrupt beyond this point, but leave externals masked just to be

					 * safe. If we're using icp-opal this may actually allow all

					 * interrupts anyway, but that should be OK.

					 */

					icp_ops->set_priority(DEFAULT_PRIORITY);

				}

				#endif /* CONFIG_HOTPLUG_CPU */

									
										35

arch/s390/boot/compressed/misc.c
									
												View File
												
				@ -141,31 +141,34 @@ static void check_ipl_parmblock(void *start, unsigned long size)

				unsigned long decompress_kernel(void)

				{

					unsigned long output_addr;

					unsigned char *output;

					void *output, *kernel_end;

					output_addr = ((unsigned long) &_end + HEAP_SIZE + 4095UL) & -4096UL;

					check_ipl_parmblock((void *) 0, output_addr + SZ__bss_start);

					memset(&_bss, 0, &_ebss - &_bss);

					free_mem_ptr = (unsigned long)&_end;

					free_mem_end_ptr = free_mem_ptr + HEAP_SIZE;

					output = (unsigned char *) output_addr;

					output = (void *) ALIGN((unsigned long) &_end + HEAP_SIZE, PAGE_SIZE);

					kernel_end = output + SZ__bss_start;

					check_ipl_parmblock((void *) 0, (unsigned long) kernel_end);

				#ifdef CONFIG_BLK_DEV_INITRD

					/*

					 * Move the initrd right behind the end of the decompressed

					 * kernel image.

					 * kernel image. This also prevents initrd corruption caused by

					 * bss clearing since kernel_end will always be located behind the

					 * current bss section..

					 */

					if (INITRD_START && INITRD_SIZE &&

					    INITRD_START < (unsigned long) output + SZ__bss_start) {

						check_ipl_parmblock(output + SZ__bss_start,

								    INITRD_START + INITRD_SIZE);

						memmove(output + SZ__bss_start,

							(void *) INITRD_START, INITRD_SIZE);

						INITRD_START = (unsigned long) output + SZ__bss_start;

					if (INITRD_START && INITRD_SIZE && kernel_end > (void *) INITRD_START) {

						check_ipl_parmblock(kernel_end, INITRD_SIZE);

						memmove(kernel_end, (void *) INITRD_START, INITRD_SIZE);

						INITRD_START = (unsigned long) kernel_end;

					}

				#endif

					/*

					 * Clear bss section. free_mem_ptr and free_mem_end_ptr need to be

					 * initialized afterwards since they reside in bss.

					 */

					memset(&_bss, 0, &_ebss - &_bss);

					free_mem_ptr = (unsigned long) &_end;

					free_mem_end_ptr = free_mem_ptr + HEAP_SIZE;

					puts("Uncompressing Linux... ");

					__decompress(input_data, input_len, NULL, NULL, output, 0, NULL, error);

					puts("Ok, booting the kernel.\n");

									
										2

arch/s390/include/asm/uaccess.h
									
												View File
												
				@ -144,7 +144,7 @@ unsigned long __must_check __copy_to_user(void __user *to, const void *from,

						"	jg	2b\n"				\

						".popsection\n"					\

						EX_TABLE(0b,3b) EX_TABLE(1b,3b)			\

						: "=d" (__rc), "=Q" (*(to))			\

						: "=d" (__rc), "+Q" (*(to))			\

						: "d" (size), "Q" (*(from)),			\

						  "d" (__reg0), "K" (-EFAULT)			\

						: "cc");					\

									
										19

arch/s390/mm/pgtable.c
									
												View File
												
				@ -606,12 +606,29 @@ void ptep_zap_key(struct mm_struct *mm, unsigned long addr, pte_t *ptep)

				bool test_and_clear_guest_dirty(struct mm_struct *mm, unsigned long addr)

				{

					spinlock_t *ptl;

					pgd_t *pgd;

					pud_t *pud;

					pmd_t *pmd;

					pgste_t pgste;

					pte_t *ptep;

					pte_t pte;

					bool dirty;

					ptep = get_locked_pte(mm, addr, &ptl);

					pgd = pgd_offset(mm, addr);

					pud = pud_alloc(mm, pgd, addr);

					if (!pud)

						return false;

					pmd = pmd_alloc(mm, pud, addr);

					if (!pmd)

						return false;

					/* We can't run guests backed by huge pages, but userspace can

					 * still set them up and then try to migrate them without any

					 * migration support.

					 */

					if (pmd_large(*pmd))

						return true;

					ptep = pte_alloc_map_lock(mm, pmd, addr, &ptl);

					if (unlikely(!ptep))

						return false;

									
										2

arch/sparc/kernel/ptrace_64.c
									
												View File
												
				@ -313,7 +313,7 @@ static int genregs64_set(struct task_struct *target,

					}

					if (!ret) {

						unsigned long y;

						unsigned long y = regs->y;

						ret = user_regset_copyin(&pos, &count, &kbuf, &ubuf,

									 &y,

									
										4

arch/x86/events/core.c
									
												View File
												
				@ -2096,8 +2096,8 @@ static int x86_pmu_event_init(struct perf_event *event)

				static void refresh_pce(void *ignored)

				{

					if (current->mm)

						load_mm_cr4(current->mm);

					if (current->active_mm)

						load_mm_cr4(current->active_mm);

				}

				static void x86_pmu_event_mapped(struct perf_event *event)

									
										1

arch/x86/include/asm/kvm_page_track.h
									
												View File
												
				@ -35,6 +35,7 @@ struct kvm_page_track_notifier_node {

				};

				void kvm_page_track_init(struct kvm *kvm);

				void kvm_page_track_cleanup(struct kvm *kvm);

				void kvm_page_track_free_memslot(struct kvm_memory_slot *free,

								 struct kvm_memory_slot *dont);

									
										24

arch/x86/kernel/cpu/mshyperv.c
									
												View File
												
				@ -31,6 +31,7 @@

				#include <asm/apic.h>

				#include <asm/timer.h>

				#include <asm/reboot.h>

				#include <asm/nmi.h>

				struct ms_hyperv_info ms_hyperv;

				EXPORT_SYMBOL_GPL(ms_hyperv);

				@ -158,6 +159,26 @@ static unsigned char hv_get_nmi_reason(void)

					return 0;

				}

				#ifdef CONFIG_X86_LOCAL_APIC

				/*

				 * Prior to WS2016 Debug-VM sends NMIs to all CPUs which makes

				 * it dificult to process CHANNELMSG_UNLOAD in case of crash. Handle

				 * unknown NMI on the first CPU which gets it.

				 */

				static int hv_nmi_unknown(unsigned int val, struct pt_regs *regs)

				{

					static atomic_t nmi_cpu = ATOMIC_INIT(-1);

					if (!unknown_nmi_panic)

						return NMI_DONE;

					if (atomic_cmpxchg(&nmi_cpu, -1, raw_smp_processor_id()) != -1)

						return NMI_HANDLED;

					return NMI_DONE;

				}

				#endif

				static void __init ms_hyperv_init_platform(void)

				{

					/*

				@ -183,6 +204,9 @@ static void __init ms_hyperv_init_platform(void)

						pr_info("HyperV: LAPIC Timer Frequency: %#x\n",

							lapic_timer_frequency);

					}

					register_nmi_handler(NMI_UNKNOWN, hv_nmi_unknown, NMI_FLAG_FIRST,

							     "hv_nmi_unknown");

				#endif

					if (ms_hyperv.features & HV_X64_MSR_TIME_REF_COUNT_AVAILABLE)

									
										1

arch/x86/kernel/head64.c
									
												View File
												
				@ -4,6 +4,7 @@

				 *  Copyright (C) 2000 Andrea Arcangeli <andrea@suse.de> SuSE

				 */

				#define DISABLE_BRANCH_PROFILING

				#include <linux/init.h>

				#include <linux/linkage.h>

				#include <linux/types.h>

									
										16

arch/x86/kernel/reboot.c
									
												View File
												
				@ -223,6 +223,22 @@ static struct dmi_system_id __initdata reboot_dmi_table[] = {

							DMI_MATCH(DMI_BOARD_NAME, "P4S800"),

						},

					},

					{	/* Handle problems with rebooting on ASUS EeeBook X205TA */

						.callback = set_acpi_reboot,

						.ident = "ASUS EeeBook X205TA",

						.matches = {

							DMI_MATCH(DMI_SYS_VENDOR, "ASUSTeK COMPUTER INC."),

							DMI_MATCH(DMI_PRODUCT_NAME, "X205TA"),

						},

					},

					{	/* Handle problems with rebooting on ASUS EeeBook X205TAW */

						.callback = set_acpi_reboot,

						.ident = "ASUS EeeBook X205TAW",

						.matches = {

							DMI_MATCH(DMI_SYS_VENDOR, "ASUSTeK COMPUTER INC."),

							DMI_MATCH(DMI_PRODUCT_NAME, "X205TAW"),

						},

					},

					/* Certec */

					{       /* Handle problems with rebooting on Certec BPC600 */

									
										2

arch/x86/kernel/tsc.c
									
												View File
												
				@ -1287,6 +1287,8 @@ static int __init init_tsc_clocksource(void)

					 * exporting a reliable TSC.

					 */

					if (boot_cpu_has(X86_FEATURE_TSC_RELIABLE)) {

						if (boot_cpu_has(X86_FEATURE_ART))

							art_related_clocksource = &clocksource_tsc;

						clocksource_register_khz(&clocksource_tsc, tsc_khz);

						return 0;

					}

									
										8

arch/x86/kvm/page_track.c
									
												View File
												
				@ -156,6 +156,14 @@ bool kvm_page_track_is_active(struct kvm_vcpu *vcpu, gfn_t gfn,

					return !!ACCESS_ONCE(slot->arch.gfn_track[mode][index]);

				}

				void kvm_page_track_cleanup(struct kvm *kvm)

				{

					struct kvm_page_track_notifier_head *head;

					head = &kvm->arch.track_notifier_head;

					cleanup_srcu_struct(&head->track_srcu);

				}

				void kvm_page_track_init(struct kvm *kvm)

				{

					struct kvm_page_track_notifier_head *head;

									
										1

arch/x86/kvm/x86.c
									
												View File
												
				@ -7976,6 +7976,7 @@ void kvm_arch_destroy_vm(struct kvm *kvm)

					kvm_free_vcpus(kvm);

					kvfree(rcu_dereference_check(kvm->arch.apic_map, 1));

					kvm_mmu_uninit_vm(kvm);

					kvm_page_track_cleanup(kvm);

				}

				void kvm_arch_free_memslot(struct kvm *kvm, struct kvm_memory_slot *free,

									
										2

arch/x86/lib/memcpy_64.S
									
												View File
												
				@ -290,7 +290,7 @@ EXPORT_SYMBOL_GPL(memcpy_mcsafe_unrolled)

					_ASM_EXTABLE_FAULT(.L_copy_leading_bytes, .L_memcpy_mcsafe_fail)

					_ASM_EXTABLE_FAULT(.L_cache_w0, .L_memcpy_mcsafe_fail)

					_ASM_EXTABLE_FAULT(.L_cache_w1, .L_memcpy_mcsafe_fail)

					_ASM_EXTABLE_FAULT(.L_cache_w3, .L_memcpy_mcsafe_fail)

					_ASM_EXTABLE_FAULT(.L_cache_w2, .L_memcpy_mcsafe_fail)

					_ASM_EXTABLE_FAULT(.L_cache_w3, .L_memcpy_mcsafe_fail)

					_ASM_EXTABLE_FAULT(.L_cache_w4, .L_memcpy_mcsafe_fail)

					_ASM_EXTABLE_FAULT(.L_cache_w5, .L_memcpy_mcsafe_fail)

									
										1

arch/x86/mm/kasan_init_64.c
									
												View File
												
				@ -1,3 +1,4 @@

				#define DISABLE_BRANCH_PROFILING

				#define pr_fmt(fmt) "kasan: " fmt

				#include <linux/bootmem.h>

				#include <linux/kasan.h>

									
										4

arch/x86/mm/kaslr.c
									
												View File
												
				@ -48,7 +48,7 @@ static const unsigned long vaddr_start = __PAGE_OFFSET_BASE;

				#if defined(CONFIG_X86_ESPFIX64)

				static const unsigned long vaddr_end = ESPFIX_BASE_ADDR;

				#elif defined(CONFIG_EFI)

				static const unsigned long vaddr_end = EFI_VA_START;

				static const unsigned long vaddr_end = EFI_VA_END;

				#else

				static const unsigned long vaddr_end = __START_KERNEL_map;

				#endif

				@ -105,7 +105,7 @@ void __init kernel_randomize_memory(void)

					 */

					BUILD_BUG_ON(vaddr_start >= vaddr_end);

					BUILD_BUG_ON(IS_ENABLED(CONFIG_X86_ESPFIX64) &&

						     vaddr_end >= EFI_VA_START);

						     vaddr_end >= EFI_VA_END);

					BUILD_BUG_ON((IS_ENABLED(CONFIG_X86_ESPFIX64) ||

						      IS_ENABLED(CONFIG_EFI)) &&

						     vaddr_end >= __START_KERNEL_map);

									
										23

arch/x86/pci/xen.c
									
												View File
												
				@ -234,23 +234,14 @@ static int xen_hvm_setup_msi_irqs(struct pci_dev *dev, int nvec, int type)

						return 1;

					for_each_pci_msi_entry(msidesc, dev) {

						__pci_read_msi_msg(msidesc, &msg);

						pirq = MSI_ADDR_EXT_DEST_ID(msg.address_hi) |

							((msg.address_lo >> MSI_ADDR_DEST_ID_SHIFT) & 0xff);

						if (msg.data != XEN_PIRQ_MSI_DATA ||

						    xen_irq_from_pirq(pirq) < 0) {

							pirq = xen_allocate_pirq_msi(dev, msidesc);

							if (pirq < 0) {

								irq = -ENODEV;

								goto error;

							}

							xen_msi_compose_msg(dev, pirq, &msg);

							__pci_write_msi_msg(msidesc, &msg);

							dev_dbg(&dev->dev, "xen: msi bound to pirq=%d\n", pirq);

						} else {

							dev_dbg(&dev->dev,

								"xen: msi already bound to pirq=%d\n", pirq);

						pirq = xen_allocate_pirq_msi(dev, msidesc);

						if (pirq < 0) {

							irq = -ENODEV;

							goto error;

						}

						xen_msi_compose_msg(dev, pirq, &msg);

						__pci_write_msi_msg(msidesc, &msg);

						dev_dbg(&dev->dev, "xen: msi bound to pirq=%d\n", pirq);

						irq = xen_bind_pirq_msi_to_irq(dev, msidesc, pirq,

									       (type == PCI_CAP_ID_MSI) ? nvec : 1,

									       (type == PCI_CAP_ID_MSIX) ?

									
										6

arch/x86/xen/setup.c
									
												View File
												
				@ -713,10 +713,9 @@ static void __init xen_reserve_xen_mfnlist(void)

						size = PFN_PHYS(xen_start_info->nr_p2m_frames);

					}

					if (!xen_is_e820_reserved(start, size)) {

						memblock_reserve(start, size);

					memblock_reserve(start, size);

					if (!xen_is_e820_reserved(start, size))

						return;

					}

				#ifdef CONFIG_X86_32

					/*

				@ -727,6 +726,7 @@ static void __init xen_reserve_xen_mfnlist(void)

					BUG();

				#else

					xen_relocate_p2m();

					memblock_free(start, size);

				#endif

				}

Compare commits

474 Commits v4.9.15 ... v4.9.22

44 Documentation/arm64/silicon-errata.txt Unescape Escape View File

5 Documentation/devicetree/bindings/arm/arch_timer.txt Unescape Escape View File

8 Documentation/devicetree/bindings/display/sunxi/sun4i-drm.txt Unescape Escape View File

1 Documentation/devicetree/bindings/usb/usb-xhci.txt Unescape Escape View File

9 Documentation/devicetree/bindings/watchdog/samsung-wdt.txt Unescape Escape View File

10 Documentation/kernel-parameters.txt Unescape Escape View File

2 Documentation/stable_kernel_rules.txt Unescape Escape View File

4 Makefile Unescape Escape View File

4 arch/arm/boot/dts/bcm5301x.dtsi Unescape Escape View File

2 arch/arm/boot/dts/sama5d2.dtsi Unescape Escape View File

1 arch/arm/boot/dts/stih407-family.dtsi Unescape Escape View File

4 arch/arm/kernel/armksyms.c Unescape Escape View File

14 arch/arm/kernel/smccc-call.S Unescape Escape View File

23 arch/arm/kvm/mmu.c Unescape Escape View File

18 arch/arm/mach-at91/pm.c Unescape Escape View File

28 arch/arm/mach-bcm/bcm_5301x.c Unescape Escape View File

1 arch/arm/mach-davinci/da8xx-dt.c Unescape Escape View File

10 arch/arm64/Kconfig Unescape Escape View File

2 arch/arm64/boot/dts/hisilicon/hip06.dtsi Unescape Escape View File

4 arch/arm64/kernel/arm64ksyms.c Unescape Escape View File

7 arch/arm64/kernel/asm-offsets.c Unescape Escape View File

10 arch/arm64/kernel/kaslr.c Unescape Escape View File

28 arch/arm64/kernel/pci.c Unescape Escape View File

23 arch/arm64/kernel/smccc-call.S Unescape Escape View File

64 arch/arm64/kvm/hyp/tlb.c Unescape Escape View File

42 arch/arm64/mm/fault.c Unescape Escape View File

41 arch/c6x/kernel/ptrace.c Unescape Escape View File

8 arch/h8300/kernel/ptrace.c Unescape Escape View File

15 arch/metag/include/asm/uaccess.h Unescape Escape View File

19 arch/metag/kernel/ptrace.c Unescape Escape View File

324 arch/metag/lib/usercopy.c Unescape Escape View File

2 arch/mips/Kconfig Unescape Escape View File

4 arch/mips/configs/ip22_defconfig Unescape Escape View File

3 arch/mips/configs/ip27_defconfig Unescape Escape View File

2 arch/mips/configs/lemote2f_defconfig Unescape Escape View File

4 arch/mips/configs/malta_defconfig Unescape Escape View File

4 arch/mips/configs/malta_kvm_defconfig Unescape Escape View File

4 arch/mips/configs/malta_kvm_guest_defconfig Unescape Escape View File

4 arch/mips/configs/maltaup_xpa_defconfig Unescape Escape View File

2 arch/mips/configs/nlm_xlp_defconfig Unescape Escape View File

2 arch/mips/configs/nlm_xlr_defconfig Unescape Escape View File

6 arch/mips/include/asm/mach-ip27/spaces.h Unescape Escape View File

8 arch/mips/include/asm/spinlock.h Unescape Escape View File

2 arch/mips/kernel/cpu-probe.c Unescape Escape View File

4 arch/mips/kernel/genex.S Unescape Escape View File

3 arch/mips/kernel/ptrace.c Unescape Escape View File

17 arch/mips/kernel/traps.c Unescape Escape View File

36 arch/mips/lantiq/irq.c Unescape Escape View File

2 arch/mips/lantiq/xway/sysctrl.c Unescape Escape View File

2 arch/mips/mm/c-r4k.c Unescape Escape View File

25 arch/mips/mm/tlbex.c Unescape Escape View File

9 arch/mips/ralink/prom.c Unescape Escape View File

10 arch/mips/ralink/rt288x.c Unescape Escape View File

11 arch/mips/ralink/rt305x.c Unescape Escape View File

14 arch/mips/ralink/rt3883.c Unescape Escape View File

14 arch/mips/ralink/timer.c Unescape Escape View File

2 arch/mips/sgi-ip22/Platform Unescape Escape View File

7 arch/nios2/kernel/prom.c Unescape Escape View File

3 arch/nios2/kernel/setup.c Unescape Escape View File

23 arch/parisc/include/asm/cacheflush.h Unescape Escape View File

59 arch/parisc/include/asm/uaccess.h Unescape Escape View File

22 arch/parisc/kernel/cache.c Unescape Escape View File

10 arch/parisc/kernel/parisc_ksyms.c Unescape Escape View File

4 arch/parisc/kernel/process.c Unescape Escape View File

2 arch/parisc/lib/Makefile Unescape Escape View File

98 arch/parisc/lib/fixup.S Unescape Escape View File

318 arch/parisc/lib/lusercopy.S Unescape Escape View File

461 arch/parisc/lib/memcpy.c Unescape Escape View File

17 arch/parisc/mm/fault.c Unescape Escape View File

1 arch/powerpc/boot/zImage.lds.S Unescape Escape View File

5 arch/powerpc/crypto/crc32c-vpmsum_glue.c Unescape Escape View File

20 arch/powerpc/include/asm/mmu_context.h Unescape Escape View File

27 arch/powerpc/kernel/align.c Unescape Escape View File

20 arch/powerpc/kernel/idle_book3s.S Unescape Escape View File

4 arch/powerpc/kernel/misc_64.S Unescape Escape View File

2 arch/powerpc/kernel/setup-common.c Unescape Escape View File

9 arch/powerpc/kernel/setup_64.c Unescape Escape View File

20 arch/powerpc/lib/sstep.c Unescape Escape View File

474 Commits

v4.9.15 ... v4.9.22

44

Documentation/arm64/silicon-errata.txt

View File

5

Documentation/devicetree/bindings/arm/arch_timer.txt

View File

8

Documentation/devicetree/bindings/display/sunxi/sun4i-drm.txt

View File

1

Documentation/devicetree/bindings/usb/usb-xhci.txt

View File

9

Documentation/devicetree/bindings/watchdog/samsung-wdt.txt

View File

10

Documentation/kernel-parameters.txt

View File

2

Documentation/stable_kernel_rules.txt

View File

4

Makefile

View File

4

arch/arm/boot/dts/bcm5301x.dtsi

View File

2

arch/arm/boot/dts/sama5d2.dtsi

View File

1

arch/arm/boot/dts/stih407-family.dtsi

View File

4

arch/arm/kernel/armksyms.c

View File

14

arch/arm/kernel/smccc-call.S

View File

23

arch/arm/kvm/mmu.c

View File

18

arch/arm/mach-at91/pm.c

View File

28

arch/arm/mach-bcm/bcm_5301x.c

View File

1

arch/arm/mach-davinci/da8xx-dt.c

View File

10

arch/arm64/Kconfig

View File

2

arch/arm64/boot/dts/hisilicon/hip06.dtsi

View File

4

arch/arm64/kernel/arm64ksyms.c

View File

7

arch/arm64/kernel/asm-offsets.c

View File

10

arch/arm64/kernel/kaslr.c

View File

28

arch/arm64/kernel/pci.c

View File

23

arch/arm64/kernel/smccc-call.S

View File

64

arch/arm64/kvm/hyp/tlb.c

View File

42

arch/arm64/mm/fault.c

View File

41

arch/c6x/kernel/ptrace.c

View File

8

arch/h8300/kernel/ptrace.c

View File

15

arch/metag/include/asm/uaccess.h

View File

19

arch/metag/kernel/ptrace.c

View File

324

arch/metag/lib/usercopy.c

View File

2

arch/mips/Kconfig

View File

4

arch/mips/configs/ip22_defconfig

View File

3

arch/mips/configs/ip27_defconfig

View File

2

arch/mips/configs/lemote2f_defconfig

View File

4

arch/mips/configs/malta_defconfig

View File

4

arch/mips/configs/malta_kvm_defconfig

View File

4

arch/mips/configs/malta_kvm_guest_defconfig

View File

4

arch/mips/configs/maltaup_xpa_defconfig

View File

2

arch/mips/configs/nlm_xlp_defconfig

View File

2

arch/mips/configs/nlm_xlr_defconfig

View File

6

arch/mips/include/asm/mach-ip27/spaces.h

View File

8

arch/mips/include/asm/spinlock.h

View File

2

arch/mips/kernel/cpu-probe.c

View File

4

arch/mips/kernel/genex.S

View File

3

arch/mips/kernel/ptrace.c

View File

17

arch/mips/kernel/traps.c

View File

36

arch/mips/lantiq/irq.c

View File

2

arch/mips/lantiq/xway/sysctrl.c

View File

2

arch/mips/mm/c-r4k.c

View File

25

arch/mips/mm/tlbex.c

View File

9

arch/mips/ralink/prom.c

View File

10

arch/mips/ralink/rt288x.c

View File

11

arch/mips/ralink/rt305x.c

View File

14

arch/mips/ralink/rt3883.c

View File

14

arch/mips/ralink/timer.c

View File

2

arch/mips/sgi-ip22/Platform

View File

7

arch/nios2/kernel/prom.c

View File

3

arch/nios2/kernel/setup.c

View File

23

arch/parisc/include/asm/cacheflush.h

View File

59

arch/parisc/include/asm/uaccess.h

View File

22

arch/parisc/kernel/cache.c

View File

10

arch/parisc/kernel/parisc_ksyms.c

View File

4

arch/parisc/kernel/process.c

View File

2

arch/parisc/lib/Makefile

View File

98

arch/parisc/lib/fixup.S

View File

318

arch/parisc/lib/lusercopy.S

View File

461

arch/parisc/lib/memcpy.c

View File

17

arch/parisc/mm/fault.c

View File

1

arch/powerpc/boot/zImage.lds.S

View File

5

arch/powerpc/crypto/crc32c-vpmsum_glue.c

View File

20

arch/powerpc/include/asm/mmu_context.h

View File

27

arch/powerpc/kernel/align.c

View File

20

arch/powerpc/kernel/idle_book3s.S

View File

4

arch/powerpc/kernel/misc_64.S

View File

2

arch/powerpc/kernel/setup-common.c

View File

9

arch/powerpc/kernel/setup_64.c

View File

20

arch/powerpc/lib/sstep.c

View File

7

arch/powerpc/mm/hash_native_64.c

View File