-
Notifications
You must be signed in to change notification settings - Fork 74.7k
Insights: tensorflow/tensorflow
Overview
Could not load contribution data
Please try again later
208 Pull requests merged by 5 people
-
Change AsyncRunTask to be move-only.
#95365 merged
Jun 14, 2025 -
[IFRT] Use SerDes versioning in most IFRT and PjRt-IFRT types
#94890 merged
Jun 14, 2025 -
Return error when non-zero checks fail for div.
#95290 merged
Jun 14, 2025 -
Use blocking_thread_pool for H2D transfer
#95358 merged
Jun 14, 2025 -
Reverts 75aa15587bec5fd8b24c22b21f8d6c0ed6726f86
#95363 merged
Jun 13, 2025 -
Record subphase action information in HloRunnerPjRt.
#94700 merged
Jun 13, 2025 -
Update dependencies for
XNNPACK
andKleidiAI
.#95259 merged
Jun 13, 2025 -
Update visibility for python/data/experimental/ops/global_shuffle_op.
#95291 merged
Jun 13, 2025 -
[IFRT] Change
DeviceTest
toDeviceTestFixture
#95353 merged
Jun 13, 2025 -
[XLA:CPU] Emit xla.exp from legacy pipeline.
#95090 merged
Jun 13, 2025 -
[XLA:benchmarks] Create an HLO folder for benchmarking and add a sample hlo
#95355 merged
Jun 13, 2025 -
Reverts 0acc54bd1bb16186eb04b4681395b58935da52d0
#95350 merged
Jun 13, 2025 -
Delete all uses of
test_macros.h
as this file is now empty.#95284 merged
Jun 13, 2025 -
Rename
options
tooptions_ptr
intflite::xnnpack::Delegate::Delegate()
.#94957 merged
Jun 13, 2025 -
Added heartbeat_timeout argument.
#95292 merged
Jun 13, 2025 -
Add the FusedConv2DBiasActivation op tf dialect.
#95147 merged
Jun 13, 2025 -
Remove
lite/quantization/ir:QuantOps
fromtf_quant_opt.cc
#95285 merged
Jun 13, 2025 -
Change
SerDesVersion
's enum toSerDesVersionNumber
#95288 merged
Jun 13, 2025 -
Update dynamic shape support to be based on raw buffers.
#95224 merged
Jun 13, 2025 -
[pjrt] Improved CUDA detection in the Triton PjRt extension
#95338 merged
Jun 13, 2025 -
[XLA:GPU] Add the Triton emitter support for pad in nested fusions.
#95267 merged
Jun 13, 2025 -
PR #27481: [ROCm] enable hidden unit tests on rocm-1
#95337 merged
Jun 13, 2025 -
PR #27266: [ROCm] analytical latency estimator support for rocm
#95187 merged
Jun 13, 2025 -
[XLA:GPU] Fix an issue with not resetting preferred_consumer_
#95334 merged
Jun 13, 2025 -
[XLA:GPU] Add scoped timer to HLO execution in functional HLO runner.
#95279 merged
Jun 13, 2025 -
Cache invalidation for XProf
#93080 merged
Jun 13, 2025 -
[XLA:GPU] Add separate expected precision tolerance for BLAS.
#95330 merged
Jun 13, 2025 -
Reduce severity of logging in
StreamExecutorGpuClient::gpu_run_option
.#95331 merged
Jun 13, 2025 -
PR #26948: SPMD Partial Windowed Einsums
#95191 merged
Jun 13, 2025 -
[XLA:GPU] Improve dot algorithm precision tests with reference computation
#95328 merged
Jun 13, 2025 -
PR #27737: [ROCm] fixed gpu_hlo_unoptimized_llvm.hlo.test on rocm
#95287 merged
Jun 13, 2025 -
PR #26632: Add support for NVSHMEM collective_permute
#95263 merged
Jun 13, 2025 -
Move Shardy mesh inlining to right after deserialization.
#95281 merged
Jun 13, 2025 -
Replace outdated select() on --cpu in tensorflow/BUILD and related files with platform API equivalent.
#94745 merged
Jun 13, 2025 -
Extract HloOperandIndex into its own target (NFC).
#95323 merged
Jun 13, 2025 -
Automated Code Change
#95229 merged
Jun 13, 2025 -
strings_ops: Test "invalid" unicode_encode/unicode_decode inputs
#95321 merged
Jun 13, 2025 -
Replace deprecated tsl::errors::Internal with absl::InternalError
#95225 merged
Jun 13, 2025 -
Add missing rocm_headers dependency in rocm_library macro
#95276 merged
Jun 13, 2025 -
[IFRT] Add
empty
method to baseDeviceList
.#95289 merged
Jun 13, 2025 -
[DOC] Update Docker image in build_from_source and dev_guide docs.
#95220 merged
Jun 13, 2025 -
Refactor
subprocess_compilation
target.#95063 merged
Jun 13, 2025 -
Refactor CopyRawDeviceToHost into CopyRawDeviceToHostAndReturnEvent
#95201 merged
Jun 13, 2025 -
Always use staging buffer for ToLiteral.
#95198 merged
Jun 13, 2025 -
Don't reshape resource tensors in XNNPACK
#95283 merged
Jun 13, 2025 -
Use incarnation ids in GPU clique keys.
#93271 merged
Jun 12, 2025 -
cleanup for attrs_and_constraints target
#95129 merged
Jun 12, 2025 -
Replace
DISABLED_ON_TPU
withtest::DeviceTypeIs(test::kTpu)
inxla
#95221 merged
Jun 12, 2025 -
Set --gunit_fail_if_no_test_selected for XLA.
#95065 merged
Jun 12, 2025 -
#sdy Move unreduced frontend attribute to xla constants and clean up.
#95202 merged
Jun 12, 2025 -
Reverts 693e65e93e8f547d42ed01816bf5f56175b191bc
#95227 merged
Jun 12, 2025 -
Update xprof to 2.20.0
#95179 merged
Jun 12, 2025 -
Don't record callstack for XLA:CPU in OSS since streamz doesn't work there
#95207 merged
Jun 12, 2025 -
LiteralUtil: add method to get literal pointers
#95258 merged
Jun 12, 2025 -
[XLA:GPU] Make ScopedLoggingTimer also logs the elapsed time in micros.
#95270 merged
Jun 12, 2025 -
[xla:gpu] Add blas backends to the precision tests
#95272 merged
Jun 12, 2025 -
PR #27188: [XLA:GPU] Hermetic Build oneAPI
#95199 merged
Jun 12, 2025 -
[xla:cpu] Do not fuse too many reductions together
#95223 merged
Jun 12, 2025 -
[XLA:CPU] Add reduce-window and reduce over outer dimension microbenchmarks.
#95266 merged
Jun 12, 2025 -
[XLA:GPU]: Use Store/Load with a counter instead of CAS for all-reduce
#95189 merged
Jun 12, 2025 -
Bump the version of StableHLO and VHLO indicating that mixed serialization is supported.
#95256 merged
Jun 12, 2025 -
Clean up dependencies of ROCm targets
#95252 merged
Jun 12, 2025 -
Create separate methods for creating kernel from ptx and cubin
#95251 merged
Jun 12, 2025 -
[XLA:GPU] print histogram of the relative errors in the Triton dot algorithm test.
#95257 merged
Jun 12, 2025 -
PR #27700: [debug-options-dump] Fix dumping for repeated string fields
#95197 merged
Jun 12, 2025 -
PR #27721: Always evaluate while init in loop analysis.
#95250 merged
Jun 12, 2025 -
Automated Code Change
#95238 merged
Jun 12, 2025 -
PR #23688: [ROCm] Triton performance fixes
#95255 merged
Jun 12, 2025 -
Remove unused ForwardsValue hook (NFC).
#95246 merged
Jun 12, 2025 -
Internal change only
#95254 merged
Jun 12, 2025 -
PR #27718: Adds latency hiding scheduling support for DUS s5
#95226 merged
Jun 12, 2025 -
PR #27128: Make copy_fusion respect the budget limit
#95186 merged
Jun 12, 2025 -
PR #27680: Build nvshmem on arm64 machines
#95237 merged
Jun 12, 2025 -
Work around a compiler issue in GCC 8
#95233 merged
Jun 12, 2025 -
Automated Code Change
#95161 merged
Jun 12, 2025 -
Bump requests from 2.32.3 to 2.32.4
#95112 merged
Jun 12, 2025 -
Automated Code Change
#95167 merged
Jun 12, 2025 -
Add retry_on_oom paramater to AllocateRawBuffer to allow
#95088 merged
Jun 12, 2025 -
Integrate LLVM at llvm/llvm-project@842377882a3f
#95174 merged
Jun 12, 2025 -
remove old copy of quantization_lib and rename new one
#95085 merged
Jun 12, 2025 -
Remove all uses of
XLA_TYPED_TEST
#95211 merged
Jun 11, 2025 -
Support QAT aware conversion with dynamic shape models
#94830 merged
Jun 11, 2025 -
Revamp StableHLO folder patterns to use MLIR folding infra, to not expand splats.
#95154 merged
Jun 11, 2025 -
#sdy Skip all-reduce along sharding dimension on ops with unreduced frontend attribute.
#94903 merged
Jun 11, 2025 -
ifdef guard recording GPU compilation stacks to Google
#95204 merged
Jun 11, 2025 -
Add
DeviceTypeIs
toxla_test_backend_predicates
to replaceXLA_TEST_BACKEND_TPU
and similar#95209 merged
Jun 11, 2025 -
[IFRT IR] Add method for fingerprinting ModuleOp.
#95196 merged
Jun 11, 2025 -
#sdy Mark unreduced ops with frontend attributes.
#94979 merged
Jun 11, 2025 -
Implement Shardy to HLO transformation
#95200 merged
Jun 11, 2025 -
[XLA] Early reject for Reduce Window Rewriter to eliminate cases faster
#95145 merged
Jun 11, 2025 -
[GPU] Replace CudnnLegacyFusedConvRunner with CudnnExecutionPlanRunner
#95078 merged
Jun 11, 2025 -
Optimize rotation loop during Eval
#95159 merged
Jun 11, 2025 -
Extract hlo input / output format enums to separate files outside of functional_hlo_runner.
#94136 merged
Jun 11, 2025 -
Add user-agent for GCS C-API
#95131 merged
Jun 11, 2025 -
[XLA] Allow ReshapeMover to perform chain moves.
#95148 merged
Jun 11, 2025 -
Fix xnnpack-delegate performance regression
#94944 merged
Jun 11, 2025 -
[XLA:GPU]: Use double buffering for the one-shot all-reduce kernel
#95004 merged
Jun 11, 2025 -
[XLA:GPU] NFC sort ops in support test
#95190 merged
Jun 11, 2025 -
[XLA:GPU]: Add tests for all-reduce within a while-loop
#94934 merged
Jun 11, 2025 -
[xla:gpu] NestGemmFusion: also hoist bitcasts downwards across transposes/broadcasts.
#94866 merged
Jun 11, 2025 -
[XLA:GPU] update symbolic tile analysis to propagate rutime variables
#95123 merged
Jun 11, 2025 -
[XLA:GPU] Convert
RendezvousArg
pair to a struct.#95182 merged
Jun 11, 2025 -
PR #27635: [ROCm] enable hidden unit tests on rocm-2
#95181 merged
Jun 11, 2025 -
[XLA:CPU] Create memory mappers named after LLVM modules.
#95055 merged
Jun 11, 2025 -
Make GpuTestKernels use the GpuKernelRegistry
#95109 merged
Jun 11, 2025 -
Enable cublas backend. Skip codegen backends which do not support an instruction.
#95116 merged
Jun 11, 2025 -
[XLA:CPU] Save obj file names in the executable proto
#95057 merged
Jun 11, 2025 -
PR #27498: [ROCm] Add new hip_runtime bazel target
#95111 merged
Jun 11, 2025 -
[XLA:CPU] Block fusions of subcomputations if the parent can be fused.
#95117 merged
Jun 11, 2025 -
fix HloToStablehlo copy of backend_config into frontend_attributes
#95139 merged
Jun 11, 2025 -
[XLA:GPU] Return empty vector instead of throwing and error in the Triton autotuner backend.
#95178 merged
Jun 11, 2025 -
PR #27371: PJRT_Executable_DeserializeAndLoad: plumb compile options
#95175 merged
Jun 11, 2025 -
[XLA:CPU][autotuning] LLVM kernel autotuner implementation
#94880 merged
Jun 11, 2025 -
[XLA:CPU] Fix order of lowered work item id
#94941 merged
Jun 11, 2025 -
Automated Code Change
#95165 merged
Jun 11, 2025 -
Remove unused multi PTX feature from CudaPtxInMemory
#95125 merged
Jun 11, 2025 -
Automated Code Change
#95169 merged
Jun 11, 2025 -
Make IFRT Proxy create device list using the underlying IFRT Client's MakeDeviceList
#95172 merged
Jun 11, 2025 -
Allow out of bounds strided slice ops to delegate
#95144 merged
Jun 11, 2025 -
[XLA:Collective] Fix reduction computation type bug in while_loop_all_reduce_code_motion_setup.
#95082 merged
Jun 11, 2025 -
Reverts a854d8322fa93c9b644e3474fe0e4866d769669b
#94713 merged
Jun 11, 2025 -
Reverts 667a7af1e172dec7fd24718c15dc9e961d990384
#95155 merged
Jun 11, 2025 -
[XLA] Make EraseElementFromVector return a bool instead of a status.
#95153 merged
Jun 11, 2025 -
remove lite quant deps from tfr-opt and use tf quant deps instead
#95143 merged
Jun 10, 2025 -
cleanup for lift_as_function_call
#95126 merged
Jun 10, 2025 -
Bump googletest revision to pick up new fail_if_no_test_selected flag.
#95146 merged
Jun 10, 2025 -
Elide
gpu_
prefix from GPU backends inxla_test
#94752 merged
Jun 10, 2025 -
[XLA:GPU] Make functional runner respect disabled SPMD partitioning
#95067 merged
Jun 10, 2025 -
Add
num_sparsecores_per_device
attribute to custom combiner BWD ops.#95093 merged
Jun 10, 2025 -
[XLA] Refactoring for Reduce Window Rewriter
#95073 merged
Jun 10, 2025 -
[XLA] Make reshape-mover a bit less dependent on algsimp.
#95134 merged
Jun 10, 2025 -
Reverts cc47e60b44b55fab0a7731b5e1f411b67aef57b4
#94948 merged
Jun 10, 2025 -
[XLA] Remove XLA_TEST_P and XLA_TEST_F
#95135 merged
Jun 10, 2025 -
Use
NVSHMEM
tar files in RBE CUDA builds for XLA and Tensorflow.#95140 merged
Jun 10, 2025 -
Add helper for stripping out dynamic shape metadata for a raw buffer.
#95091 merged
Jun 10, 2025 -
Account for control dependencies in HloSchedule::Update().
#95074 merged
Jun 10, 2025 -
Fix variable ops in the flex delegate
#95009 merged
Jun 10, 2025 -
Move memory space clearance to right before host offloader work
#93890 merged
Jun 10, 2025 -
Add
last_checkpoint_step
parameter to CheckpointManager init.#95136 merged
Jun 10, 2025 -
Fix ops attempting to handle filter and bias of mismatched floating point types
#95054 merged
Jun 10, 2025 -
#HLODiff Extends HloComputationGraphMatcher to match leaf nodes other than constants and parameters.
#94760 merged
Jun 10, 2025 -
Extend tf2xla to support Shardy.
#94489 merged
Jun 10, 2025 -
Add python extension for registering AutoSharding to XLA pipeline.
#95133 merged
Jun 10, 2025 -
Improve device assignment string format and add AbslStringify implementation.
#94882 merged
Jun 10, 2025 -
Reverts 9882facbb94c0b0e5a07bc38e445cd177440c9e9
#95128 merged
Jun 10, 2025 -
When subgraph reshaping is enabled, we need to clear all the externals when reshaping, not just the inputs.
#95053 merged
Jun 10, 2025 -
Reverts fc61865027fbe24c6755f71e979b78b2474db1e9
#94980 merged
Jun 10, 2025 -
Build python3.14.0b1 from source and install it in the linux docker images
#95068 merged
Jun 10, 2025 -
cleanup for fake_quant_utils
#95124 merged
Jun 10, 2025 -
[XLA] Replace XLA_TEST_P with TEST_P
#94972 merged
Jun 10, 2025 -
[XLA] Replace XLA_TEST_P with TEST_P
#94975 merged
Jun 10, 2025 -
[XLA] Replace XLA_TEST_F with TEST_F
#94968 merged
Jun 10, 2025 -
[XLA] Replace XLA_TEST_F with TEST_F
#94962 merged
Jun 10, 2025 -
Remove unused LlvmHostKernel
#95122 merged
Jun 10, 2025 -
Fix CopyToMemorySpace typo bug.
#95076 merged
Jun 10, 2025 -
[XLA:CPU] Add ExpF64Avx512 benchmark.
#95081 merged
Jun 10, 2025 -
Move REVERSE_V2 implementation to separate header file.
#94856 merged
Jun 10, 2025 -
[XLA:CPU] Fix xla math lib to scan for vectorized function names.
#95080 merged
Jun 10, 2025 -
[XLA:GPU] Add proto serialization for PartitionIdThunk
#94659 merged
Jun 10, 2025 -
[XLA:GPU] indexing map serialization: print undefined + tests
#94809 merged
Jun 10, 2025 -
[XLA] Remove merging pad into reduce-window - because of creating unsual patterns
#94924 merged
Jun 10, 2025 -
Sync patches to unbreak windows build
#95118 merged
Jun 10, 2025 -
PR #27537: [run-hlo-module] Fix the debug options test
#95070 merged
Jun 10, 2025 -
PR #27596: Bump github/codeql-action from 3.28.10 to 3.28.19
#95110 merged
Jun 10, 2025 -
[XLA:GPU] Run SoL cost model for -On.
#94861 merged
Jun 10, 2025 -
PR #26268: [ROCm] Introduce xla_gpu_use_inprocess_lld to invoke ldd as a library
#94923 merged
Jun 10, 2025 -
Better error messages for thunk deserialization errors
#94867 merged
Jun 10, 2025 -
#sdy Replace manual axes that are bound to a duplicate mesh in SdyRoundTripDedupMeshesPass
#94936 merged
Jun 10, 2025 -
Add serialization support for DeviceToDeviceCopyThunk
#94862 merged
Jun 10, 2025 -
Adds interface to update target specific states in latency hiding scheduler.
#94763 merged
Jun 10, 2025 -
Remove unused Bazel macro build_cub_sort_kernels
#95108 merged
Jun 10, 2025 -
Make the kv store timeout configurable for cross-host device transfers.
#95069 merged
Jun 10, 2025 -
Remove tensorflow-intel from Linux wheel metadata
#94854 merged
Jun 10, 2025 -
Improve error handling in transmission of buffer metadata for experimental cross-host device transfers.
#94685 merged
Jun 10, 2025 -
Integrate LLVM at llvm/llvm-project@649020c68016
#95064 merged
Jun 10, 2025 -
* Support buffer coloring by updating MSA behavior:
#94483 merged
Jun 10, 2025 -
Move trace event after we select the device
#95084 merged
Jun 10, 2025 -
[XLA] Better logs for cycle detector for scheduling groups
#95083 merged
Jun 10, 2025 -
Fix buffer overflow bug in
HloLexer::LexInt64Impl
and add regression tests.#94982 merged
Jun 10, 2025 -
[XLA] Handle more cases in IotaTileAssignment::Transpose.
#94947 merged
Jun 10, 2025 -
Integrate hermetic
nvshmem
repository in XLA and TF projects.#94894 merged
Jun 9, 2025 -
Remove redundant build dep.
#95077 merged
Jun 9, 2025 -
Uses the correct SparseCore count for custom combiner BWD op in AoT compilation.
#95071 merged
Jun 9, 2025 -
remove old copy of uniform_op_quant_spec, tf_to_uniform_attribute_utils, tf_op_quant and rename new one
#94754 merged
Jun 9, 2025 -
remove old copy of fuse_convolution_pass and rename new one
#94321 merged
Jun 9, 2025 -
Fix TF nightly auditwheel repair due to pywrap
#94921 merged
Jun 9, 2025 -
Update rules_python patch file to get python 3.14.0b1
#95066 merged
Jun 9, 2025 -
remove old copy of constant_fold and rename new one
#94813 merged
Jun 9, 2025 -
Remove all uses of
DISABLED_ON_DEBUG
and delete it. We can check ifNDEBUG
is set directly#94978 merged
Jun 9, 2025 -
[tosa] Support variable bias for convolutional ops
#94438 merged
Jun 9, 2025 -
Remove all uses of
DISABLED_ON_INTERPRETER_TSAN
#94976 merged
Jun 9, 2025 -
Add a test case in collective pipeliner that runs backward & forward passes back-to-back.
#94974 merged
Jun 9, 2025 -
Remove
TestPlatform
fromtest_macros.{cc,h}
as it is unused#94981 merged
Jun 9, 2025 -
Add support for converting
sdy.reduce_scatter
into (1 or more)stablehlo.reduce_scatter
.#94938 merged
Jun 9, 2025 -
Files should be open with O_BINARY on Windows.
#95010 merged
Jun 8, 2025 -
Introduces an IOPDDL-based implementation of the heuristic solver(s).
#94942 merged
Jun 7, 2025
102 Pull requests opened by 5 people
-
Automated Code Change
#95026 opened
Jun 8, 2025 -
Automated Code Change
#95028 opened
Jun 8, 2025 -
Fix: Safely Capture and Store TF Operation Stack Trace at Creation to Prevent Dangling Reference Errors
#95034 opened
Jun 8, 2025 -
Automated Code Change
#95044 opened
Jun 9, 2025 -
Automated Code Change
#95045 opened
Jun 9, 2025 -
Automated Code Change
#95046 opened
Jun 9, 2025 -
Cleanup: rename from tensorflow_stats to framework_op_stats
#95072 opened
Jun 9, 2025 -
[Phase Compilation] Part-1: PJRT extensions to implement phase compilation.
#95079 opened
Jun 9, 2025 -
Add an API to overwrite the current execution_stream_id and respect it in XLA CPU dispatch.
#95086 opened
Jun 10, 2025 -
[Phase Compilation] Part-2: Introduces xla::PjRtPhaseCompiler
#95087 opened
Jun 10, 2025 -
[Phase Compilation] Part-3: Add C++ layers to test and interact with C PJRT API.
#95089 opened
Jun 10, 2025 -
[xla:gpu] Reimplement FindBlockLevelParameters()
#95107 opened
Jun 10, 2025 -
PR #27481: [ROCm] enable hidden unit tests on rocm-1
#95115 opened
Jun 10, 2025 -
Adjust passes that deal with aliasing logic to have a callback.
#95119 opened
Jun 10, 2025 -
Replace outdated select() on --cpu in tensorflow/BUILD and related files with platform API equivalent.
#95120 opened
Jun 10, 2025 -
[XLA:benchmarks] Add a filter to skip blocking performance presubmit
#95132 opened
Jun 10, 2025 -
Reverts 18d3864ae7b5951d10389fcffd15780b544f6cd1
#95156 opened
Jun 10, 2025 -
Enhance hadamard_rotation_test before optimizing rotation algorithm.
#95157 opened
Jun 10, 2025 -
Register ChloDialect in tf_tfl_translate.cc
#95158 opened
Jun 10, 2025 -
Enable subgraph reshaping by default in XNNPACK delegate
#95173 opened
Jun 11, 2025 -
Add AliasHints class that will replace the separate alias hint hooks.
#95183 opened
Jun 11, 2025 -
[XLA:GPU] Legalize dot precision into casts+algorithm.
#95188 opened
Jun 11, 2025 -
Commented out the legacy test files. They are not buildable.
#95192 opened
Jun 11, 2025 -
Changing the evaluator to add support for a custom call
#95195 opened
Jun 11, 2025 -
Allow TFL to TOSA pipeline to load TFL dialect
#95205 opened
Jun 11, 2025 -
Refactor `custom_call` to use common `FindCudaExecutable` method from XLA repository to find CUDA binaries.
#95208 opened
Jun 11, 2025 -
Port recent MHLO changes to StableHLO optimization path.
#95210 opened
Jun 11, 2025 -
Break sparse tensors to find users
#95213 opened
Jun 11, 2025 -
[XLA:benchmarks] Test onboard a new hlo from repo path
#95215 opened
Jun 11, 2025 -
add dynamic registration helper
#95216 opened
Jun 11, 2025 -
Reserve the last 100 custom XPlane IDs for NCCL Net Plugin.
#95217 opened
Jun 11, 2025 -
#HLODiff Add a BipartiteTopDownMatcher after strict GreedyTopDownMatcher
#95219 opened
Jun 11, 2025 -
Integrate LLVM at llvm/llvm-project@02550da93291
#95234 opened
Jun 12, 2025 -
Simplify MultiKernelLoaderSpec
#95260 opened
Jun 12, 2025 -
[PROTOTYPE] Cleanup TFL dependencies in tosa
#95262 opened
Jun 12, 2025 -
Add the method to Autotune HloModule.
#95264 opened
Jun 12, 2025 -
Split `ImplicitArithOpBuilder` into its own target.
#95265 opened
Jun 12, 2025 -
[XLA:CPU] Add pass to rewrite f32 <-> bf16 conversions.
#95268 opened
Jun 12, 2025 -
Introduce repo environment variable CUDA_EXTRA_COPTS
#95269 opened
Jun 12, 2025 -
[XLA:CPU] Disable loop unrolling for certain reduce operations.
#95271 opened
Jun 12, 2025 -
[XLA:benchmarks] Add README guide for onboarding new benchmarks to OpenXLA.
#95275 opened
Jun 12, 2025 -
Implement HLO to Shardy transformation.
#95277 opened
Jun 12, 2025 -
Use TensorShape instead of RuntimeShape in TF.
#95278 opened
Jun 12, 2025 -
[XLA:benchmarks] Upload performance regression in presubmit to GCS buckets for tracking
#95280 opened
Jun 12, 2025 -
[xla:gpu] Nest gemm fusion: only hoist bitcasts upwards.
#95282 opened
Jun 12, 2025 -
Correct the int type of `output_id` in NanoRT IFRT Client.
#95286 opened
Jun 12, 2025 -
Remove old heartbeat flags and arguments.
#95293 opened
Jun 12, 2025 -
Remove old heartbeat options.
#95294 opened
Jun 12, 2025 -
Set heartbeat_timeout argument and flag.
#95295 opened
Jun 12, 2025 -
Use heartbeat_timeout argument.
#95296 opened
Jun 12, 2025 -
Test fusion model for an operand not belonging to fusion op.
#95297 opened
Jun 12, 2025 -
Upgrade TF_SYSROOT for rbe_linux_cpu to /dt10
#95299 opened
Jun 13, 2025 -
Change ownership of the file descriptor from the weight cache builder to the provider.
#95300 opened
Jun 13, 2025 -
Test CPU build
#95301 opened
Jun 13, 2025 -
[XLA] Refactor if-else chain into a switch.
#95302 opened
Jun 13, 2025 -
Add Hermetic C++ Toolchains for XLA project.
#95304 opened
Jun 13, 2025 -
Experiment!
#95305 opened
Jun 13, 2025 -
Support for nested while loops in while_loop_unroller.
#95306 opened
Jun 13, 2025 -
Add AcquireScopedRawBuffer(...) to CommonPjRtBuffer which
#95307 opened
Jun 13, 2025 -
Integrate LLVM at llvm/llvm-project@8890706db673
#95319 opened
Jun 13, 2025 -
Automated Code Change
#95320 opened
Jun 13, 2025 -
Fix compilation error in tensorflow/python/tfcompile_wrapper.cc on s390x
#95322 opened
Jun 13, 2025 -
Automated Code Change
#95325 opened
Jun 13, 2025 -
PR #27784: Use 256 byte alignment to avoid breakages starting with cublas 12.9.1.4.
#95329 opened
Jun 13, 2025 -
[XLA:CPU] Naming module memory regions after emitters that produced the modules.
#95333 opened
Jun 13, 2025 -
Fix subprocess.check_output decoding issue in pip_smoke_test.py to handle byte output safely
#95335 opened
Jun 13, 2025 -
[XLA:GPU] NFC optional separator when printing tiled HLO instruction and tiling
#95336 opened
Jun 13, 2025 -
[XLA:GPU] Flip LHS SoL by default on Hopper and supported HLOs.
#95339 opened
Jun 13, 2025 -
[XLA:GPU] Extract dynamic slicing related utils into a separate file.
#95340 opened
Jun 13, 2025 -
Migrate MultiKernelLoaderSpec users to the new APIs
#95341 opened
Jun 13, 2025 -
Remove the tfl dialect dependency from tf-tfrt-opt.
#95343 opened
Jun 13, 2025 -
[XLA:CPU] Ensure that the work splitting is done on the outer dimension
#95346 opened
Jun 13, 2025 -
[XLA:CPU] Use legacy fusion for dot fusion.
#95347 opened
Jun 13, 2025 -
[XLA:CPU][XLA:GPU] Add nuw to add/mul when affine map is lowered
#95348 opened
Jun 13, 2025 -
Split ImplicitArithOpBuilder into its own target.
#95349 opened
Jun 13, 2025 -
Extend the CUDA root candidates and add `FindNvdisasmExecutable` to `subprocess_compilation.cc`.
#95351 opened
Jun 13, 2025 -
Delete `test_macros.h` and final remaining uses
#95352 opened
Jun 13, 2025 -
Adds flags for NCCL non-blocking communicators and async execution.
#95354 opened
Jun 13, 2025 -
[xla:cpu] Deprecate API_VERSION_STATUS_RETURNING_UNIFIED custom calls
#95356 opened
Jun 13, 2025 -
Only abort collectives on failure.
#95359 opened
Jun 13, 2025 -
Load PyInfo from rules_python (Attempt 2)
#95360 opened
Jun 13, 2025 -
Add `ShouldWarmupAllBatchSizes` overload that accepts name/version directly
#95361 opened
Jun 13, 2025 -
na na na
#95362 opened
Jun 13, 2025 -
[XLA][Numerics][HLO Value Tracking] Deduplicate original values in an HLO module
#95366 opened
Jun 14, 2025 -
Integrate LLVM at llvm/llvm-project@2c440232e261
#95367 opened
Jun 14, 2025 -
[PjRt] Block on external references when `CommonPjRtBuffer` is destroyed
#95368 opened
Jun 14, 2025 -
Automated Code Change
#95373 opened
Jun 14, 2025 -
Automated Code Change
#95374 opened
Jun 14, 2025 -
Automated Code Change
#95376 opened
Jun 14, 2025 -
Automated Code Change
#95378 opened
Jun 14, 2025 -
Automated Code Change
#95379 opened
Jun 14, 2025 -
Automated Code Change
#95380 opened
Jun 14, 2025 -
Automated Code Change
#95382 opened
Jun 14, 2025 -
Automated Code Change
#95383 opened
Jun 14, 2025 -
Automated Code Change
#95384 opened
Jun 14, 2025 -
Automated Code Change
#95385 opened
Jun 14, 2025
6 Issues closed by 5 people
-
Looking for the the reasoning speed comparison of different reaasoning frameworks
#94534 closed
Jun 12, 2025 -
Unexpected UnicodeDecodeError: invalid continuation byte when reading lines from a file
#27537 closed
Jun 10, 2025 -
Bug: Chained HashedCrossing in TF 2.16.2 results in (None, D) vs (batch_size, D) input shapes
#93830 closed
Jun 10, 2025 -
Build error in tensorflow lite minimal example
#70730 closed
Jun 9, 2025
4 Issues opened by 4 people
-
Deeplabcut issue
#95274 opened
Jun 12, 2025 -
Some sorting related ops produce results inconsistent with NumPy when tensor contains NaN
#95235 opened
Jun 12, 2025 -
how to build libtensorflowlite_c.so with Address Sanitizer
#95222 opened
Jun 12, 2025 -
TensorFlow disables SwiftUI Previews
#95106 opened
Jun 10, 2025
60 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
build(aarch64): Update to oneDNN-3.7 + ACL-24.12 (fix)
#93951 commented on
Jun 12, 2025 • 5 new comments -
Fix comparison functions and add unit tests
#94484 commented on
Jun 12, 2025 • 1 new comment -
Fix: Ensure boolean_mask_v2() only accepts boolean dtype for mask
#89370 commented on
Jun 11, 2025 • 0 new comments -
[ONEDNN] upgrading onednn 3.7
#90388 commented on
Jun 11, 2025 • 0 new comments -
Optimize `xla::HloSharding::PartialTile` by using vectorized highway sort `hwy::VQSort`.
#92165 commented on
Jun 11, 2025 • 0 new comments -
Major deps update:
#92241 commented on
Jun 14, 2025 • 0 new comments -
TfLite elementwise_ops add type support (#104)
#92706 commented on
Jun 11, 2025 • 0 new comments -
Only abort collectives with failed tasks.
#93273 commented on
Jun 12, 2025 • 0 new comments -
Allow Xprof frontend to use the query parameter `use_saved_result=False` which will skip the intermediate analysis and regenerate the tool data from XSpace
#93411 commented on
Jun 13, 2025 • 0 new comments -
Stable delegate python api
#93850 commented on
Jun 12, 2025 • 0 new comments -
[NCCL] Upgrade TF NCCL version to 2.26.5
#94053 commented on
Jun 10, 2025 • 0 new comments -
This change introduces a new mechanism for tracking CUDA graph events within the profiling system.
#94057 commented on
Jun 11, 2025 • 0 new comments -
Enable Stablehlo -> HLO lowering by default.
#94296 commented on
Jun 12, 2025 • 0 new comments -
Add source_target_pairs to send/recv ops in StableHLO
#94495 commented on
Jun 14, 2025 • 0 new comments -
Add hlo_module_name parameter to Compiler::CreateMetricsHook to make available the hlo module name of the recorded hlo program in the logging infrastructure.
#94649 commented on
Jun 10, 2025 • 0 new comments -
Use Hermetic C++ toolchain for Linux x86_64 builds.
#94704 commented on
Jun 13, 2025 • 0 new comments -
Added strongly typed int for `IncarnationId`.
#94761 commented on
Jun 13, 2025 • 0 new comments -
[XLA] Flatten nested tuple return shape of async computations in tuple simplifier.
#94765 commented on
Jun 9, 2025 • 0 new comments -
[XLA:CPU][tfcompile] Enable thunk runtime by default.
#94787 commented on
Jun 10, 2025 • 0 new comments -
Move converting a mixed SDY+GSPMD module down to a pure GSPMD targeting module to `MlirToXlaComputation`.
#94798 commented on
Jun 9, 2025 • 0 new comments -
derive all the lines of each device, instead of deriving events from only the line with most event.
#94829 commented on
Jun 14, 2025 • 0 new comments -
Move Mosaic into XLA
#94877 commented on
Jun 9, 2025 • 0 new comments -
PR #27412: Command buffer respect control dependency of HloInstruction when running with concurrent mode.
#94883 commented on
Jun 12, 2025 • 0 new comments -
Refactor CUPTI callback ID logic to cupti_tracer.
#94898 commented on
Jun 11, 2025 • 0 new comments -
Add test and refactor Device Assignment.
#94904 commented on
Jun 10, 2025 • 0 new comments -
Implementation of phase-compilation using PJRT Extensions
#94922 commented on
Jun 9, 2025 • 0 new comments -
Make `Thunk::ToProto()` return an error if serialization is not implemented
#94930 commented on
Jun 10, 2025 • 0 new comments -
Integrate Triton up to [](https://github.com/openai/triton/commits/)
#94937 commented on
Jun 10, 2025 • 0 new comments -
[XLA] Add stack trace breakdown to `HloLiveRange::ToString` for peak memory usage
#94954 commented on
Jun 10, 2025 • 0 new comments -
Use raw string literals for regex patterns in kernel test assertions
#95005 commented on
Jun 12, 2025 • 0 new comments -
Adding TensorFlow Hub KerasLayer to Sequential Model Raises ValueError
#63849 commented on
Jun 8, 2025 • 0 new comments -
`../tensorflow/third_party/xla/third_party/tsl/tsl/platform/ml_dtypes.h:19:10: error: 'ml_dtypes/include/float8.h' file not found [clang-diagnostic-error]`
#93130 commented on
Jun 9, 2025 • 0 new comments -
tf.data.Dataset .map().batch() pattern is not matched to use fused implementation.
#53572 commented on
Jun 9, 2025 • 0 new comments -
Segmentation fault in tf.sets.size
#94863 commented on
Jun 9, 2025 • 0 new comments -
Pybind11 Exception
#60534 commented on
Jun 9, 2025 • 0 new comments -
tf.data.experimental.prefetch_to_device has no effect inside tf.distribute.Strategy.distribute_datasets_from_function.
#94735 commented on
Jun 9, 2025 • 0 new comments -
TensorFlow Docker `tensorflow/tensorflow:latest-gpu` fails to detect GPU due to CUDA/cuDNN mismatch
#94593 commented on
Jun 10, 2025 • 0 new comments -
Crash in `tf.raw_ops.BiasAdd` when executing on GPU
#94379 commented on
Jun 10, 2025 • 0 new comments -
Muting Tensorflow Lite logs
#92216 commented on
Jun 10, 2025 • 0 new comments -
tf.transpose crashes with negative perm value: "Check failed: d >= 0 (0 vs. -1)"
#94433 commented on
Jun 11, 2025 • 0 new comments -
16KB pagination support for TF Lite Select Ops
#94048 commented on
Jun 11, 2025 • 0 new comments -
Enhance Memory Optimizer with Dynamic Cost Model for Operation Recomputation
#94653 commented on
Jun 11, 2025 • 0 new comments -
Support/Feature Request: Pre-processing very large corpus text file as tokens to train GPT Models.
#60539 commented on
Jun 11, 2025 • 0 new comments -
tf.linalg.matrix_rank results has different results with or without @tf.function for numpy inputs under tensorflow-cpu
#60547 commented on
Jun 11, 2025 • 0 new comments -
TensorFlow DLL failed to load with newer version of TF
#91656 commented on
Jun 11, 2025 • 0 new comments -
crash when two model parallel inference in two instances using libtensorflowlite_c.so and run delegate gpu opencl
#94274 commented on
Jun 12, 2025 • 0 new comments -
AttributeError with Protobuf >= 6.30
#94030 commented on
Jun 12, 2025 • 0 new comments -
rejection_resample loses track of ragged tensors
#60583 commented on
Jun 12, 2025 • 0 new comments -
[TFLite] flatbuffer64 support for TFlite
#60570 commented on
Jun 12, 2025 • 0 new comments -
Weird memory usage of shuffling in `tf.data.Dataset`
#60599 commented on
Jun 12, 2025 • 0 new comments -
control_flow_ops_test unit test is flaky
#60629 commented on
Jun 12, 2025 • 0 new comments -
graph execution error bug with tfm.nlp.layers.MultiHeadRelativeAttention
#94599 commented on
Jun 13, 2025 • 0 new comments -
java.lang.IllegalArgumentException: Internal error: Error applying delegate:
#93525 commented on
Jun 13, 2025 • 0 new comments -
Mismatch Between Quantized TFLite Layer Outputs and Expected Mathematical Values When Using get_tensor()
#93917 commented on
Jun 13, 2025 • 0 new comments -
cuDNN, cuFFT, and cuBLAS Errors
#62075 commented on
Jun 13, 2025 • 0 new comments -
How to run Android demo which uses NPU to inference?
#94853 commented on
Jun 13, 2025 • 0 new comments -
Numpy and tf experimental Numpy differ in vander matrix creation case for N=0
#60628 commented on
Jun 13, 2025 • 0 new comments -
tensorflow-macos still required for version 2.16.1
#63495 commented on
Jun 14, 2025 • 0 new comments -
Tensorflow is aborting with CompositeTensorVariant already registered
#94709 commented on
Jun 14, 2025 • 0 new comments -
Fix compile error in tensorflow/python/tfcompile_wrapper.cc on s390x
#87676 commented on
Jun 11, 2025 • 0 new comments